Pull requests: NVIDIA/TransformerEngine

Pull requests list

[PyTorch] Support single parameter for GroupedLinear
#2731 opened Mar 4, 2026 by ksivaman
9 of 13 tasks
fix: scope get_full_cu_seqlens cache key by device and inference mode
#2728 opened Mar 3, 2026 by DmCarpe93
8 of 13 tasks
[CI] Refactor CI build on GitHub
#2723 opened Mar 2, 2026 by ptrendx Draft
1 of 13 tasks
[Common, pyTorch] Grouped MXFP8 dequantize support
#2722 opened Mar 2, 2026 by ptrendx Draft
1 of 13 tasks
Add MXFP8 attention
#2719 opened Mar 1, 2026 by cyanguwa Draft
13 tasks
pass params_dtype to qk_norm creation
#2718 opened Feb 28, 2026 by pstjohn
Enable dequantization from MXFP8 tensor with only columnwise data
#2712 opened Feb 26, 2026 by ptrendx
13 tasks
[JAX] Support calling MOE router kernels from JAX side
#2711 opened Feb 26, 2026 by tdophung
1 of 13 tasks
[Draft] Newton-Schulz via cuSOLVERMp
#2706 opened Feb 25, 2026 by vcherepanov-nv
6 of 13 tasks
[All] Added better error messages
#2705 opened Feb 25, 2026 by ptrendx
[Draft][PyTorch] torch.compile support for TE Linear
#2701 opened Feb 24, 2026 by pggPL Draft
13 tasks
[PyTorch] Zero-initialize learnable softmax_offset in DotProductAttention
#2694 opened Feb 20, 2026 by fjosw
7 of 13 tasks
[PyTorch] Error out if constructing LayerNormLinear with row tensor parallelism (label: bug)
#2688 opened Feb 17, 2026 by timmoon10
6 of 13 tasks
[PyTorch] torch.compile support for permutation functions
#2686 opened Feb 17, 2026 by pggPL
9 of 13 tasks