Skip to content

Pull requests: NVIDIA/Fuser

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

Add Blackwell MMA macros
#4079 opened Mar 14, 2025 by zasdfgbnm Loading…
TMem check the stride of outer dims
#4070 opened Mar 13, 2025 by zasdfgbnm Loading…
indexAccumulate python api
#4066 opened Mar 12, 2025 by jjsjann123 Draft
2 tasks
Update SDPA flash attention API
#4065 opened Mar 12, 2025 by Priya2698 Loading…
Add a backprop test
#4064 opened Mar 12, 2025 by wujingyue Loading…
Adding IndexAccumulateOp
#4063 opened Mar 12, 2025 by jjsjann123 Loading…
2 tasks
Simplify selfAllocationReplay
#4057 opened Mar 11, 2025 by wujingyue Draft
s/reshape/set when no transforms are applied
#4056 opened Mar 10, 2025 by wujingyue Loading…
[WIP] Basic latency tests
#4053 opened Mar 9, 2025 by csarofeen Draft
supporting vectorized load on IndexSelectOp
#4048 opened Mar 8, 2025 by jjsjann123 Loading…
2 tasks
WIP
#4038 opened Mar 7, 2025 by zasdfgbnm Draft
WIP
#4037 opened Mar 7, 2025 by zasdfgbnm Draft
ProTip! Type g p on any issue or pull request to go back to the pull request listing page.