Skip to content

Pull requests: pytorch/torchtitan

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[graph_trainer] Fix H100 CI failure from DeepEP compilation break ciflow/h100.8 Trigger H100.8 CI ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3390 opened May 17, 2026 by SherlockNoMad Contributor Loading…
2 tasks
[MoE][6/n] Extract local_reorder, split DeepEP/HybridEP dispatchers ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3389 opened May 17, 2026 by acisseJZhong Contributor Loading…
3 tasks
Add xpugraph pass for graph trainer ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3388 opened May 17, 2026 by jemitche1 Draft
[MoE][5/n] Refactor MoE to clean DTensor boundaries for shared/routed experts ciflow/rl ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3386 opened May 17, 2026 by acisseJZhong Contributor Loading…
4 tasks done
Update ciflow/rl ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3385 opened May 17, 2026 by acisseJZhong Contributor Loading…
Fix memory snapshot pickle protocol for compatibility with memory visualizers ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3375 opened May 16, 2026 by francesco-bertolotti Contributor Loading…
Re-enable compile for gpt-oss integration tests ciflow/h100.8 Trigger H100.8 CI ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3373 opened May 15, 2026 by aditvenk Contributor Loading…
[qwen3_5] evolve qwen3_vl to qwen3_5 ciflow/rl ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3371 opened May 15, 2026 by shuhuayu Contributor Loading…
[graph_trainer] Use separate EP process groups for overlap ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3369 opened May 15, 2026 by sanketpurandare Contributor Loading…
[notforland] bitwise infra ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3366 opened May 15, 2026 by pianpwk Contributor Draft
[rl] fix CI timeout issue by properly tear down for vllm engine V2 ciflow/rl ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3365 opened May 15, 2026 by wwwjn Contributor Loading…
to_local() for ChunkedCELoss + no LP ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3364 opened May 15, 2026 by pianpwk Contributor Loading…
[graph_trainer] Add EP overlap eager chunking scaffolding ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3363 opened May 15, 2026 by sanketpurandare Contributor Loading…
[graph_trainer] Support hinted symbolic input dims in tracing ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3362 opened May 15, 2026 by sanketpurandare Contributor Loading…
[graph_trainer] Add DeepSeek V3 16B SDPA config ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3361 opened May 15, 2026 by sanketpurandare Contributor Loading…
Enables HybridEP torch.compile support and adds eager integration tests ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3360 opened May 14, 2026 by syed-ahmed Collaborator Loading…
[rl] Route RL CI with pytest markers ciflow/rl ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3357 opened May 14, 2026 by felipemello1 Contributor Loading…
Fix optimizer state and module state coupling ciflow/h100.8 Trigger H100.8 CI ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3356 opened May 14, 2026 by tugsbayasgalan Contributor Loading…
[graph_trainer] Defer cudagraph compatibility check to pass execution… ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3355 opened May 14, 2026 by IvanKobzarev Contributor Loading…
[graph_trainer] Add bucketing ops to precompile serialization filter ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3354 opened May 14, 2026 by IvanKobzarev Contributor Loading…
[graph_trainer] Make JointManualOverlapScheduler standalone with direct node moves ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3352 opened May 14, 2026 by SherlockNoMad Contributor Draft
1 of 3 tasks
Scope FlexShard recompute policy by provenance ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3349 opened May 14, 2026 by weifengpy Contributor Draft
Support grouped FlexShard reshard-after-forward buckets ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3348 opened May 14, 2026 by weifengpy Contributor Draft
[rl] Add Batcher in RL Loop ciflow/rl ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3347 opened May 14, 2026 by wwwjn Contributor Loading…
[graph_trainer] Rework full_inductor_compilation_pass via regional_inductor + CPU attr migration ciflow/8gpu CLA Signed This label is managed by the Meta Open Source bot.
#3346 opened May 13, 2026 by tugsbayasgalan Contributor Loading…
ProTip! Filter pull requests by the default branch with base:main.