Pull requests: Azure/MS-AMP
[Bug Fixed] Support MS-AMP+TE+DDP and MS-AMP+TE+DeepSpeed
#144
by wkcn
was merged Dec 14, 2023
Support writing optimizer checkpoint only on rank0 and make UT pass on A100
#142
by tocean
was merged Mar 1, 2024
[Bugfix] LinearReplacer.replace(linear, Dtypes.kbfloat16) raises error.
#136
by wkcn
was merged Nov 29, 2023
[Bugfix] when parameters has no grad or ScalingParameter has no is_meta property it will crash
#135
by tocean
was merged Nov 30, 2023
Bump axios, @docusaurus/core, @docusaurus/preset-classic and @docusaurus/theme-search-algolia in /website
dependencies
#134
by dependabot (bot)
was closed Feb 21, 2024
Optimize performance by fuse adding high precision tensor to fp8 tensor
#132
by tocean
was merged Nov 24, 2023
[Bug Fixed] scale=INF when casting a tensor to scaling FP32/BF16 tensors
#131
by wkcn
was merged Nov 28, 2023
add the bibtex of "FP8-LM: Training FP8 Large Language Models" in README
#113
by wkcn
was merged Nov 1, 2023
[Bug Fixed] The scaling weight is not updated in the optimizer LBAdamW
#112
by wkcn
was merged Nov 1, 2023