tantan的博客
Notes, ideas, and observations
首页
分类
28
归档
160
关于
搜索
Megatron-LM
标签
2025
04-05
Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism 论文阅读
0%
Theme NexT works best with JavaScript enabled