サーベイ: Topoopt: Co-optimizing network topology and parallelization strategy for distributed training jobs (2022)

分散深層学習論文サーベイ深層学習最適化

Wang, Weiyang, et al. "Topoopt: Co-optimizing network topology and parallelization strategy for distributed training jobs." arXiv preprint arXiv:2202.00433 (2022). [paper] 概要どんなもの? Metaにおける分散DNNトレーニングジョブの解析それを…

#TOPOOPT

2023-01-02

サーベイ: Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning

論文サーベイ深層学習分散深層学習

@article{zheng2022alpa, title={Alpa: Automating Inter-and Intra-Operator Parallelism for Distributed Deep Learning}, author={Zheng, Lianmin and Li, Zhuohan and Zhang, Hao and Zhuang, Yonghao and Chen, Zhifeng and Huang, Yanping and Wang, Y…

2022-07-12

ML Commons

深層学習

ML Commonsとは、機械学習アプリケーションのベンチマークであるMLPerfの管理を行う団体のこと。 website: github: 2022/06/14時点でのベンチマーク項目は下記。 Training Training: HPC Inference: Datacenter Inference: Edge Inference: Mobile Inference…

2022-05-26

サーベイ: ZeRO-Offload: Democratizing Billion-Scale Model Training

論文サーベイ省メモリ深層学習

@inproceedings{ren2021zero, title={$\{$ZeRO-Offload$\}$: Democratizing $\{$Billion-Scale$\}$ Model Training}, author={Ren, Jie and Rajbhandari, Samyam and Aminabadi, Reza Yazdani and Ruwase, Olatunji and Yang, Shuangyan and Zhang, Minjia a…

2022-05-24

サーベイ: Training Deep Nets with Sublinear Memory Cost

論文サーベイ省メモリ深層学習

Chen, Tianqi, et al. "Training deep nets with sublinear memory cost." arXiv preprint arXiv:1604.06174 (2016). @article{chen2016training, title={Training deep nets with sublinear memory cost}, author={Chen, Tianqi and Xu, Bing and Zhang, Ch…

2022-05-20

サーベイ: GPUメモリ管理の実行時最適化による大規模深層学習の高速化 (2018)

論文サーベイ深層学習省メモリ

@article{伊藤祐貴2018gpu, title={GPU メモリ管理の実行時最適化による大規模深層学習の高速化}, author={伊藤祐貴 and 今井晴基 and 根岸康 and 河内谷清久仁 and 松宮遼 and 遠藤敏夫 and others}, journal={研究報告ハイパフォーマンスコンピューティン…

Sabrou-mal サブロウ丸

主にプログラミングと数学