Abstract
Based on a DCN with OCS, we propose a pattern-aware scheduling and fast convergence strategy for the distributed machine learning jobs. Experimental results show significant accelerations for completion time and convergence of the jobs.
© 2021 The Author(s)
PDF Article | Presentation Video