论文标题
持续深度学习:将模型投入生产的工作流程
Continuous Deep Learning: A Workflow to Bring Models into Production
论文作者
论文摘要
研究人员非常活跃地研究了经典的机器学习工作流程并整合了软件工程生命周期的最佳实践。但是,深度学习表现出这种概念开发过程中尚未涵盖的偏差。这包括专用硬件,可配置功能工程,广泛的超参数优化,大规模数据管理和模型压缩的要求,以减少尺寸和推理潜伏期。深度学习的个体问题正在彻底检查中,许多概念和实施都受到了关注。不幸的是,完整的端到端开发过程仍然未指定。在本文中,我们定义了一个详细的深度学习工作流程,该工作流程结合了古典机器学习工作流程基线上的上述特征。我们通过使用市场上的一些最新技术来构建原型深度学习系统,将概念思想进一步转移到实践中。为了检查工作流程的可行性,将两种用例应用于原型。
Researchers have been highly active to investigate the classical machine learning workflow and integrate best practices from the software engineering lifecycle. However, deep learning exhibits deviations that are not yet covered in this conceptual development process. This includes the requirement of dedicated hardware, dispensable feature engineering, extensive hyperparameter optimization, large-scale data management, and model compression to reduce size and inference latency. Individual problems of deep learning are under thorough examination, and numerous concepts and implementations have gained traction. Unfortunately, the complete end-to-end development process still remains unspecified. In this paper, we define a detailed deep learning workflow that incorporates the aforementioned characteristics on the baseline of the classical machine learning workflow. We further transferred the conceptual idea into practice by building a prototypic deep learning system using some of the latest technologies on the market. To examine the feasibility of the workflow, two use cases are applied to the prototype.