论文标题
通过仿射模型转换转移学习
Transfer learning with affine model transformation
论文作者
论文摘要
监督的转移学习受到了相当大的关注,因为它有可能在数据稀缺的情况下提高机器学习的预测能力。通常,通过统计学学习域的偏移和特定于域的因素,使用了给定的一组源模型和来自目标域的数据集将预训练的模型调整为目标域。尽管这种程序和直观上合理的方法在广泛的现实应用中取得了巨大的成功,但缺乏理论基础阻碍了进一步的方法论发展。本文遵循预期平方损失最小化的原理,介绍了称为仿射模型转移的一般转移学习回归。结果表明,仿射模型传输广泛包含各种现有方法,包括基于神经特征提取器的最常见过程。此外,当前论文阐明了仿射模型转移的理论特性,例如概括误差和多余的风险。通过几个案例研究,我们证明了与仿射型转移模型分别对建模和估计域间的共同点和特定于域特异性因素的实际好处。
Supervised transfer learning has received considerable attention due to its potential to boost the predictive power of machine learning in scenarios where data are scarce. Generally, a given set of source models and a dataset from a target domain are used to adapt the pre-trained models to a target domain by statistically learning domain shift and domain-specific factors. While such procedurally and intuitively plausible methods have achieved great success in a wide range of real-world applications, the lack of a theoretical basis hinders further methodological development. This paper presents a general class of transfer learning regression called affine model transfer, following the principle of expected-square loss minimization. It is shown that the affine model transfer broadly encompasses various existing methods, including the most common procedure based on neural feature extractors. Furthermore, the current paper clarifies theoretical properties of the affine model transfer such as generalization error and excess risk. Through several case studies, we demonstrate the practical benefits of modeling and estimating inter-domain commonality and domain-specific factors separately with the affine-type transfer models.