论文标题
衡量交通预测模型的信心:技术,实验比较和针对其可行性的准则
Measuring the Confidence of Traffic Forecasting Models: Techniques, Experimental Comparison and Guidelines towards Their Actionability
论文作者
论文摘要
近年来,预测机器学习模型所表现出的不确定性量的估计已获得了巨大的动力。不确定性估计为用户提供了有关模型对预测结果的信心的增强信息。尽管此信息对用户的可信度具有固有的实用性,但在机器学习模型中可以衡量的不同类型的不确定性以及可用于量化特定模型的不确定性的不同技术的适用性,但在不同类型的不确定性方面达成了薄薄的共识。尽管与流量预测相关的信心可以极大地利用其在实际的交通管理系统中的可行性,但该主题在交通建模领域中大多不存在。这项工作旨在通过审查文献中可用的不同技术和不确定性的指标来涵盖这一缺乏研究,并通过严格地讨论如何计算出对流量预测模型的信心水平如何有助于在该研究领域工作的研究人员和从业人员。为了获得经验证据,通过不同的不确定性估计技术对在马德里(西班牙)收集的真实交通数据产生的实验结果进一步介绍了这一批判性讨论,从而总体概述了每种技术的益处和警告,它们如何彼此相比如何相比,它们如何降低未确定的数量,依赖于数量和多样性的数据,以降低质量和多样性的数据,这些数据的质量和多样性的数据均可降低。
The estimation of the amount of uncertainty featured by predictive machine learning models has acquired a great momentum in recent years. Uncertainty estimation provides the user with augmented information about the model's confidence in its predicted outcome. Despite the inherent utility of this information for the trustworthiness of the user, there is a thin consensus around the different types of uncertainty that one can gauge in machine learning models and the suitability of different techniques that can be used to quantify the uncertainty of a specific model. This subject is mostly non existent within the traffic modeling domain, even though the measurement of the confidence associated to traffic forecasts can favor significantly their actionability in practical traffic management systems. This work aims to cover this lack of research by reviewing different techniques and metrics of uncertainty available in the literature, and by critically discussing how confidence levels computed for traffic forecasting models can be helpful for researchers and practitioners working in this research area. To shed light with empirical evidence, this critical discussion is further informed by experimental results produced by different uncertainty estimation techniques over real traffic data collected in Madrid (Spain), rendering a general overview of the benefits and caveats of every technique, how they can be compared to each other, and how the measured uncertainty decreases depending on the amount, quality and diversity of data used to produce the forecasts.