论文标题
与机器学习和数据大规模数据相关的庇护相关迁移流
Forecasting asylum-related migration flows with machine learning and data at scale
论文作者
论文摘要
2015 - 16年所谓的“难民危机”的影响继续主导欧洲的政治议程。迁移流动是突然的,意外的,使政府没有准备好,并在移民预测领域揭示了重大缺陷。迁移是一个复杂的系统,该系统由情节变化,其因果因素是相互作用,高度依赖和短暂寿命的因果因素的基础。相应地,迁移监测依赖于分散的数据,而预测关注特定迁移流的方法通常会产生不一致的结果,而结果很难在区域或全球水平上概括。 在这里,我们表明,自适应机器学习算法将官方统计数据和非传统数据源整合到大规模上可以有效地预测与庇护相关的迁移流。我们专注于全球所有原籍国国民在欧盟国家(EU)提出的庇护申请;提供相同的方法可以在任何情况下使用足够的迁移或庇护数据。 我们利用了三层数据 - 在原产地国家进行地理上的事件和互联网搜索,欧盟边境不规则穿越的发现以及目的地国家的庇护所认可率 - 有效地预测了个人庇护迁移流量,最多可以提前四个星期。我们的方法很独特,我们的方法a)监视原籍国的潜在移民驱动因素,以提早检测变化; b)单独和移动时间窗口单独的国家到国家迁移流; c)估计各个驱动因素的影响,包括滞后效应; d)提供长达四个星期的庇护申请的预测; e)评估驱动程序的模式如何随着时间的流逝而变化,以描述迁移系统的功能和变化。
The effects of the so-called "refugee crisis" of 2015-16 continue to dominate the political agenda in Europe. Migration flows were sudden and unexpected, leaving governments unprepared and exposing significant shortcomings in the field of migration forecasting. Migration is a complex system typified by episodic variation, underpinned by causal factors that are interacting, highly context dependent and short-lived. Correspondingly, migration monitoring relies on scattered data, while approaches to forecasting focus on specific migration flows and often have inconsistent results that are difficult to generalise at the regional or global levels. Here we show that adaptive machine learning algorithms that integrate official statistics and non-traditional data sources at scale can effectively forecast asylum-related migration flows. We focus on asylum applications lodged in countries of the European Union (EU) by nationals of all countries of origin worldwide; the same approach can be applied in any context provided adequate migration or asylum data are available. We exploit three tiers of data - geolocated events and internet searches in countries of origin, detections of irregular crossings at the EU border, and asylum recognition rates in countries of destination - to effectively forecast individual asylum-migration flows up to four weeks ahead with high accuracy. Uniquely, our approach a) monitors potential drivers of migration in countries of origin to detect changes early onset; b) models individual country-to-country migration flows separately and on moving time windows; c) estimates the effects of individual drivers, including lagged effects; d) provides forecasts of asylum applications up to four weeks ahead; e) assesses how patterns of drivers shift over time to describe the functioning and change of migration systems.