论文标题
预测网络调查中的受访者难度:基于鼠标运动功能的机器学习方法
Predicting respondent difficulty in web surveys: A machine-learning approach based on mouse movement features
论文作者
论文摘要
调查研究的一个核心目标是从受访者那里收集强大而可靠的数据。但是,尽管研究人员在设计问卷上做出了最大的努力,但受访者可能会遇到困难理解问题的意图,因此可能难以做出适当的回应。如果可以检测到这种困难,则可以使用这些知识来通过响应式问卷设计为实时干预提供信息,或者在事实后指示和纠正测量错误。在网络调查背景下,先前的研究使用了Paradata,特别是响应时间,以发现困难并帮助提高用户体验和数据质量。但是,现在可以使用受访者与鼠标进行的动作形式获得更丰富的数据源,这是对受访者 - 调查相互作用的额外且更详细的指标。本文使用机器学习技术来探讨受访者难度的鼠标跟踪数据的预测价值。我们使用有关受访者的就业历史和人口统计信息的调查中的数据,在这些调查中,我们通过实验操纵了几个问题的难度。使用从光标运动得出的功能,我们预测受访者是使用并比较几种最先进的监督学习方法来回答问题的简单或困难版本。此外,我们开发了一种个性化方法,该方法可以调整受访者的基线鼠标行为并评估其性能。对于所有三个操纵的调查问题,我们发现包括整个鼠标运动的全套具有改进的预测性能,而不是嵌套交叉验证中仅响应时间的模型。考虑鼠标运动的个体差异,导致进一步的改进。
A central goal of survey research is to collect robust and reliable data from respondents. However, despite researchers' best efforts in designing questionnaires, respondents may experience difficulty understanding questions' intent and therefore may struggle to respond appropriately. If it were possible to detect such difficulty, this knowledge could be used to inform real-time interventions through responsive questionnaire design, or to indicate and correct measurement error after the fact. Previous research in the context of web surveys has used paradata, specifically response times, to detect difficulties and to help improve user experience and data quality. However, richer data sources are now available, in the form of the movements respondents make with the mouse, as an additional and far more detailed indicator for the respondent-survey interaction. This paper uses machine learning techniques to explore the predictive value of mouse-tracking data with regard to respondents' difficulty. We use data from a survey on respondents' employment history and demographic information, in which we experimentally manipulate the difficulty of several questions. Using features derived from the cursor movements, we predict whether respondents answered the easy or difficult version of a question, using and comparing several state-of-the-art supervised learning methods. In addition, we develop a personalization method that adjusts for respondents' baseline mouse behavior and evaluate its performance. For all three manipulated survey questions, we find that including the full set of mouse movement features improved prediction performance over response-time-only models in nested cross-validation. Accounting for individual differences in mouse movements led to further improvements.