论文标题
衡量基于位置的社交媒体的相对意见:2016年美国总统选举的案例研究
Measuring relative opinion from location-based social media: A case study of the 2016 U.S. presidential election
论文作者
论文摘要
社交媒体已成为公众舆论收集的意见民意调查的一种新兴替代品,而它仍然作为被动数据来源(例如无结构性,量化性和代表性)提出了许多挑战。带有地理标签的社交媒体数据提供了新的机会,可以揭示用户表达意见的地理位置。本文旨在回答两个问题:1)是否可以从社交媒体中获得对公众舆论的量化测量,以及与意见民意调查相比,它是否可以产生更好的或补充措施。这项研究提出了一种新颖的方法,以衡量Twitter用户对公共问题的相对意见,以适应更复杂的意见结构并利用与公共问题有关的地理。为了确保这种新措施在技术上是可行的,开发了一个建模框架,包括通过采用最先进的方法来构建培训数据集并设计一种称为意见为导向的单词嵌入的新的深度学习方法。通过对为2016年美国总统大选选择的推文的案例研究,我们证明了我们相对意见方法的预测性优势,并展示了它如何帮助视觉分析和支持意见预测。尽管与民意调查相比,相对意见措施被证明更为强大,但我们的研究还表明,前者可以有利地补充意见预测中的后期。
Social media has become an emerging alternative to opinion polls for public opinion collection, while it is still posing many challenges as a passive data source, such as structurelessness, quantifiability, and representativeness. Social media data with geotags provide new opportunities to unveil the geographic locations of users expressing their opinions. This paper aims to answer two questions: 1) whether quantifiable measurement of public opinion can be obtained from social media and 2) whether it can produce better or complementary measures compared to opinion polls. This research proposes a novel approach to measure the relative opinion of Twitter users towards public issues in order to accommodate more complex opinion structures and take advantage of the geography pertaining to the public issues. To ensure that this new measure is technically feasible, a modeling framework is developed including building a training dataset by adopting a state-of-the-art approach and devising a new deep learning method called Opinion-Oriented Word Embedding. With a case study of the tweets selected for the 2016 U.S. presidential election, we demonstrate the predictive superiority of our relative opinion approach and we show how it can aid visual analytics and support opinion predictions. Although the relative opinion measure is proved to be more robust compared to polling, our study also suggests that the former can advantageously complement the later in opinion prediction.