论文标题
作为社交网络的刮擦:划痕项目中的主题建模和情感分析
Scratch as Social Network: Topic Modeling and Sentiment Analysis in Scratch Projects
论文作者
论文摘要
诸如Black Lives Matter(BLM)运动之类的社会事务影响了软件工程,因为最近关于取代某些歧视性术语(例如白名单/黑名单)的辩论已显示。识别相关和趋势的社会事务很重要,并且经常使用社交网络分析对Twitter等传统社交媒体渠道进行。在本文中,我们探讨了是否还可以通过查看年轻的Scratch程序员的繁荣场景来探讨这种类型的分析。教育编程语言划痕不仅用于教授编程概念,还为年轻程序员提供了一个平台,以表达和分享其在任何相关性主题上的创造力。通过在106.032刮擦项目的数据集中分析标题和项目评论,我们探讨了哪些主题在Scratch社区中很常见,是否反映了与社会相关的事件以及评论中的情感如何。事实证明,刮擦项目中主题的多样性使分析过程具有挑战性。然而,我们的结果表明,尤其是流行和净文化的话题存在,即使是最近的社会事件,例如COVID-19大流行或BLM,在某种程度上也反映在从头开始。评论中的语气主要具有吸引人的青年语言是积极的。因此,尽管面临挑战,但可以以与社交网络相同的方式研究刮擦项目,这为我们开辟了新的可能性,以提高我们对新手程序员的行为和动机的理解。
Societal matters like the Black Lives Matter (BLM) movement influence software engineering, as the recent debate on replacing certain discriminatory terms such as whitelist/blacklist has shown. Identifying relevant and trending societal matters is important, and often done using social network analysis for traditional social media channels such as twitter. In this paper we explore whether this type of analysis can also be used for introspection of the software world, by looking at the thriving scene of young Scratch programmers. The educational programming language Scratch is not only used for teaching programming concepts, but offers a platform for young programmers to express and share their creativity on any topics of relevance. By analyzing titles and project comments in a dataset of 106.032 Scratch projects, we explore which topics are common in the Scratch community, whether socially relevant events are reflected and how how the sentiment in the comments is. It turns out that the diversity of topics within the Scratch projects make the analysis process challenging. Our results nevertheless show that topics from pop and net culture in particular are present, and even recent societal events such as the Covid-19 pandemic or BLM are to some extent reflected in Scratch. The tone in the comments is mostly positive with catchy youth language. Hence, despite the challenges, Scratch projects can be studied in the same way as social networks, which opens up new possibilities to improve our understanding of the behavior and motivation of novice programmers.