Paper Title

Unsupervised Embedding-based Detection of Lexical Semantic Changes

Authors

Ehsaneddin Asgari, Christoph Ringlstetter, Hinrich Schütze

Abstract

This paper describes EmbLexChange, a system introduced by the "Life-Language" team for SemEval-2020 Task 1 on unsupervised detection of lexical-semantic changes. EmbLexChange is defined as the divergence between the embedding-based profiles of a word w (calculated with respect to a set of reference words) in the source and target domains; the two domains can simply be two time frames t1 and t2. The underlying assumption is that a lexical-semantic change of w affects the words it co-occurs with and thereby alters its neighborhood in the embedding space. We show that, using a resampling framework for the selection of reference words, we can reliably detect lexical-semantic changes in English, German, Swedish, and Latin. EmbLexChange achieved second place in the binary detection of semantic changes at SemEval-2020.
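
The idea in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation. The abstract only says that a word's profile is computed relative to reference words, that profiles from the two domains are compared by a divergence, and that reference words are selected by resampling. The softmax-over-cosine profile, the KL divergence, the averaging over resampling rounds, and every name here (`profile`, `change_score`, `n_refs`, `n_rounds`) are assumptions made for the example, and the 2-dimensional vectors are invented toy data.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def profile(word, refs, emb):
    """Assumed profile: softmax over cosine similarities to the reference words."""
    sims = [cosine(emb[word], emb[r]) for r in refs]
    top = max(sims)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in sims]
    total = sum(exps)
    return [e / total for e in exps]


def kl_divergence(p, q):
    """KL(p || q); softmax outputs are strictly positive, so this is finite."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q))


def change_score(word, emb_t1, emb_t2, candidates, n_refs=3, n_rounds=50, seed=0):
    """Average profile divergence over randomly resampled reference sets."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n_rounds):
        refs = rng.sample(candidates, n_refs)  # one resampled reference set
        p = profile(word, refs, emb_t1)  # profile in the source domain (t1)
        q = profile(word, refs, emb_t2)  # profile in the target domain (t2)
        scores.append(kl_divergence(p, q))
    return sum(scores) / len(scores)


# Toy data (hypothetical): "w" moves between t1 and t2, "stable" does not.
emb_t1 = {"w": [1.0, 0.0], "stable": [1.0, 0.0],
          "a": [1.0, 0.1], "b": [0.0, 1.0], "c": [0.7, 0.7]}
emb_t2 = {"w": [0.0, 1.0], "stable": [1.0, 0.0],
          "a": [1.0, 0.1], "b": [0.0, 1.0], "c": [0.7, 0.7]}
candidates = ["a", "b", "c"]

score_w = change_score("w", emb_t1, emb_t2, candidates)
score_stable = change_score("stable", emb_t1, emb_t2, candidates)
```

In this toy setup the reference words keep the same vectors in both domains, so only the movement of the target word changes its profile. With real data the two embedding spaces would be trained separately, and comparing profiles relative to shared reference words is what makes the scores comparable without aligning the spaces directly. A binary decision could then be made by thresholding the score, though the threshold choice is not described in the abstract.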
