Paper title
FST Morphology for the Endangered Skolt Sami Language
Paper authors
Paper abstract
We present advances in the development of an FST-based morphological analyzer and generator for Skolt Sami. Like other minority Uralic languages, Skolt Sami exhibits rich morphology but has little gold-standard material. Without a solid morphological analysis, this makes NLP approaches to the language difficult. The language is severely endangered, and the work presented in this paper forms part of a larger revitalization effort. Furthermore, we intersperse our description with facilitation and description practices that are not well documented in the infrastructure. Currently, the analyzer covers over 30,000 Skolt Sami words in 148 inflectional paradigms and over 12 derivational forms.