Paper title
Towards Lithuanian grammatical error correction
Authors
Abstract
Everyone wants to write beautiful and correct text, yet a lack of language skills or experience, or hasty typing, can result in errors. Building on recent advances in transformer architectures, we construct a grammatical error correction model for Lithuanian, a language rich in archaic features. We compare subword and byte-level approaches and share our best trained model, which achieves F$_{0.5}$=0.92, together with the accompanying code in an online open-source repository.
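The abstract reports F$_{0.5}$ without saying how it is computed. The sketch below shows the standard F-beta definition, in which beta = 0.5 weights precision more heavily than recall, a common choice in grammatical error correction. It assumes scoring from counts of correct, spurious, and missed edits. The function name `f_beta` and the example counts are hypothetical and are not taken from the paper.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """Standard F-beta score from edit counts (illustrative helper).

    tp: proposed edits that match a reference edit
    fp: proposed edits that match no reference edit
    fn: reference edits the system failed to propose
    beta < 1 favours precision; beta = 1 gives the usual F1.
    """
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta ** 2
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical counts: 90 correct edits, 10 spurious, 30 missed.
score = f_beta(tp=90, fp=10, fn=30)
print(f"F0.5 = {score:.4f}")
```

With these made-up counts, precision is 0.90 and recall is 0.75, and the F$_{0.5}$ value lies closer to precision than F$_1$ would, which is the reason for choosing beta below 1: a spurious correction is penalized more than a missed one.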