论文标题

专利文本中的高nym和替象检索的技术分类法

Technological taxonomies for hypernym and hyponym retrieval in patent texts

论文作者

Zuo, You, Li, Yixuan, García, Alma Parias, Gerdes, Kim

论文摘要

本文提出了一种根据合作专利分类(CPC)创建技术术语分类法的自动方法。由此产生的分类法包含9个单独的技术分支中的约170k节点,并且可以自由使用。我们还表明,可以对文本转换变压器(T5)模型进行微调,以生成具有相对较高精度的高音和huse词,从而确认了资源的手动评估质量。 T5模型将分类学开放给可以生成高nym的任何新技术术语,从而使资源可以使用新术语更新,这是技术术语不断发展的领域的重要特征。

This paper presents an automatic approach to creating taxonomies of technical terms based on the Cooperative Patent Classification (CPC). The resulting taxonomy contains about 170k nodes in 9 separate technological branches and is freely available. We also show that a Text-to-Text Transfer Transformer (T5) model can be fine-tuned to generate hypernyms and hyponyms with relatively high precision, confirming the manually assessed quality of the resource. The T5 model opens the taxonomy to any new technological terms for which a hypernym can be generated, thus making the resource updateable with new terms, an essential feature for the constantly evolving field of technological terminology.

扫码加入交流群

加入微信交流群

微信交流群二维码

扫码加入学术交流群,获取更多资源