摘要
主要讨论了由三个词组成的术语的抽取问题。首先从实验语料中抽取了三个词组成的词串,利用语法规则剔除了不符合要求的三词串,最后对剩下的三词串进行了人工判别,判断其是否为术语。研究发现:1)由三个词组成的术语数量相对较少;2)所获得术语中被赋予新含义的术语占多数;3)同一领域术语间的流通性不同;4)三个以上的词组成的术语仍然存在,只是数量直线下降。
This paper mainly discussed the topic of tri-word ter^n extraction. We extracted all the linguistic strings for^ned by three words from the corpus, and filtered those illegal phrases based on the rule of grammar, and judge whether the rest of tri-word linguistic strings could be identified as terms. Our conclusions a re: 1) there are a relatively small number of tri-word terms in the corpus; 2) many tri-word terms have been given new meanings; 3) terms in the same field have different negotiability; 4) there are terms formed by more than three words in the corpus,but the number of this kind term is falling sharply.
出处
《中国科技术语》
2017年第3期10-13,共4页
CHINA TERMINOLOGY
基金
国家自然科学基金项目"基于语料库的术语自动处理关键技术研究"(J1025001)