摘要
本文分析了当前自然语言处理发展的4个特点:基于句法—语义规则的理性主义方法受到质疑,随着语料库建设和语料库语言学的崛起,大规模真实文本的处理成为自然语言处理的主要战略目标;自然语言处理中越来越多地使用机器自动学习的方法来获取语言知识;统计数学方法越来越受到重视;自然语言处理中越来越重视词汇的作用,出现了强烈的“词汇主义”的倾向。
The present paper illustrates four features of the current development of natural language processing studies: the rationalistic approaches based on syntactic-semantic rules have been oppugned and processing of large - scaled authentic texts is becoming the primary objective of natural language processing with the construction of corpora and the growing of corpus linguistics ; natural language processing acquires language knowledge by increasingly relying on automatic machine learning; more and more statistical mathematical approaches are used in natural language processing, and, the function of lexicon is being recognized increasingly and the strong "Lexicalism" tendency is to be presented.
出处
《暨南大学华文学院学报》
2006年第1期34-40,共7页
Journal of College of Chinese Language and Culture of Jinan University
关键词
自然语言处理
语料库
机器自动学习
统计数学
词汇主义
Natural Language Processing (NLP)
Corpora
Automatic Machine Learning
Statistical Mathematics
Lexicalism