Abstract
Computational Linguistics (CL) has attracted growing interest in both academia and industry in recent years, mainly because of its increasingly broad use in Internet applications such as search engines, online translation systems, and social networks. Almost all CL techniques can find a use in Internet applications, from fundamental techniques such as morphological, syntactic, and semantic analysis to applied techniques such as question answering, machine translation, and summarization. However, conventional CL methodologies need to be adapted and improved to handle large-scale, highly noisy web data and the requirements of real Internet applications. This paper discusses CL research in the context of the Internet, including new applications, new resources, new challenges, and new methodologies.
Authors
王海峰
赵世奇
WANG Haifeng, ZHAO Shiqi (Baidu Inc., Beijing 100085, China)
Source
《智能计算机与应用》
2011, Issue 1X, pp. 8-12, 23 (6 pages in total)
Intelligent Computer and Applications
Keywords
Computational Linguistics
Internet
Web