摘要
Currently, very little reference material can be found on the research of non-login word recognition. Solu-tions based on rules and syntaxes can't satisfactorily solve all kinds of problems of non-login word recognition. Thispaper will study and compare several existing solutions. The proposed solution is to extract N-grams after words sep-aration, from which non-login words can be extracted by means of probability statistics. Experiments have demon-strated that this method has favorable efficiency, recall ratio, and accuracy.
Currently, very little reference material can be found on the research of non-login word recognition. Solutions based on rules and syntaxes can't satisfactorily solve all kinds of problems of non-login word recognition. This paper will study and compare several existing solutions. The proposed solution is to extract N-grams after words separation, from which non-login words can be extracted by means of probability statistics. Experiments have demonstrated that this method has favorable efficiency, recall ratio, and accuracy.
出处
《计算机科学》
CSCD
北大核心
2002年第12期155-156,共2页
Computer Science