Abstract: Chinese pinyin, the commonly used system for romanizing Standard Chinese, is a special form of the language. Compared with Chinese characters, pinyin has an advantage in spreading Chinese culture around the world.
Abstract: This paper presents a novel word-segmentation algorithm that delimits words in Chinese natural language queries in the NChiql system, a Chinese natural language query interface to databases. Although there is a sizable literature on Chinese segmentation, existing methods cannot satisfy the particular requirements of this system. The novel word-segmentation algorithm is based on database semantics, namely the Semantic Conceptual Model (SCM) for domain-specific knowledge. Based on the SCM, the segmenter labels words with their database semantics directly, which eases disambiguation and translation (from natural language to database query) in NChiql.
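The abstract does not give the algorithm itself, so the following is only an illustrative sketch of the general idea of segmenting a query against a lexicon whose entries carry database semantics. It uses plain forward maximum matching, which is a swapped-in textbook technique and not necessarily the method used in NChiql. The lexicon contents, the label names (`TABLE`, `ATTRIBUTE`, `VALUE`), and the function name `segment` are all hypothetical.

```python
# Hypothetical SCM-style lexicon: surface word -> (semantic role, database object).
# These entries are invented for illustration; they are not from the paper.
SCM_LEXICON = {
    "学生": ("TABLE", "student"),
    "姓名": ("ATTRIBUTE", "student.name"),
    "年龄": ("ATTRIBUTE", "student.age"),
    "计算机系": ("VALUE", "student.dept"),
}


def segment(query, lexicon=SCM_LEXICON):
    """Forward maximum matching: at each position take the longest lexicon word.

    Returns a list of (word, label) pairs; characters not covered by the
    lexicon are emitted one at a time with label None.
    """
    max_len = max((len(word) for word in lexicon), default=1)
    tokens = []
    i = 0
    while i < len(query):
        # Try the longest candidate first, then shrink.
        for size in range(min(max_len, len(query) - i), 0, -1):
            word = query[i:i + size]
            if word in lexicon:
                tokens.append((word, lexicon[word]))
                break
        else:
            # No lexicon entry starts here: emit a single unlabeled character.
            size = 1
            tokens.append((query[i], None))
        i += size
    return tokens


if __name__ == "__main__":
    for word, label in segment("计算机系学生的姓名"):
        print(word, label)
```

Because each token leaves the segmenter already tagged with a table, attribute, or value, a later translation step could map tokens to query fragments without a separate lookup, which is the benefit the abstract attributes to labeling during segmentation.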