期刊文献+
共找到7篇文章
< 1 >
每页显示 20 50 100
基于倒谱距离窗移最小失真分割的语种辨识 被引量:2
1
作者 缪炜 侯丽敏 《上海大学学报(自然科学版)》 CAS CSCD 北大核心 2007年第2期116-120,共5页
提出一种语种辨识的新方法.采用一种无需对语音文件进行标注的方法,提出基于倒谱距离窗移最小失真分割子词,在语种辨识前端用子词的自动分割方法把语音信号分割成许多子词.对得到的所有子词进行聚类并对每一类建立一个隐马尔可夫模型(HM... 提出一种语种辨识的新方法.采用一种无需对语音文件进行标注的方法,提出基于倒谱距离窗移最小失真分割子词,在语种辨识前端用子词的自动分割方法把语音信号分割成许多子词.对得到的所有子词进行聚类并对每一类建立一个隐马尔可夫模型(HMM),最后利用得到的所有的子词模型对输入语音进行语种辨识.实验表明,该方法是一种简洁而且有效的语种辨识方法. 展开更多
关键词 隐马尔可夫模型 语种辨识 分割
下载PDF
中文专利侵权检索模型研究 被引量:6
2
作者 汪雪锋 刘玉琴 刘佳 《计算机工程与应用》 CSCD 北大核心 2009年第9期212-215,共4页
结合中文专利权利要求的结构特征,首次将中文专利按照"分割词"重新分类,以重新划分的"新类别"进行词性选择,构造向量空间,设计了中文专利侵权检索模型,通过我国集成电路封装技术领域的发明专利进行技术侵权检索实... 结合中文专利权利要求的结构特征,首次将中文专利按照"分割词"重新分类,以重新划分的"新类别"进行词性选择,构造向量空间,设计了中文专利侵权检索模型,通过我国集成电路封装技术领域的发明专利进行技术侵权检索实证分析与对比实验,实验结果显示该模型的检索效果明显优于一般的侵权检索方法。 展开更多
关键词 专利侵权检索 分割词 专利类别 向量空间
下载PDF
A New Word Detection Method for Chinese Based on Local Context Information 被引量:1
3
作者 曾华琳 周昌乐 郑旭玲 《Journal of Donghua University(English Edition)》 EI CAS 2010年第2期189-192,共4页
Finding out out-of-vocabulary words is an urgent and difficult task in Chinese words segmentation. To avoid the defect causing by offline training in the traditional method, the paper proposes an improved prediction b... Finding out out-of-vocabulary words is an urgent and difficult task in Chinese words segmentation. To avoid the defect causing by offline training in the traditional method, the paper proposes an improved prediction by partical match (PPM) segmenting algorithm for Chinese words based on extracting local context information, which adds the context information of the testing text into the local PPM statistical model so as to guide the detection of new words. The algorithm focuses on the process of online segmentatien and new word detection which achieves a good effect in the close or opening test, and outperforms some well-known Chinese segmentation system to a certain extent. 展开更多
关键词 new word detection improved PPM model context information Chinese words segmentation
下载PDF
Context Information and Fragments Based Cross-Domain Word Segmentation 被引量:8
4
作者 Huang Degen Tong Deqin 《China Communications》 SCIE CSCD 2012年第3期49-57,共9页
A new joint decoding strategy that combines the character-based and word-based conditional random field model is proposed.In this segmentation framework,fragments are used to generate candidate Out-of-Vocabularies(OOV... A new joint decoding strategy that combines the character-based and word-based conditional random field model is proposed.In this segmentation framework,fragments are used to generate candidate Out-of-Vocabularies(OOVs).After the initial segmentation,the segmentation fragments are divided into two classes as "combination"(combining several fragments as an unknown word) and "segregation"(segregating to some words).So,more OOVs can be recalled.Moreover,for the characteristics of the cross-domain segmentation,context information is reasonably used to guide Chinese Word Segmentation(CWS).This method is proved to be effective through several experiments on the test data from Sighan Bakeoffs 2007 and Bakeoffs 2010.The rates of OOV recall obtain better performance and the overall segmentation performances achieve a good effect. 展开更多
关键词 cross-domain CWS Conditional Ran-dem Fields(CRFs) joint decoding context variables segmentation fragments
下载PDF
Song Ci Style Automatic Identification
5
作者 郑旭玲 周昌乐 曾华琳 《Journal of Donghua University(English Edition)》 EI CAS 2010年第2期181-184,共4页
To identify Song Ci style automatically,we put forward a novel stylistic text categorization approach based on words and their semantic in this paper. And a modified special word segmentation method,a new semantic rel... To identify Song Ci style automatically,we put forward a novel stylistic text categorization approach based on words and their semantic in this paper. And a modified special word segmentation method,a new semantic relativity computing method based on HowNet along with the corresponding word sense disambiguation method are proposed to extract words and semantic features from Song Ci. Experiments are carried out and the results show that these methods are effective. 展开更多
关键词 stylistic text categorization word sense disambiguation (WSD) word segmentation HOWNET Song Ci
下载PDF
Knowledge Automatic Indexing Based on Concept Lexicon and Segm-entation Algorithm
6
作者 王兰成 蒋丹 乐嘉锦 《Journal of Donghua University(English Edition)》 EI CAS 2005年第1期26-30,共5页
This paper is based on two existing theories about automatic indexing of thematic knowledge concept. The prohibit-word table with position information has been designed. The improved Maximum Matching-Minimum Backtrack... This paper is based on two existing theories about automatic indexing of thematic knowledge concept. The prohibit-word table with position information has been designed. The improved Maximum Matching-Minimum Backtracking method has been researched. Moreover it has been studied on improved indexing algorithm and application technology based on rules and thematic concept word table. 展开更多
关键词 Concept Lexicon Segmentation Algorithm Knowledge Indexing.
下载PDF
Chinese to Braille Translation Based on Braille Word Segmentation Using Statistical Model 被引量:2
7
作者 王向东 杨阳 +3 位作者 张金超 姜文斌 刘宏 钱跃良 《Journal of Shanghai Jiaotong university(Science)》 EI 2017年第1期82-86,共5页
Automatic translation of Chinese text to Chinese Braille is important for blind people in China to acquire information using computers or smart phones. In this paper, a novel scheme of Chinese-Braille translation is p... Automatic translation of Chinese text to Chinese Braille is important for blind people in China to acquire information using computers or smart phones. In this paper, a novel scheme of Chinese-Braille translation is proposed. Under the scheme, a Braille word segmentation model based on statistical machine learning is trained on a Braille corpus, and Braille word segmentation is carried out using the statistical model directly without the stage of Chinese word segmentation. This method avoids establishing rules concerning syntactic and semantic information and uses statistical model to learn the rules stealthily and automatically. To further improve the performance, an algorithm of fusing the results of Chinese word segmentation and Braille word segmentation is also proposed. Our results show that the proposed method achieves accuracy of 92.81% for Braille word segmentation and considerably outperforms current approaches using the segmentation-merging scheme. 展开更多
关键词 Chinese Braille word segmentation perceptron algorithm TP 391.1 A
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部