期刊文献+
共找到2篇文章
< 1 >
每页显示 20 50 100
Generating Chinese named entity data from parallel corpora 被引量:1
1
作者 Ruiji FU Bing QIN Ting LIU 《Frontiers of Computer Science》 SCIE EI CSCD 2014年第4期629-641,共13页
Annotating named entity recognition (NER) training corpora is a costly but necessary process for supervised NER approaches. This paper presents a general framework to generate large-scale NER training data from para... Annotating named entity recognition (NER) training corpora is a costly but necessary process for supervised NER approaches. This paper presents a general framework to generate large-scale NER training data from parallel corpora. In our method, we first employ a high performance NER system on one side of a bilingual corpus. Then, we project the named entity (NE) labels to the other side according to the word level alignments. Finally, we propose several strategies to select high-quality auto-labeled NER training data. We apply our approach to Chinese NER using an English-Chinese parallel corpus. Experimental results show that our approach can collect high-quality labeled data and can help improve Chinese NER. 展开更多
关键词 named entity recognition chinese named entity training data generating parallel corpora
原文传递
Fault Information Recognition for On-board Equipment of High-speed Railway Based on Multi-neural Network Collaboration
2
作者 Lu-Jie Zhou Jian-Wu Dang Zhen-Hai Zhang 《International Journal of Automation and computing》 EI CSCD 2021年第6期935-946,共12页
It is of great significance to guarantee the efficient statistics of high-speed railway on-board equipment fault information,which also improves the efficiency of fault analysis. Considering this background, this pape... It is of great significance to guarantee the efficient statistics of high-speed railway on-board equipment fault information,which also improves the efficiency of fault analysis. Considering this background, this paper presents an empirical exploration of named entity recognition(NER) of on-board equipment fault information. Based on the historical fault records of on-board equipment, a fault information recognition model based on multi-neural network collaboration is proposed. First, considering Chinese recorded data characteristics, a method of constructing semantic features and additional features based on character granularity is proposed. Then, the two feature representations are concatenated and passed into the gated convolutional layer to extract the dependencies from multiple different subspaces and adjacent characters in parallel. Next, the local features are transmitted to the bidirectional long short-term memory(BiLSTM) to learn long-term dependency information. On top of BiLSTM, the sequential conditional random field(CRF) is used to jointly decode the optimized tag sequence of the whole sentence. The model is tested and compared with other representative baseline models. The results show that the proposed model not only considers the language characteristics of on-board fault records, but also has obvious advantages on the performance of fault information recognition. 展开更多
关键词 Train control system chinese named entity recognition(NER) character feature gating mechanism bidirectional long short-term memory(BiLSTM)
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部