期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
A New Part-of-Speech Tagging System Based on Closed-words,Word Form and Rules
1
作者 吴岩 李修昆 王开铸 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 1999年第1期43-45,共3页
A method of part-of-speech tagging of English text based on closed-words, wold-form and rules, its abstract model and formal description of its realizing procedure are presented. Finally, an experimental example is gi... A method of part-of-speech tagging of English text based on closed-words, wold-form and rules, its abstract model and formal description of its realizing procedure are presented. Finally, an experimental example is givento illustrate the application of this method. 展开更多
关键词 part-of-speech tagging closed-words
下载PDF
Jointly Part-of-Speech Tagging and Semantic Role Labeling Using Auxiliary Deep Neural Network Model
2
作者 Yatian Shen Yubo Mai +2 位作者 Xiajiong Shen Wenke Ding Mengjiao Guo 《Computers, Materials & Continua》 SCIE EI 2020年第10期529-541,共13页
Previous studies have shown that there is potential semantic dependency between part-of-speech and semantic roles.At the same time,the predicate-argument structure in a sentence is important information for semantic r... Previous studies have shown that there is potential semantic dependency between part-of-speech and semantic roles.At the same time,the predicate-argument structure in a sentence is important information for semantic role labeling task.In this work,we introduce the auxiliary deep neural network model,which models semantic dependency between part-of-speech and semantic roles and incorporates the information of predicate-argument into semantic role labeling.Based on the framework of joint learning,part-of-speech tagging is used as an auxiliary task to improve the result of the semantic role labeling.In addition,we introduce the argument recognition layer in the training process of the main task-semantic role labeling,so the argument-related structural information selected by the predicate through the attention mechanism is used to assist the main task.Because the model makes full use of the semantic dependency between part-of-speech and semantic roles and the structural information of predicate-argument,our model achieved the F1 value of 89.0%on the WSJ test set of CoNLL2005,which is superior to existing state-of-the-art model about 0.8%. 展开更多
关键词 part-of-speech tagging semantic role labeling multi-task learning
下载PDF
Improved hidden Markov model for speech recognition and POS tagging 被引量:4
3
作者 袁里驰 《Journal of Central South University》 SCIE EI CAS 2012年第2期511-516,共6页
In order to overcome defects of the classical hidden Markov model (HMM), Markov family model (MFM), a new statistical model was proposed. Markov family model was applied to speech recognition and natural language proc... In order to overcome defects of the classical hidden Markov model (HMM), Markov family model (MFM), a new statistical model was proposed. Markov family model was applied to speech recognition and natural language processing. The speaker independently continuous speech recognition experiments and the part-of-speech tagging experiments show that Markov family model has higher performance than hidden Markov model. The precision is enhanced from 94.642% to 96.214% in the part-of-speech tagging experiments, and the work rate is reduced by 11.9% in the speech recognition experiments with respect to HMM baseline system. 展开更多
关键词 hidden Markov model Markov family model speech recognition part-of-speech tagging
下载PDF
A lexicalized second-order-HMM for ambiguity resolution in Chinese segmentation and POS tagging 被引量:1
4
作者 陈鄞 Yang Muyun Zhao Tiejun Yu Hao Li Sheng 《High Technology Letters》 EI CAS 2005年第4期346-350,共5页
Hidden Markov Model(HMM) is a main solution to ambiguities in Chinese segmentation anti POS (part-of-speech) tagging. While most previous works tot HMM-based Chinese segmentation anti POS tagging eonsult POS infor... Hidden Markov Model(HMM) is a main solution to ambiguities in Chinese segmentation anti POS (part-of-speech) tagging. While most previous works tot HMM-based Chinese segmentation anti POS tagging eonsult POS informatiou in contexts, they do not utilize lexieal information which is crucial for resoMng certain morphologieal ambiguity. This paper proposes a method which incorporates lexieal information and wider context information into HMM. Model induction anti related smoothing technique are presented in detail. Experiments indicate that this technique improves the segmentation and tagging accuracy by nearly 1%. 展开更多
关键词 hidden Markov model chinese segmentation part-of-speech tagging
下载PDF
TPOS Tagging Method Based on BiLSTM_CRF Model
5
作者 Lili Wang Ziyan Chen Hongwu Yang 《国际计算机前沿大会会议论文集》 2019年第1期501-503,共3页
Part of speech (POS) tagging determines the attributes of each word, and it is the fundamental work in machine translation, speech recognition, information retrieval and other fields. For Tibetan part-of-speech (TPOS)... Part of speech (POS) tagging determines the attributes of each word, and it is the fundamental work in machine translation, speech recognition, information retrieval and other fields. For Tibetan part-of-speech (TPOS) tagging, a tagging method is proposed based on bidirectional long short-term memory with conditional random field model (BiLSTM_CRF). Firstly, the designed TOPS tagging set and manual tagging corpus were used to get word vectors by embedding Tibetan words and corresponding TPOS tags in continuous bag-of-words (CBOW) model. Secondly, the word vectors were input into the BiLSTM_CRF model. To obtain the predictive score matrix, this model using the past input features and future input feature information respectively learned by forward long short-term memory (LSTM) and backward LSTM performs non-linear operations on the softmax layer. The prediction score matrix was input into the CRF model to judge the threshold value and calculate the sequence score error. Lastly, a Tibetan part of speech tagging model was got based on the BiLSTM_CRF model. The experimental results indicate that the accuracy of TPOS tagging model based on the BiLSTM_CRF model can reach 92.7%. 展开更多
关键词 DEEP LEARNING WORD EMBEDDING MODEL TIBETAN part-of-speech
下载PDF
A Study on Comparison of Words in Ancient and Modern Chinese and Foreign Fantasy Novels
6
作者 朱旭罡 宋肇鹏 杨昊堃 《海外英语》 2018年第9期166-168,共3页
Fantasy novel is a kind of novel literature, which is different from other fictions. With the passage of time, fantasy novels have great development with human society. There are differences and similarities between C... Fantasy novel is a kind of novel literature, which is different from other fictions. With the passage of time, fantasy novels have great development with human society. There are differences and similarities between Chinese and Western fantasy novels,which make literary works widely spread and popular. This study uses corpus linguistics software(Tree-Tagger, Range) to analyze four famous Chinese and English fantasy novels and their English translation, observing the different of proportion of noun, verb,adverb and adjective and the lexical difficulty of the novel. 展开更多
关键词 Fantasy novel part-of-speech Range I106 A
下载PDF
Research on calculation method of text similarity based on smooth inverse frequency 被引量:2
7
作者 Yuan Ye Yu Minmin Liu Jiming 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2020年第2期56-64,共9页
In order to improve the accuracy of text similarity calculation,this paper presents a text similarity function part of speech and word order-smooth inverse frequency(PO-SIF)based on sentence vector,which optimizes the... In order to improve the accuracy of text similarity calculation,this paper presents a text similarity function part of speech and word order-smooth inverse frequency(PO-SIF)based on sentence vector,which optimizes the classical SIF calculation method in two aspects:part of speech and word order.The classical SIF algorithm is to calculate sentence similarity by getting a sentence vector through weighting and reducing noise.However,the different methods of weighting or reducing noise would affect the efficiency and the accuracy of similarity calculation.In our proposed PO-SIF,the weight parameters of the SIF sentence vector are first updated by the part of speech subtraction factor,to determine the most crucial words.Furthermore,PO-SIF calculates the sentence vector similarity taking into the account of word order,which overcomes the drawback of similarity analysis that is mostly based on the word frequency.The experimental results validate the performance of our proposed PO-SIF on improving the accuracy of text similarity calculation. 展开更多
关键词 word2vec SIF part-of-speech word order similarity
原文传递
Pretrained Models and Evaluation Data for the Khmer Language
8
作者 Shengyi Jiang Sihui Fu +1 位作者 Nankai Lin Yingwen Fu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2022年第4期709-718,共10页
Trained on a large corpus,pretrained models(PTMs)can capture different levels of concepts in context and hence generate universal language representations,which greatly benefit downstream natural language processing(N... Trained on a large corpus,pretrained models(PTMs)can capture different levels of concepts in context and hence generate universal language representations,which greatly benefit downstream natural language processing(NLP)tasks.In recent years,PTMs have been widely used in most NLP applications,especially for high-resource languages,such as English and Chinese.However,scarce resources have discouraged the progress of PTMs for low-resource languages.Transformer-based PTMs for the Khmer language are presented in this work for the first time.We evaluate our models on two downstream tasks:Part-of-speech tagging and news categorization.The dataset for the latter task is self-constructed.Experiments demonstrate the effectiveness of the Khmer models.In addition,we find that the current Khmer word segmentation technology does not aid performance improvement.We aim to release our models and datasets to the community in hopes of facilitating the future development of Khmer NLP applications. 展开更多
关键词 pretrained models Khmer language word segmentation part-of-speech(POS)tagging news categorization
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部