期刊文献+
共找到7篇文章
< 1 >
每页显示 20 50 100
A New Part-of-Speech Tagging System Based on Closed-words,Word Form and Rules
1
作者 吴岩 李修昆 王开铸 《Journal of Harbin Institute of Technology(New Series)》 EI CAS 1999年第1期43-45,共3页
A method of part-of-speech tagging of English text based on closed-words, wold-form and rules, its abstract model and formal description of its realizing procedure are presented. Finally, an experimental example is gi... A method of part-of-speech tagging of English text based on closed-words, wold-form and rules, its abstract model and formal description of its realizing procedure are presented. Finally, an experimental example is givento illustrate the application of this method. 展开更多
关键词 part-of-speech tagging closed-words
下载PDF
Jointly Part-of-Speech Tagging and Semantic Role Labeling Using Auxiliary Deep Neural Network Model
2
作者 Yatian Shen Yubo Mai +2 位作者 Xiajiong Shen Wenke Ding Mengjiao Guo 《Computers, Materials & Continua》 SCIE EI 2020年第10期529-541,共13页
Previous studies have shown that there is potential semantic dependency between part-of-speech and semantic roles.At the same time,the predicate-argument structure in a sentence is important information for semantic r... Previous studies have shown that there is potential semantic dependency between part-of-speech and semantic roles.At the same time,the predicate-argument structure in a sentence is important information for semantic role labeling task.In this work,we introduce the auxiliary deep neural network model,which models semantic dependency between part-of-speech and semantic roles and incorporates the information of predicate-argument into semantic role labeling.Based on the framework of joint learning,part-of-speech tagging is used as an auxiliary task to improve the result of the semantic role labeling.In addition,we introduce the argument recognition layer in the training process of the main task-semantic role labeling,so the argument-related structural information selected by the predicate through the attention mechanism is used to assist the main task.Because the model makes full use of the semantic dependency between part-of-speech and semantic roles and the structural information of predicate-argument,our model achieved the F1 value of 89.0%on the WSJ test set of CoNLL2005,which is superior to existing state-of-the-art model about 0.8%. 展开更多
关键词 part-of-speech tagging semantic role labeling multi-task learning
下载PDF
Improved hidden Markov model for speech recognition and POS tagging 被引量:4
3
作者 袁里驰 《Journal of Central South University》 SCIE EI CAS 2012年第2期511-516,共6页
In order to overcome defects of the classical hidden Markov model (HMM), Markov family model (MFM), a new statistical model was proposed. Markov family model was applied to speech recognition and natural language proc... In order to overcome defects of the classical hidden Markov model (HMM), Markov family model (MFM), a new statistical model was proposed. Markov family model was applied to speech recognition and natural language processing. The speaker independently continuous speech recognition experiments and the part-of-speech tagging experiments show that Markov family model has higher performance than hidden Markov model. The precision is enhanced from 94.642% to 96.214% in the part-of-speech tagging experiments, and the work rate is reduced by 11.9% in the speech recognition experiments with respect to HMM baseline system. 展开更多
关键词 hidden Markov model Markov family model speech recognition part-of-speech tagging
下载PDF
A lexicalized second-order-HMM for ambiguity resolution in Chinese segmentation and POS tagging 被引量:1
4
作者 陈鄞 Yang Muyun Zhao Tiejun Yu Hao Li Sheng 《High Technology Letters》 EI CAS 2005年第4期346-350,共5页
Hidden Markov Model(HMM) is a main solution to ambiguities in Chinese segmentation anti POS (part-of-speech) tagging. While most previous works tot HMM-based Chinese segmentation anti POS tagging eonsult POS infor... Hidden Markov Model(HMM) is a main solution to ambiguities in Chinese segmentation anti POS (part-of-speech) tagging. While most previous works tot HMM-based Chinese segmentation anti POS tagging eonsult POS informatiou in contexts, they do not utilize lexieal information which is crucial for resoMng certain morphologieal ambiguity. This paper proposes a method which incorporates lexieal information and wider context information into HMM. Model induction anti related smoothing technique are presented in detail. Experiments indicate that this technique improves the segmentation and tagging accuracy by nearly 1%. 展开更多
关键词 hidden Markov model chinese segmentation part-of-speech tagging
下载PDF
A Study on Comparison of Words in Ancient and Modern Chinese and Foreign Fantasy Novels
5
作者 朱旭罡 宋肇鹏 杨昊堃 《海外英语》 2018年第9期166-168,共3页
Fantasy novel is a kind of novel literature, which is different from other fictions. With the passage of time, fantasy novels have great development with human society. There are differences and similarities between C... Fantasy novel is a kind of novel literature, which is different from other fictions. With the passage of time, fantasy novels have great development with human society. There are differences and similarities between Chinese and Western fantasy novels,which make literary works widely spread and popular. This study uses corpus linguistics software(Tree-Tagger, Range) to analyze four famous Chinese and English fantasy novels and their English translation, observing the different of proportion of noun, verb,adverb and adjective and the lexical difficulty of the novel. 展开更多
关键词 Fantasy novel part-of-speech Range I106 A
下载PDF
Research on calculation method of text similarity based on smooth inverse frequency 被引量:2
6
作者 Yuan Ye Yu Minmin Liu Jiming 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2020年第2期56-64,共9页
In order to improve the accuracy of text similarity calculation,this paper presents a text similarity function part of speech and word order-smooth inverse frequency(PO-SIF)based on sentence vector,which optimizes the... In order to improve the accuracy of text similarity calculation,this paper presents a text similarity function part of speech and word order-smooth inverse frequency(PO-SIF)based on sentence vector,which optimizes the classical SIF calculation method in two aspects:part of speech and word order.The classical SIF algorithm is to calculate sentence similarity by getting a sentence vector through weighting and reducing noise.However,the different methods of weighting or reducing noise would affect the efficiency and the accuracy of similarity calculation.In our proposed PO-SIF,the weight parameters of the SIF sentence vector are first updated by the part of speech subtraction factor,to determine the most crucial words.Furthermore,PO-SIF calculates the sentence vector similarity taking into the account of word order,which overcomes the drawback of similarity analysis that is mostly based on the word frequency.The experimental results validate the performance of our proposed PO-SIF on improving the accuracy of text similarity calculation. 展开更多
关键词 word2vec SIF part-of-speech word order similarity
原文传递
Pretrained Models and Evaluation Data for the Khmer Language
7
作者 Shengyi Jiang Sihui Fu +1 位作者 Nankai Lin Yingwen Fu 《Tsinghua Science and Technology》 SCIE EI CAS CSCD 2022年第4期709-718,共10页
Trained on a large corpus,pretrained models(PTMs)can capture different levels of concepts in context and hence generate universal language representations,which greatly benefit downstream natural language processing(N... Trained on a large corpus,pretrained models(PTMs)can capture different levels of concepts in context and hence generate universal language representations,which greatly benefit downstream natural language processing(NLP)tasks.In recent years,PTMs have been widely used in most NLP applications,especially for high-resource languages,such as English and Chinese.However,scarce resources have discouraged the progress of PTMs for low-resource languages.Transformer-based PTMs for the Khmer language are presented in this work for the first time.We evaluate our models on two downstream tasks:Part-of-speech tagging and news categorization.The dataset for the latter task is self-constructed.Experiments demonstrate the effectiveness of the Khmer models.In addition,we find that the current Khmer word segmentation technology does not aid performance improvement.We aim to release our models and datasets to the community in hopes of facilitating the future development of Khmer NLP applications. 展开更多
关键词 pretrained models Khmer language word segmentation part-of-speech(POS)tagging news categorization
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部