Prokaryote phylogeny based on ribosomal proteins and aminoacyl tRNA synthetases by using the compositional distance approach (Cited by: 1)
1
Authors: WEI Haibin, QI Ji, HAO Bailin. Science China (Life Sciences), SCIE, CAS, 2004, No. 4, pp. 313-321 (9 pages)
To show that the newly developed K-string composition distance method, which infers phylogenetic relations of prokaryotes by counting oligopeptide frequencies, works equally well without whole-proteome data, we used all ribosomal proteins and the set of aminoacyl tRNA synthetases for each species. The latter group is known to yield inconsistent trees when its members are used individually. Our trees are obtained without any sequence alignment. Altogether 16 Archaea, 105 Bacteria and 2 Eucarya are represented on the tree. Most of the lower branchings agree well with the latest (2003) Outline of the second edition of Bergey's Manual of Systematic Bacteriology, and the trees also suggest some relationships among higher taxa.
Keywords: prokaryote; Archaea; phylogeny; phylogenetic tree; composition distance
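The idea of an alignment-free distance built from oligopeptide (K-string) counts can be illustrated with a minimal sketch. It is a simplification and not the authors' exact procedure: it uses plain relative frequencies and a cosine-based distance, and the function names `kstring_frequencies` and `composition_distance`, the default `k=3`, and the toy sequences are assumptions made for illustration only.

```python
from collections import Counter
from math import sqrt


def kstring_frequencies(proteins, k=3):
    """Relative frequencies of all overlapping K-strings in a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of length k over each sequence; no alignment is needed.
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    total = sum(counts.values())
    return {s: c / total for s, c in counts.items()} if total else {}


def composition_distance(freq_a, freq_b):
    """Cosine-based distance d = (1 - cos) / 2; 0 for identical compositions,
    0.5 when two non-negative frequency vectors share no K-string."""
    dot = sum(v * freq_b.get(s, 0.0) for s, v in freq_a.items())
    norm_a = sqrt(sum(v * v for v in freq_a.values()))
    norm_b = sqrt(sum(v * v for v in freq_b.values()))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("empty composition vector")
    return (1.0 - dot / (norm_a * norm_b)) / 2.0


# Toy example: two hypothetical "species", each given as a list of protein sequences.
species_a = kstring_frequencies(["MKVLAAGIV", "MKVLSAGIV"], k=2)
species_b = kstring_frequencies(["MKILAAGLV"], k=2)
d_ab = composition_distance(species_a, species_b)
```

A matrix of such pairwise distances over all species could then be passed to a standard distance-based tree-building method such as neighbor joining.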
A Multiple Feature Approach for Disorder Normalization in Clinical Notes
2
Authors: Lü Chen, CHEN Bo, Lü Chaozhen, QIU Likun, JI Donghong. Wuhan University Journal of Natural Sciences, CAS, CSCD, 2016, No. 6, pp. 482-490 (9 pages)
In this paper we propose a multiple feature approach for the normalization task, which maps each disorder mention in the text to a unique Unified Medical Language System (UMLS) concept unique identifier (CUI). We develop a two-step method: first, acquire a list of candidate CUIs and their associated preferred names using the UMLS API; second, choose the closest CUI by calculating the similarity between the input disorder mention and each candidate. The similarity calculation step is formulated as a classification problem, and multiple features (string features, ranking features, similarity features, and contextual features) are used to normalize the disorder mentions. The results show that the multiple feature approach improves the accuracy of the normalization task from 32.99% to 67.08% compared with the MetaMap baseline.
Keywords: natural language processing; disorder normalization; Levenshtein distance; semantic composition; multiple features
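The second step of the method, choosing the closest candidate, can be sketched with a single string feature. This is only an illustration under stated assumptions: the paper combines several feature types in a classifier, whereas the sketch below ranks candidates by normalized Levenshtein similarity alone. The candidate list is hard-coded rather than retrieved from the UMLS API, and the identifiers `CUI_A` and `CUI_B` are placeholders, not real CUIs.

```python
def levenshtein(a, b):
    """Edit distance between two strings, computed with a two-row dynamic programme."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def similarity(mention, name):
    """Case-insensitive similarity in [0, 1]; 1.0 means identical strings."""
    m, n = mention.lower(), name.lower()
    longest = max(len(m), len(n))
    return 1.0 if longest == 0 else 1.0 - levenshtein(m, n) / longest


def best_candidate(mention, candidates):
    """Return the (cui, preferred_name) pair whose name is closest to the mention."""
    return max(candidates, key=lambda c: similarity(mention, c[1]))


# Hypothetical candidates; in the paper these would come from a UMLS lookup.
candidates = [("CUI_A", "Chest pain"), ("CUI_B", "Abdominal pain")]
chosen = best_candidate("chest pains", candidates)
```

In a multiple-feature setting, the value returned by `similarity` would be one input among several to the classifier rather than the sole ranking criterion.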