期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Generalized Normalized Euclidean Distance Based Fuzzy Soft Set Similarity for Data Classification
1
作者 Rahmat Hidayat Iwan Tri Riyadi Yanto +2 位作者 Azizul Azhar Ramli Mohd Farhan Md.Fudzee Ansari Saleh Ahmar 《Computer Systems Science & Engineering》 SCIE EI 2021年第7期119-130,共12页
Classification is one of the data mining processes used to predict predetermined target classes with data learning accurately.This study discusses data classification using a fuzzy soft set method to predict target cl... Classification is one of the data mining processes used to predict predetermined target classes with data learning accurately.This study discusses data classification using a fuzzy soft set method to predict target classes accurately.This study aims to form a data classification algorithm using the fuzzy soft set method.In this study,the fuzzy soft set was calculated based on the normalized Hamming distance.Each parameter in this method is mapped to a power set from a subset of the fuzzy set using a fuzzy approximation function.In the classification step,a generalized normalized Euclidean distance is used to determine the similarity between two sets of fuzzy soft sets.The experiments used the University of California(UCI)Machine Learning dataset to assess the accuracy of the proposed data classification method.The dataset samples were divided into training(75%of samples)and test(25%of samples)sets.Experiments were performed in MATLAB R2010a software.The experiments showed that:(1)The fastest sequence is matching function,distance measure,similarity,normalized Euclidean distance,(2)the proposed approach can improve accuracy and recall by up to 10.3436%and 6.9723%,respectively,compared with baseline techniques.Hence,the fuzzy soft set method is appropriate for classifying data. 展开更多
关键词 Soft set fuzzy soft set CLASSIFICATION normalized euclidean distance SIMILARITY
下载PDF
New Information Distance Measure and Its Application in Question Answering System 被引量:3
2
作者 张显 郝宇 +1 位作者 朱小燕 李明 《Journal of Computer Science & Technology》 SCIE EI CSCD 2008年第4期557-572,共16页
In a question answering (QA) system, the fundamental problem is how to measure the distance between a question and an answer, hence ranking different answers. We demonstrate that such a distance can be precisely and... In a question answering (QA) system, the fundamental problem is how to measure the distance between a question and an answer, hence ranking different answers. We demonstrate that such a distance can be precisely and mathematically defined. Not only such a definition is possible, it is actually provably better than any other feasible definitions. Not only such an ultimate definition is possible, but also it can be conveniently and fruitfully applied to construct a QA system. We have built such a system -- QUANTA. Extensive experiments are conducted to justify the new theory. 展开更多
关键词 information distance normalized information distance question answering system
原文传递
A Multiple Feature Approach for Disorder Normalization in Clinical Notes
3
作者 Lü Chen CHEN Bo +2 位作者 Lü Chaozhen QIU Likun JI Donghong 《Wuhan University Journal of Natural Sciences》 CAS CSCD 2016年第6期482-490,共9页
In this paper we propose a multiple feature approach for the normalization task which can map each disorder mention in the text to a unique unified medical language system(UMLS)concept unique identifier(CUI). We d... In this paper we propose a multiple feature approach for the normalization task which can map each disorder mention in the text to a unique unified medical language system(UMLS)concept unique identifier(CUI). We develop a two-step method to acquire a list of candidate CUIs and their associated preferred names using UMLS API and to choose the closest CUI by calculating the similarity between the input disorder mention and each candidate. The similarity calculation step is formulated as a classification problem and multiple features(string features,ranking features,similarity features,and contextual features) are used to normalize the disorder mentions. The results show that the multiple feature approach improves the accuracy of the normalization task from 32.99% to 67.08% compared with the Meta Map baseline. 展开更多
关键词 natural language processing disorder normalization Levenshtein distance semantic composition multiple features
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部