Ontology-based Knowledge Extraction from Hidden Web (cited by: 1)
1
Authors: Song Hui (宋晖), Ma Fanyuan (马范援), Liu Xiaoqiang (刘晓强). Journal of Donghua University (English Edition), EI, CAS, 2004, No. 5, pp. 73-78 (6 pages)
The Hidden Web provides a large amount of domain-specific data for constructing knowledge services. Most previous knowledge extraction research ignores the valuable data hidden in Web databases, and related work does not address how to make extracted information available to a knowledge system. This paper describes a novel approach to building a domain-specific knowledge service with data retrieved from the Hidden Web. An ontology models the domain knowledge. Query forms of different Web sites are translated into a machine-understandable format, expressed as defined knowledge concepts, so that they can be accessed automatically. Knowledge data are also extracted from Web pages and organized as ontology-format knowledge. The experiment shows that the algorithm achieves high accuracy and that the system greatly facilitates the construction of knowledge services.
Keywords: knowledge service; hidden web; ontology; data extraction
Developing Two Different Novel Techniques for Arabic Text Stemming
2
Authors: Mohammad Mustafa, Afag Salah Aldeen, Mohammed E. Zidan, Rihab E. Ahmed, Yasir Eltigani. Intelligent Information Management, 2019, No. 1, pp. 1-23 (23 pages)
Stemming produces the stem or root of a word. The process is vital to research fields such as text mining, sentiment analysis, and text categorization. Several techniques have been proposed for stemming Arabic text; among them, the Khoja and light-10 stemmers are the most widely used. In this paper, we propose and evaluate two stemming techniques for Arabic that are based on light stemming. The new stemmers are compared with the best reported light stemmer, light-10. Experiments on standard collections show that the proposed stemmers yield improvements of 5.13% and 13.1% in retrieval performance over light-10, with average precision of 0.369 and 0.397, respectively, and that the improvement is statistically significant.
Keywords: Arabic language; Arabic information retrieval; light stemming; light-10; extended light stemmer; linguistic-based stemmer
Advanced Fuzzy C-Means Algorithm Based on Local Density and Distance (cited by: 1)
3
Authors: Shaochun, PANG Yijie, SHAO Sen, JIANG Keyuan. Journal of Shanghai Jiaotong University (Science), EI, 2018, No. 5, pp. 636-642 (7 pages)
This paper presents an advanced fuzzy C-means (FCM) clustering algorithm to overcome the weaknesses of the traditional FCM algorithm, including the instability caused by random selection of the initial centers and limitations related to data separation or cluster size. The advanced FCM algorithm combines distance with density and improves the objective function so that the performance of the algorithm can be improved. Experimental results show that the proposed FCM algorithm requires fewer iterations yet provides higher accuracy than the traditional FCM algorithm. The advanced algorithm is applied to data on stars' box-office influence, and the classification accuracy for first-class stars reaches 92.625%.
Keywords: objective function; clustering center; fuzzy C-means (FCM) clustering algorithm; degree of membership
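For orientation, the traditional FCM baseline that the abstract above compares against can be sketched as follows. This is a minimal illustration of standard fuzzy C-means only, with random initial memberships (the source of the instability the abstract mentions); it is not the paper's density-and-distance variant, whose details the abstract does not give. The function name `fcm` and its parameters (`m`, `iters`, `tol`, `seed`) are hypothetical choices made for this sketch.

```python
import random

def fcm(points, c, m=2.0, iters=100, tol=1e-6, seed=0):
    """Standard fuzzy C-means on a list of equal-length tuples (sketch)."""
    rng = random.Random(seed)
    n, dim = len(points), len(points[0])
    # Random initial memberships, each row normalised to sum to 1.
    u = []
    for _ in range(n):
        row = [rng.random() + 1e-9 for _ in range(c)]
        total = sum(row)
        u.append([v / total for v in row])
    centers = []
    for _ in range(iters):
        # Centre update: mean of the points weighted by u_ij ** m.
        centers = []
        for j in range(c):
            w = [u[i][j] ** m for i in range(n)]
            total = sum(w)
            centers.append(tuple(
                sum(w[i] * points[i][d] for i in range(n)) / total
                for d in range(dim)))
        # Membership update from squared distances to each centre.
        new_u = []
        for p in points:
            d2 = [sum((a - b) ** 2 for a, b in zip(p, ctr)) for ctr in centers]
            if any(v == 0.0 for v in d2):
                # Point coincides with a centre: full membership there.
                hits = [1.0 if v == 0.0 else 0.0 for v in d2]
                total = sum(hits)
                new_u.append([h / total for h in hits])
            else:
                inv = [v ** (-1.0 / (m - 1.0)) for v in d2]
                total = sum(inv)
                new_u.append([v / total for v in inv])
        # Stop once no membership changes by more than tol.
        shift = max(abs(a - b)
                    for old, new in zip(u, new_u)
                    for a, b in zip(old, new))
        u = new_u
        if shift < tol:
            break
    return centers, u

# Two well-separated toy groups (made-up data, for illustration only).
toy = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
       (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
toy_centers, toy_memberships = fcm(toy, c=2)
```

Each row of the returned membership matrix sums to 1, and a point's largest entry indicates its most likely cluster. A different `seed` changes the starting memberships and can change the number of iterations needed.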