期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Clustering Student Discussion Messages on Online Forumby Visualization and Non-Negative Matrix Factorization
1
作者 xiaodi huang Jianhua Zhao +1 位作者 Jeff Ash Wei Lai 《Journal of Software Engineering and Applications》 2013年第7期7-12,共6页
The use of online discussion forum can?effectively engage students in their studies. As the number of messages posted on the forum is increasing, it is more difficult for instructors to read and respond to them in a p... The use of online discussion forum can?effectively engage students in their studies. As the number of messages posted on the forum is increasing, it is more difficult for instructors to read and respond to them in a prompt way. In this paper, we apply non-negative matrix factorization and visualization to clustering message data, in order to provide a summary view of messages that disclose their deep semantic relationships. In particular, the NMF is able to find the underlying issues hidden in the messages about which most of the students are concerned. Visualization is employed to estimate the initial number of clusters, showing the relation communities. The experiments and comparison on a real dataset have been reported to demonstrate the effectiveness of the approaches. 展开更多
关键词 Online FORUM Cluster Non-Negative Matrix FACTORIZATION VISUALIZATION
下载PDF
Element Retrieval Using Namespace Based on Keyword Search over XML Documents
2
作者 Yang WANG Zhikui CHEN xiaodi huang 《Journal of Software Engineering and Applications》 2010年第1期65-72,共8页
Querying over XML elements using keyword search is steadily gaining popularity. The traditional similarity measure is widely employed in order to effectively retrieve various XML documents. A number of authors have al... Querying over XML elements using keyword search is steadily gaining popularity. The traditional similarity measure is widely employed in order to effectively retrieve various XML documents. A number of authors have already proposed different similarity-measure methods that take advantage of the structure and content of XML documents. However, they do not consider the similarity between latent semantic information of element texts and that of keywords in a query. Although many algorithms on XML element search are available, some of them have the high computational complexity due to searching for a huge number of elements. In this paper, we propose a new algorithm that makes use of the se-mantic similarity between elements instead of between entire XML documents, considering not only the structure and content of an XML document, but also semantic information of namespaces in elements. We compare our algorithm with the three other algorithms by testing on real datasets. The experiments have demonstrated that our proposed method is able to improve the query accuracy, as well as to reduce the running time. 展开更多
关键词 SEMANTICS NAMESPACE SVD TEXT MATCHING
下载PDF
Self-selective memristor-enabled in-memory search for highly efficient data mining
3
作者 Ling Yang xiaodi huang +12 位作者 Yi Li Houji Zhou Yingjie Yu Han Bao Jiancong Li Shengguang Ren Feng Wang Lei Ye Yuhui He Jia Chen Guiyou Pu Xiang Li Xiangshui Miao 《InfoMat》 SCIE CSCD 2023年第5期121-133,共13页
Similarity search,that is,finding similar items in massive data,is a fundamental computing problem in many fields such as data mining and information retrieval.However,for large-scale and high-dimension data,it suffer... Similarity search,that is,finding similar items in massive data,is a fundamental computing problem in many fields such as data mining and information retrieval.However,for large-scale and high-dimension data,it suffers from high computational complexity,requiring tremendous computation resources.Here,based on the low-power self-selective memristors,for the first time,we propose an in-memory search(IMS)system with two innovative designs.First,by exploiting the natural distribution law of the devices resistance,a hardware locality sensitive hashing encoder has been designed to transform the realvalued vectors into more efficient binary codes.Second,a compact memristive ternary content addressable memory is developed to calculate the Hamming distances between the binary codes in parallel.Our IMS system demonstrated a 168energy efficiency improvement over all-transistors counterparts in clustering and classification tasks,while achieving a software-comparable accuracy,thus providing a low-complexity and low-power solution for in-memory data mining applications. 展开更多
关键词 in-memory search self-selective memristor similarity search ternary content addressable memory
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部