2 articles found
1. WordNet-based lexical semantic classification for text corpus analysis
Authors: 龙军, 王鲁达, 李祖德, 张祖平, 杨柳. Journal of Central South University (SCIE, EI, CAS, CSCD), 2015, No. 5, pp. 1833-1840 (8 pages)
Many text classification methods rely on statistical term measures for document representation. Such representations ignore the lexical semantic content of terms and the distilled mutual information, which leads to classification errors. This work proposes a document representation method, the WordNet-based lexical semantic VSM, to address the problem. Using WordNet, the method constructs a data structure of semantic-element information to characterize lexical semantic content, and adjusts EM modeling to disambiguate word stems. Then, in the lexical-semantic space of the corpus, a lexical-semantic eigenvector is built for each document by calculating the weight of each synset, and is applied to NWKNN, a widely recognized algorithm. On the Reuters-21578 text corpus and on an adjusted version of it with lexical replacement, the experimental results show that the lexical-semantic eigenvector outperforms the TF-IDF-based term-statistic eigenvector in both F1 measure and dimension scale. Because it produces document representation eigenvectors, the method has broad prospects for classification applications in text corpus analysis.
Keywords: document representation; lexical semantic content; classification; eigenvector
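The contrast drawn in this abstract, between weighting raw terms and weighting synsets, can be illustrated with a minimal sketch. It is not the authors' method: the `TOY_SYNSETS` lookup is a hypothetical stand-in for WordNet plus the EM-based disambiguation, the weighting is ordinary TF-IDF with a smoothed IDF, and NWKNN is not implemented. All function names are assumptions made for illustration.

```python
import math
from collections import Counter

# Hypothetical word -> synset-ID lookup standing in for WordNet plus
# word-sense disambiguation; the IDs are illustrative only.
TOY_SYNSETS = {
    "car": "car.n.01",
    "automobile": "car.n.01",
    "engine": "engine.n.01",
    "bank": "bank.n.01",
    "loan": "loan.n.01",
}

def to_synsets(tokens):
    """Map each token to its synset ID, dropping tokens with no entry."""
    return [TOY_SYNSETS[t] for t in tokens if t in TOY_SYNSETS]

def tfidf_vectors(docs):
    """Return one {feature: weight} dict per document (TF-IDF, IDF = ln(N/df) + 1)."""
    n = len(docs)
    df = Counter(f for doc in docs for f in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        size = len(doc) or 1  # guard against empty documents
        vectors.append(
            {f: (c / size) * (math.log(n / df[f]) + 1.0) for f, c in tf.items()}
        )
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(f, 0.0) for f, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)

docs = [["car", "engine"], ["automobile", "engine"], ["bank", "loan"]]
term_vecs = tfidf_vectors(docs)
synset_vecs = tfidf_vectors([to_synsets(d) for d in docs])

# The first two documents differ only by a synonym, so they look
# dissimilar at the term level but identical at the synset level.
print(cosine(term_vecs[0], term_vecs[1]))
print(cosine(synset_vecs[0], synset_vecs[1]))
```

In this toy corpus the synset space also has fewer features than the term space (4 instead of 5), since "car" and "automobile" collapse into one synset.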
2. FAST TEXT LOCATION BASED ON DISCRETE WAVELET TRANSFORM (cited by: 2)
Authors: Li Xiaohua, Shen Lansun. Journal of Electronics (China), 2005, No. 4, pp. 385-394 (10 pages)
The paper describes a texture-based fast text location scheme that operates directly in the Discrete Wavelet Transform (DWT) domain. Using the distinguishing texture characteristics encoded in the wavelet transform domain, text is detected quickly in complex background images stored in a compressed format such as JPEG2000, without full decompression. Compared with some traditional character location methods, the proposed scheme offers low computational cost, robustness to character size and font, and high accuracy. Preliminary experimental results show that the proposed scheme is efficient and effective.
Keywords: text location; Discrete Wavelet Transform (DWT); semantic content; texture analysis; image/video indexing
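The idea of locating text from wavelet-domain texture can be sketched roughly as follows. This is an assumption-laden illustration, not the paper's scheme: it computes a one-level Haar transform from pixels (the paper works on coefficients already present in the compressed stream, and JPEG2000 uses different wavelet filters), and the block size, threshold, and function names are all hypothetical.

```python
def haar2d(img):
    """One-level 2D Haar transform of a grayscale image with even dimensions.

    Returns (approx, horiz, vert, diag), each half the input size.
    """
    approx, horiz, vert, diag = [], [], [], []
    for r in range(0, len(img), 2):
        row_a, row_h, row_v, row_d = [], [], [], []
        for c in range(0, len(img[0]), 2):
            a, b = img[r][c], img[r][c + 1]
            d, e = img[r + 1][c], img[r + 1][c + 1]
            row_a.append((a + b + d + e) / 4)  # local average
            row_h.append((a + b - d - e) / 4)  # top vs bottom: horizontal edges
            row_v.append((a - b + d - e) / 4)  # left vs right: vertical edges
            row_d.append((a - b - d + e) / 4)  # diagonal detail
        approx.append(row_a)
        horiz.append(row_h)
        vert.append(row_v)
        diag.append(row_d)
    return approx, horiz, vert, diag

def text_candidate_blocks(img, block=2, threshold=20.0):
    """Return (block_row, block_col) indices whose mean detail energy is high.

    Energy is the mean of |horiz| + |vert| + |diag| over each block of
    sub-band coefficients; block and threshold are illustrative values.
    """
    _, horiz, vert, diag = haar2d(img)
    rows, cols = len(horiz), len(horiz[0])
    hits = []
    for br in range(0, rows, block):
        for bc in range(0, cols, block):
            total, count = 0.0, 0
            for r in range(br, min(br + block, rows)):
                for c in range(bc, min(bc + block, cols)):
                    total += abs(horiz[r][c]) + abs(vert[r][c]) + abs(diag[r][c])
                    count += 1
            if total / count > threshold:
                hits.append((br // block, bc // block))
    return hits

# Synthetic 8x8 image: flat gray on the left, high-contrast vertical
# strokes (a crude stand-in for text texture) on the right.
demo_image = [[128] * 4 + [0, 255, 0, 255] for _ in range(8)]
hits = text_candidate_blocks(demo_image)
print(hits)
```

Only the blocks covering the striped right half exceed the threshold; the flat left half has zero detail energy. A practical detector would need further steps, such as merging adjacent blocks into regions, which this sketch omits.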