This study introduces the Orbit Weighting Scheme(OWS),a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval(IR)models,which have traditionally relied on weighting schem...This study introduces the Orbit Weighting Scheme(OWS),a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval(IR)models,which have traditionally relied on weighting schemes like tf-idf and BM25.These conventional methods often struggle with accurately capturing document relevance,leading to inefficiencies in both retrieval performance and index size management.OWS proposes a dynamic weighting mechanism that evaluates the significance of terms based on their orbital position within the vector space,emphasizing term relationships and distribution patterns overlooked by existing models.Our research focuses on evaluating OWS’s impact on model accuracy using Information Retrieval metrics like Recall,Precision,InterpolatedAverage Precision(IAP),andMeanAverage Precision(MAP).Additionally,we assessOWS’s effectiveness in reducing the inverted index size,crucial for model efficiency.We compare OWS-based retrieval models against others using different schemes,including tf-idf variations and BM25Delta.Results reveal OWS’s superiority,achieving a 54%Recall and 81%MAP,and a notable 38%reduction in the inverted index size.This highlights OWS’s potential in optimizing retrieval processes and underscores the need for further research in this underrepresented area to fully leverage OWS’s capabilities in information retrieval methodologies.展开更多
The frame of text classification system was presented. The high dimensionality in feature space for text classification was studied. The mutual information is a widely used information theoretic measure, in a descript...The frame of text classification system was presented. The high dimensionality in feature space for text classification was studied. The mutual information is a widely used information theoretic measure, in a descriptive way, to measure the stochastic dependency of discrete random variables. The measure method was used as a criterion to reduce high dimensionality of feature vectors in text classification on Web. Feature selections or conversions were performed by using maximum mutual information including linear and non-linear feature conversions. Entropy was used and extended to find right features commendably in pattern recognition systems. Favorable foundation would be established for text classification mining.展开更多
In order to solve the poor performance in text classification when using traditional formula of mutual information (MI) , a feature selection algorithm were proposed based on improved mutual information. The improve...In order to solve the poor performance in text classification when using traditional formula of mutual information (MI) , a feature selection algorithm were proposed based on improved mutual information. The improved mutual information algorithm, which is on the basis of traditional improved mutual information methods that enbance the MI value of negative characteristics and feature' s frequency, supports the concept of concentration degree and dispersion degree. In accordance with the concept of concentration degree and dispersion degree, formulas which embody concentration degree and dispersion degree were constructed and the improved mutual information was implemented based on these. In this paper, the feature selection algorithm was applied based on improved mutual information to a text classifier based on Biomimetic Pattern Recognition and it was compared with several other feature selection methods. The experimental results showed that the improved mutu- al information feature selection method greatly enhances the performance compared with traditional mutual information feature selection methods and the performance is better than that of information gain. Through the introduction of the concept of concentration degree and dispersion degree, the improved mutual information feature selection method greatly improves the performance of text classification system.展开更多
Textual informativity is one of the seven standards of textuality. This paper focuses on the shift among three orders of textual informativity. And also probe into some strategies to compensate for the different level...Textual informativity is one of the seven standards of textuality. This paper focuses on the shift among three orders of textual informativity. And also probe into some strategies to compensate for the different level of informativity.展开更多
The fourth international conference on Web information systems and applications (WISA 2007) has received 409 submissions and has accepted 37 papers for publication in this issue. The papers cover broad research area...The fourth international conference on Web information systems and applications (WISA 2007) has received 409 submissions and has accepted 37 papers for publication in this issue. The papers cover broad research areas, including Web mining and data warehouse, Deep Web and Web integration, P2P networks, text processing and information retrieval, as well as Web Services and Web infrastructure. After briefly introducing the WISA conference, the survey outlines the current activities and future trends concerning Web information systems and applications based on the papers accepted for publication.展开更多
Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which refl...Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.展开更多
Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which refl...Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.展开更多
Ada Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which refle...Ada Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.展开更多
Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community
Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which refl...Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.展开更多
Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scienti...Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.展开更多
Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scienti...Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.展开更多
Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scienti...Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific com- munity which reflect the information on a worldwide basis. The journal publishes scholarly papers on marine science and technology, including physics, chemistry, biology, hydrology, meteorolagy, geology, engineering, remote sensing, etc. Progress reports on research projects are also in- cluded.展开更多
Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scienti...Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific com- munity which reflect the information on a worldwide basis.展开更多
Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scienti...Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.展开更多
Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which refl...Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.展开更多
文摘This study introduces the Orbit Weighting Scheme(OWS),a novel approach aimed at enhancing the precision and efficiency of Vector Space information retrieval(IR)models,which have traditionally relied on weighting schemes like tf-idf and BM25.These conventional methods often struggle with accurately capturing document relevance,leading to inefficiencies in both retrieval performance and index size management.OWS proposes a dynamic weighting mechanism that evaluates the significance of terms based on their orbital position within the vector space,emphasizing term relationships and distribution patterns overlooked by existing models.Our research focuses on evaluating OWS’s impact on model accuracy using Information Retrieval metrics like Recall,Precision,InterpolatedAverage Precision(IAP),andMeanAverage Precision(MAP).Additionally,we assessOWS’s effectiveness in reducing the inverted index size,crucial for model efficiency.We compare OWS-based retrieval models against others using different schemes,including tf-idf variations and BM25Delta.Results reveal OWS’s superiority,achieving a 54%Recall and 81%MAP,and a notable 38%reduction in the inverted index size.This highlights OWS’s potential in optimizing retrieval processes and underscores the need for further research in this underrepresented area to fully leverage OWS’s capabilities in information retrieval methodologies.
文摘The frame of text classification system was presented. The high dimensionality in feature space for text classification was studied. The mutual information is a widely used information theoretic measure, in a descriptive way, to measure the stochastic dependency of discrete random variables. The measure method was used as a criterion to reduce high dimensionality of feature vectors in text classification on Web. Feature selections or conversions were performed by using maximum mutual information including linear and non-linear feature conversions. Entropy was used and extended to find right features commendably in pattern recognition systems. Favorable foundation would be established for text classification mining.
基金Sponsored by the National Nature Science Foundation Projects (Grant No. 60773070,60736044)
文摘In order to solve the poor performance in text classification when using traditional formula of mutual information (MI) , a feature selection algorithm were proposed based on improved mutual information. The improved mutual information algorithm, which is on the basis of traditional improved mutual information methods that enbance the MI value of negative characteristics and feature' s frequency, supports the concept of concentration degree and dispersion degree. In accordance with the concept of concentration degree and dispersion degree, formulas which embody concentration degree and dispersion degree were constructed and the improved mutual information was implemented based on these. In this paper, the feature selection algorithm was applied based on improved mutual information to a text classifier based on Biomimetic Pattern Recognition and it was compared with several other feature selection methods. The experimental results showed that the improved mutu- al information feature selection method greatly enhances the performance compared with traditional mutual information feature selection methods and the performance is better than that of information gain. Through the introduction of the concept of concentration degree and dispersion degree, the improved mutual information feature selection method greatly improves the performance of text classification system.
文摘Textual informativity is one of the seven standards of textuality. This paper focuses on the shift among three orders of textual informativity. And also probe into some strategies to compensate for the different level of informativity.
文摘The fourth international conference on Web information systems and applications (WISA 2007) has received 409 submissions and has accepted 37 papers for publication in this issue. The papers cover broad research areas, including Web mining and data warehouse, Deep Web and Web integration, P2P networks, text processing and information retrieval, as well as Web Services and Web infrastructure. After briefly introducing the WISA conference, the survey outlines the current activities and future trends concerning Web information systems and applications based on the papers accepted for publication.
文摘Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.
文摘Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.
文摘Ada Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.
文摘Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community
文摘Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.
文摘Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.
文摘Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.
文摘Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific com- munity which reflect the information on a worldwide basis. The journal publishes scholarly papers on marine science and technology, including physics, chemistry, biology, hydrology, meteorolagy, geology, engineering, remote sensing, etc. Progress reports on research projects are also in- cluded.
文摘Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific com- munity which reflect the information on a worldwide basis.
文摘Acta Oceanologica Sinica (AOS) is a comprehensive academic journal edited by the Editorial Committee of Acta Oceanologica Sinica and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.
文摘Acta Oceanologica Sinica is a comprehensive academic journal edited by the Chinese Society of Oceanography and is designed to provide a forum for important research papers of the marine scientific community which reflect the information on a worldwide basis.