A multi-document summarization method based on Latent Semantic Indexing (LSI) is proposed. The method combines several reports on the same issue into a matrix of terms and sentences, and uses a Singular Value Decompos...A multi-document summarization method based on Latent Semantic Indexing (LSI) is proposed. The method combines several reports on the same issue into a matrix of terms and sentences, and uses a Singular Value Decomposition (SVD) to reduce the dimension of the matrix and extract features, and then the sentence similarity is computed. The sentences are clustered according to similarity of sentences. The centroid sentences are selected from each class. Finally, the selected sentences are ordered to generate the summarization. The evaluation and results are presented, which prove that the proposed methods are efficient.展开更多
With the purpose of improving the accuracy of text categorization and reducing the dimension of the feature space,this paper proposes a two-stage feature selection method based on a novel category correlation degree(C...With the purpose of improving the accuracy of text categorization and reducing the dimension of the feature space,this paper proposes a two-stage feature selection method based on a novel category correlation degree(CCD)method and latent semantic indexing(LSI).In the first stage,a novel CCD method is proposed to select the most effective features for text classification,which is more effective than the traditional feature selection method.In the second stage,document representation requires a high dimensionality of the feature space and does not take into account the semantic relation between features,which leads to a poor categorization accuracy.So LSI method is proposed to solve these problems by using statistically derived conceptual indices to replace the individual terms which can discover the important correlative relationship between features and reduce the feature space dimension.Firstly,each feature in our algorithm is ranked depending on their importance of classification using CCD method.Secondly,we construct a new semantic space based on LSI method among features.The experimental results have proved that our method can reduce effectively the dimension of text vector and improve the performance of text categorization.展开更多
To avoid the scalability of the existing systems that employed centralized indexing,index flooding or query flooding,we proposed an efficient peer-to-peer information retrieval system SPIRS (Semantic P2P-based Informa...To avoid the scalability of the existing systems that employed centralized indexing,index flooding or query flooding,we proposed an efficient peer-to-peer information retrieval system SPIRS (Semantic P2P-based Information Retrieval System) that supported state-of-the-art content and semantic searches. SPIRS distributes document indices through P2P network hierarchically by Latent Semantic Indexing (LSI) and organizes nodes into a hierarchical overlay through CAN and TRIE. Comparing with other P2P search techniques,those based on simple keyword matching,SPIRS has better accuracy for considering the advanced relevance among documents. Given a query,only a small number of nodes are needed for SPIRS to identify the matching documents. Furthermore,both theoretical analysis and experimental results show that SPIRS possesses higher accuracy and less logic hops.展开更多
Donation-based crowdfunding,as part of impact investment,plays a vital role in promoting economic development and alleviating poverty.In order to enhance the lender's enthusiasm for lending behavior,some platforms...Donation-based crowdfunding,as part of impact investment,plays a vital role in promoting economic development and alleviating poverty.In order to enhance the lender's enthusiasm for lending behavior,some platforms,for example Kiva,have introduced groups to facilitate lending.This study examines how the group environment can affect the lenders’behaviors in crowdfunding.It has been found that lenders who join groups contribute 1.2 more loans(about$30-$42)per month than those who do not,but the theoretical mechanism of these differences is unclear.To understand in depth how the group environment affects lending behaviors,we introduce and develop the PersonOrganization fit theory and Free-rider theory in this study.Combining machine-learning techniques with empirical analysis,the results show that the matching degree of motivation between group and lender has a positive effect on the lender behavior,i.e.,lending to loans,and this relationship is weakened by free-riding in large groups.In addition,the group openness can have different effects on lenders of different group sizes.Our research enriches the existing crowdfunding literature and fills the gap in the research on new lending models in crowdfunding,and it will also be useful for crowdfunding platforms in setting the rules for building groups.展开更多
文摘A multi-document summarization method based on Latent Semantic Indexing (LSI) is proposed. The method combines several reports on the same issue into a matrix of terms and sentences, and uses a Singular Value Decomposition (SVD) to reduce the dimension of the matrix and extract features, and then the sentence similarity is computed. The sentences are clustered according to similarity of sentences. The centroid sentences are selected from each class. Finally, the selected sentences are ordered to generate the summarization. The evaluation and results are presented, which prove that the proposed methods are efficient.
基金the National Natural Science Foundation of China(Nos.61073193 and 61300230)the Key Science and Technology Foundation of Gansu Province(No.1102FKDA010)+1 种基金the Natural Science Foundation of Gansu Province(No.1107RJZA188)the Science and Technology Support Program of Gansu Province(No.1104GKCA037)
文摘With the purpose of improving the accuracy of text categorization and reducing the dimension of the feature space,this paper proposes a two-stage feature selection method based on a novel category correlation degree(CCD)method and latent semantic indexing(LSI).In the first stage,a novel CCD method is proposed to select the most effective features for text classification,which is more effective than the traditional feature selection method.In the second stage,document representation requires a high dimensionality of the feature space and does not take into account the semantic relation between features,which leads to a poor categorization accuracy.So LSI method is proposed to solve these problems by using statistically derived conceptual indices to replace the individual terms which can discover the important correlative relationship between features and reduce the feature space dimension.Firstly,each feature in our algorithm is ranked depending on their importance of classification using CCD method.Secondly,we construct a new semantic space based on LSI method among features.The experimental results have proved that our method can reduce effectively the dimension of text vector and improve the performance of text categorization.
基金the Nartional Basic Research Programof China(Grant No.2002CB312002)the Science and Technology Commission of Shanghai Munic-ipality Project(Grant No.03dz15027 and 03dz15028).
文摘To avoid the scalability of the existing systems that employed centralized indexing,index flooding or query flooding,we proposed an efficient peer-to-peer information retrieval system SPIRS (Semantic P2P-based Information Retrieval System) that supported state-of-the-art content and semantic searches. SPIRS distributes document indices through P2P network hierarchically by Latent Semantic Indexing (LSI) and organizes nodes into a hierarchical overlay through CAN and TRIE. Comparing with other P2P search techniques,those based on simple keyword matching,SPIRS has better accuracy for considering the advanced relevance among documents. Given a query,only a small number of nodes are needed for SPIRS to identify the matching documents. Furthermore,both theoretical analysis and experimental results show that SPIRS possesses higher accuracy and less logic hops.
基金The work was supported by National Natural Science Foundation of China(No.71722005,71571133,71790594,71790590,71802068)Natural Science Foundation of Tianjin City(No.18JCJQJC45900)This study was partially funded by the financial supports of Tianjin University(Grant 2020XSC-0019).
文摘Donation-based crowdfunding,as part of impact investment,plays a vital role in promoting economic development and alleviating poverty.In order to enhance the lender's enthusiasm for lending behavior,some platforms,for example Kiva,have introduced groups to facilitate lending.This study examines how the group environment can affect the lenders’behaviors in crowdfunding.It has been found that lenders who join groups contribute 1.2 more loans(about$30-$42)per month than those who do not,but the theoretical mechanism of these differences is unclear.To understand in depth how the group environment affects lending behaviors,we introduce and develop the PersonOrganization fit theory and Free-rider theory in this study.Combining machine-learning techniques with empirical analysis,the results show that the matching degree of motivation between group and lender has a positive effect on the lender behavior,i.e.,lending to loans,and this relationship is weakened by free-riding in large groups.In addition,the group openness can have different effects on lenders of different group sizes.Our research enriches the existing crowdfunding literature and fills the gap in the research on new lending models in crowdfunding,and it will also be useful for crowdfunding platforms in setting the rules for building groups.