To study the capacity of artificial neural network (ANN) applying to battlefield target classification and result of classification, according to the characteristics of battlefield target acoustic and seismic sign...To study the capacity of artificial neural network (ANN) applying to battlefield target classification and result of classification, according to the characteristics of battlefield target acoustic and seismic signals, an on the spot experiment was carried out to derive acoustic and seismic signals of a tank and jeep by special experiment system. Experiment data processed by fast Fourier transform(FFT) were used to train the ANN to distinguish the two battlefield targets. The ANN classifier was performed by the special program based on the modified back propagation (BP) algorithm. The ANN classifier has high correct identification rates for acoustic and seismic signals of battlefield targets, and is suitable for the classification of battlefield targets. The modified BP algorithm eliminates oscillations and local minimum of the standard BP algorithm, and enhances the convergence rate of the ANN.展开更多
Sentiment analysis is the computational study of how opinions, attitudes, emotions, and perspectives are expressed in language, and has been the important task of natural language processing. Sentiment analysis is hig...Sentiment analysis is the computational study of how opinions, attitudes, emotions, and perspectives are expressed in language, and has been the important task of natural language processing. Sentiment analysis is highly valuable for both research and practical applications. The focuses were put on the difficulties in the construction of sentiment classifiers which normally need tremendous labeled domain training data, and a novel unsupervised framework was proposed to make use of the Chinese idiom resources to develop a general sentiment classifier. Furthermore, the domain adaption of general sentiment classifier was improved by taking the general classifier as the base of a self-training procedure to get a domain self-training sentiment classifier. To validate the effect of the unsupervised framework, several experiments were carried out on publicly available Chinese online reviews dataset. The experiments show that the proposed framework is effective and achieves encouraging results. Specifically, the general classifier outperforms two baselines(a Na?ve 50% baseline and a cross-domain classifier), and the bootstrapping self-training classifier approximates the upper bound domain-specific classifier with the lowest accuracy of 81.5%, but the performance is more stable and the framework needs no labeled training dataset.展开更多
Machine Learning(ML) techniques have been widely applied in recent traffic classification.However, the problems of both discriminator bias and class imbalance decrease the accuracies of ML based traffic classifier. In...Machine Learning(ML) techniques have been widely applied in recent traffic classification.However, the problems of both discriminator bias and class imbalance decrease the accuracies of ML based traffic classifier. In this paper, we propose an accurate and extensible traffic classifier. Specifically, to address the discriminator bias issue, our classifier is built by making an optimal cascade of binary sub-classifiers, where each binary sub-classifier is trained independently with the discriminators used for identifying application specific traffic. Moreover, to balance a training dataset,we apply SMOTE algorithm in generating artificial training samples for minority classes.We evaluate our classifier on two datasets collected from different network border routers.Compared with the previous multi-class traffic classifiers built in one-time training process,our classifier achieves much higher F-Measure and AUC for each application.展开更多
Automatic classification of sentiment data(e.g., reviews, blogs) has many applications in enterprise user management systems, and can help us understand people's attitudes about products or services. However, it is...Automatic classification of sentiment data(e.g., reviews, blogs) has many applications in enterprise user management systems, and can help us understand people's attitudes about products or services. However, it is difficult to train an accurate sentiment classifier for different domains. One of the major reasons is that people often use different words to express the same sentiment in different domains, and we cannot easily find a direct mapping relationship between them to reduce the differences between domains. So, the accuracy of the sentiment classifier will decline sharply when we apply a classifier trained in one domain to other domains. In this paper, we propose a novel approach called words alignment based on association rules(WAAR) for cross-domain sentiment classification,which can establish an indirect mapping relationship between domain-specific words in different domains by learning the strong association rules between domain-shared words and domain-specific words in the same domain. In this way, the differences between the source domain and target domain can be reduced to some extent, and a more accurate cross-domain classifier can be trained. Experimental results on Amazon~ datasets show the effectiveness of our approach on improving the performance of cross-domain sentiment classification.展开更多
文摘To study the capacity of artificial neural network (ANN) applying to battlefield target classification and result of classification, according to the characteristics of battlefield target acoustic and seismic signals, an on the spot experiment was carried out to derive acoustic and seismic signals of a tank and jeep by special experiment system. Experiment data processed by fast Fourier transform(FFT) were used to train the ANN to distinguish the two battlefield targets. The ANN classifier was performed by the special program based on the modified back propagation (BP) algorithm. The ANN classifier has high correct identification rates for acoustic and seismic signals of battlefield targets, and is suitable for the classification of battlefield targets. The modified BP algorithm eliminates oscillations and local minimum of the standard BP algorithm, and enhances the convergence rate of the ANN.
基金Projects(61170156,60933005)supported by the National Natural Science Foundation of China
文摘Sentiment analysis is the computational study of how opinions, attitudes, emotions, and perspectives are expressed in language, and has been the important task of natural language processing. Sentiment analysis is highly valuable for both research and practical applications. The focuses were put on the difficulties in the construction of sentiment classifiers which normally need tremendous labeled domain training data, and a novel unsupervised framework was proposed to make use of the Chinese idiom resources to develop a general sentiment classifier. Furthermore, the domain adaption of general sentiment classifier was improved by taking the general classifier as the base of a self-training procedure to get a domain self-training sentiment classifier. To validate the effect of the unsupervised framework, several experiments were carried out on publicly available Chinese online reviews dataset. The experiments show that the proposed framework is effective and achieves encouraging results. Specifically, the general classifier outperforms two baselines(a Na?ve 50% baseline and a cross-domain classifier), and the bootstrapping self-training classifier approximates the upper bound domain-specific classifier with the lowest accuracy of 81.5%, but the performance is more stable and the framework needs no labeled training dataset.
基金supported by the National Natural Science Foundation of China under Grant No.61402485National Natural Science Foundation of China under Grant No.61303061supported by the Open fund from HPCL No.201513-01
文摘Machine Learning(ML) techniques have been widely applied in recent traffic classification.However, the problems of both discriminator bias and class imbalance decrease the accuracies of ML based traffic classifier. In this paper, we propose an accurate and extensible traffic classifier. Specifically, to address the discriminator bias issue, our classifier is built by making an optimal cascade of binary sub-classifiers, where each binary sub-classifier is trained independently with the discriminators used for identifying application specific traffic. Moreover, to balance a training dataset,we apply SMOTE algorithm in generating artificial training samples for minority classes.We evaluate our classifier on two datasets collected from different network border routers.Compared with the previous multi-class traffic classifiers built in one-time training process,our classifier achieves much higher F-Measure and AUC for each application.
基金Project supported by the National Natural Science Foundation of China(Nos.61703013,91546111,91646201,61672070,and61672071)the Beijing Municipal Natural Science Foundation(No.4152005)+1 种基金the Key Projects of Beijing Municipal Education Commission(Nos.KZ201610005009 and KM201810005024)the International Cooperation Seed Grant from Beijing University of Technology of 2016(No.007000514116520)
文摘Automatic classification of sentiment data(e.g., reviews, blogs) has many applications in enterprise user management systems, and can help us understand people's attitudes about products or services. However, it is difficult to train an accurate sentiment classifier for different domains. One of the major reasons is that people often use different words to express the same sentiment in different domains, and we cannot easily find a direct mapping relationship between them to reduce the differences between domains. So, the accuracy of the sentiment classifier will decline sharply when we apply a classifier trained in one domain to other domains. In this paper, we propose a novel approach called words alignment based on association rules(WAAR) for cross-domain sentiment classification,which can establish an indirect mapping relationship between domain-specific words in different domains by learning the strong association rules between domain-shared words and domain-specific words in the same domain. In this way, the differences between the source domain and target domain can be reduced to some extent, and a more accurate cross-domain classifier can be trained. Experimental results on Amazon~ datasets show the effectiveness of our approach on improving the performance of cross-domain sentiment classification.