11 articles found
Optimized Modeling Method for Unbalanced Data in High-Level Visual Semantic Concept Classification
1
Authors: 谭励, 曹元大, 杨明华, 贺巧艳. Journal of Beijing Institute of Technology (EI, CAS), 2009, No. 2, pp. 186-191 (6 pages)
To solve the unbalanced data problems of learning models for semantic concepts, an optimized modeling method based on the posterior probability support vector machine (PPSVM) is presented. A neighbor-based posterior probability estimator for visual concepts is provided. The proposed method has been applied in a high-level visual semantic concept classification system, and the experimental results show that it outperforms the baseline SVM models and is more robust in high-level visual semantic concept classification.
Keywords: visual concept modeling; posterior probability support vector machine; unbalanced data
Adaptive Optimization Swarm Algorithm Ensemble Model Applied to the Classification of Unbalanced Data
2
Authors: Qingqing He, Chao Qin. Intelligent Information Management, 2021, No. 5, pp. 251-267 (17 pages)
The hyper-parameters of existing random forest-based classification prediction models depend on empirical settings, which leads to unsatisfactory model performance. To address this problem, we propose a random forest model based on an adaptive particle swarm optimization algorithm for data classification, in which the adaptive particle swarm algorithm optimizes the hyper-parameters of the random forest so that the model can better predict unbalanced data. To counter premature convergence in the particle swarm optimization algorithm, the population is adaptively divided according to its fitness, and an adaptive update strategy is introduced to enhance the ability of particles to escape local optima. The main steps of the model are as follows: normalize the data set, initialize the model on the training set, and then use the particle swarm optimization algorithm to optimize the modeling process and establish a classification model. Experimental results show that the proposed algorithm is better than traditional algorithms, especially in terms of the F1-Measure and ACC evaluation standards. The results on six KEEL imbalanced data sets demonstrate the advantages of the proposed algorithm.
Keywords: Random Forest; APSO; unbalanced data; Parameter Optimization
Analysis of Variance in an Unbalanced Two-Way Mixed Effect Interactive Model (Cited by: 1)
3
Authors: F. C. Eze, E. U. Nwankwo. Open Journal of Statistics, 2016, No. 2, pp. 310-319 (10 pages)
The expected mean squares for an unbalanced mixed effect interactive model were derived using the Brute Force Method. From the expected mean squares, there are no obvious denominators for testing the main effects when the factors are mixed. An expression for an F-test of the main effects was derived and proved to be unbiased.
Keywords: Mixed Model; Expected Mean Squares; unbalanced data
Fault diagnosis for on-board equipment of train control system based on CNN and PSO-SVM hybrid model (Cited by: 1)
4
Authors: LU Renjie, LIN Haixiang, XU Li, LU Ran, ZHAO Zhengxiang, BAI Wansheng. Journal of Measurement Science and Instrumentation (CAS, CSCD), 2022, No. 4, pp. 430-438 (9 pages)
Rapid and precise location of faults in the on-board equipment of a train control system is a significant factor in ensuring reliable train operation. Text data from the fault tracking table of on-board equipment are taken as samples, and an on-board equipment fault diagnosis model is designed based on the combination of a convolutional neural network (CNN) and particle swarm optimization-support vector machines (PSO-SVM). Because fault text data are high-dimensional and sparse, the CNN is used for feature extraction. To reduce the influence of class imbalance in the fault sample data on classification accuracy, the PSO-SVM algorithm is introduced. The fully connected classification part of the CNN is replaced by PSO-SVM, the extracted features are classified precisely, and intelligent diagnosis of on-board equipment faults is implemented. Tests on fault text data of on-board equipment recorded by a railway bureau, together with a comparison against other models, indicate that this model clearly improves the evaluation indexes and can serve as an effective model for fault diagnosis of on-board equipment.
Keywords: on-board equipment; fault diagnosis; convolutional neural network (CNN); unbalanced text data; particle swarm optimization-support vector machines (PSO-SVM)
DGR: dynamic gradient-based routing protocol for unbalanced and persistent data transmission in wireless sensor and actor networks
5
Authors: Yi GUO, Zhe-zhuang XU, Cai-lian CHEN, Xin-ping GUAN. Journal of Zhejiang University-Science C (Computers and Electronics) (SCIE, EI), 2011, No. 4, pp. 273-279 (7 pages)
This paper is concerned with routing protocol design for large-scale wireless sensor and actor networks (WSANs). The actor-sensor-actor communication (ASAC) strategy is first proposed to guarantee the reliability of persistent actor-actor communication. To maintain network connectivity and prolong network lifetime, we propose a dynamic gradient-based routing protocol (DGR) to balance the energy consumption of the network. Given the different communication ranges of sensors and actors, the DGR protocol uses a data load expansion strategy to significantly prolong the network lifetime. A balance coefficient and a routing re-establishment threshold are also introduced to trade off network lifetime against routing efficiency. Simulation results show the effectiveness of the proposed DGR protocol for unbalanced and persistent data transmission.
Keywords: Wireless sensor and actor networks; unbalanced and persistent data transmission; Gradient-based routing
Determinant Factors of Capital Structure of Firms: An Empirical Analysis Based on Evidence From Chinese Listed Retail Companies
6
Author: Weihan FENG. Management Studies, 2022, No. 1, pp. 32-43 (12 pages)
This paper investigates the effect of various factors on the capital structure decisions of Chinese firms by conducting an empirical analysis of Chinese-listed retail companies. An unbalanced panel dataset was formed with a sample of 110 companies observed for 12 years (2010-2021). Each observation is measured quarterly. Traditional explanatory variables are adopted in the study, including profitability, company size, tangibility of assets, internal financing ability, tax ratio, growth opportunities, and volatility. By employing the Fama-MacBeth approach, the regression results are interpreted to determine the impact of the independent variables on the leverage a company takes on. To address the reverse causality problem, we include the lag term (last quarter's data) of the debt-to-equity ratio as a control variable. Consistent with previous theoretical and empirical studies, firms' leverage ratio is positively related to size, tangibility, tax ratio, and last quarter's debt level. Companies' profitability and internal financing ability are negatively correlated with their debt-to-equity ratio. Firms' earnings volatility and growth opportunities do not show a significant relationship with the debt-to-equity ratio. The study provides further empirical evidence on capital structure theories in emerging financial markets.
Keywords: capital structure theories; Chinese-listed retail companies; unbalanced panel data set; leverage ratio; emerging financial markets
Unbalanced classification method using least squares support vector machine with sparse strategy for steel surface defects with label noise
7
Authors: Li-ming Liu, Mao-xiang Chu, Rong-fen Gong, Xin-yu Qi. Journal of Iron and Steel Research International (SCIE, EI, CAS, CSCD), 2020, No. 12, pp. 1407-1419 (13 pages)
Least squares support vector machine (LS-SVM) plays an important role in steel surface defect classification because of its high speed. However, the defect samples obtained from a real production line may be noisy, and LS-SVM classifies poorly when noise samples are present. It is therefore necessary to design an effective algorithm to process the defect dataset obtained from the real production line. To this end, an adaptive weight function was employed to reduce the adverse effect of noise samples. Moreover, although LS-SVM is fast, it still suffers from high computational complexity when the number of training samples is large, and the time for steel surface defect classification should be as short as possible. Therefore, a sparse strategy was adopted to prune the training samples. Finally, since steel surface defect classification is an unbalanced data classification problem, the LS-SVM algorithm is not applicable. Hence, unbalanced data information was introduced to improve the classification performance. Taking all of the above factors into account, an improved LS-SVM classification model, termed ILS-SVM, was proposed. Experimental results show that the new algorithm offers high speed and strong anti-noise ability.
Keywords: Steel surface defect; Least squares support vector machine; Anti-noise; Sparseness; unbalanced data
Research on text fault recognition for on-board equipment of a C3 train control system based on an integrated XGBoost algorithm
8
Authors: Lili Yue, Luyue Liu, Maoqing Li, Baodi Xiao, Xiaochun Wu. Transportation Safety and Environment (EI), 2023, No. 4, pp. 36-44 (9 pages)
The reliability of train control on-board equipment is inextricably linked to the safe operation of a high-speed train. A fault diagnosis model for on-board equipment is built using the ensemble learning XGBoost (eXtreme Gradient Boosting) algorithm to help technicians assess the malfunction category of high-speed train control on-board equipment accurately and rapidly. The XGBoost algorithm iterates multiple decision tree models and improves the accuracy of fault diagnosis by fitting the prediction residuals and adding regularization terms. First, the text features were extracted using an improved TF-IDF (Term Frequency-Inverse Document Frequency) approach, and 24 fault feature words were chosen and converted into weighted word vectors. Second, considering the imbalanced fault categories in the data set, the ADASYN (Adaptive Synthetic sampling) oversampling technique was used to synthesize samples for the minority fault categories. Finally, the data samples were split into training and test sets based on the fault text data of CTCS-3 train control on-board equipment recorded by Guangzhou Railway Group maintenance personnel. After parameter tuning through grid search, the XGBoost model was used to locate faults in the test set automatically. Compared with other methods, the evaluation indexes of the XGBoost model were significantly improved. The diagnostic accuracy reached 95.43%, which verifies the effectiveness of the method in text fault diagnosis.
Keywords: vehicle on-board equipment; unbalanced data sets; text feature extraction; XGBoost model; fault diagnosis
Research on Equipment Fault Diagnosis Classification Model Based on Integrated Incremental Dynamic Weight Combination
9
Authors: Haipeng Ji, Xinduo Liu, Aoqi Tan, Zhijie Wang, Bing Yu. 国际计算机前沿大会会议论文集, 2020, No. 2, pp. 475-489 (15 pages)
This study proposes a classification model for equipment fault diagnosis based on an integrated incremental learning mechanism, designed around the characteristics of industrial equipment status data. The model first introduces a dynamic weight combination classification model based on long short-term memory (LSTM) and support vector machine (SVM), which solves the problem of fault feature extraction and classification in high-noise equipment state data. Then, an integrated incremental learning mechanism and unbalanced data processing technology are introduced into this model to solve the problems of feature extraction and classification for massive, unbalanced new data and of sample category imbalance in equipment status data. Finally, an equipment fault diagnosis classification model based on integrated incremental dynamic weight combination is formed. Experiments show that the model can effectively overcome the problems of excessive data volume, imbalance, high noise, and the inability to correlate data samples in the process of equipment fault diagnosis.
Keywords: Neural network; Support Vector Machine; Integrated increment; unbalanced data processing; Fault diagnosis
Improved Random Forest Algorithm Based on Adaptive Step Size Artificial Bee Colony Optimization
10
Authors: Jiuyuan Huo, Xuan Qin, Hamzah Murad Mohammed Al-Neshmi, Lin Mu, Tao Ju. 国际计算机前沿大会会议论文集, 2020, No. 2, pp. 216-233 (18 pages)
When applied to unbalanced data, the traditional random forest algorithm cannot achieve satisfactory prediction results for the minority class and suffers from the parameter selection dilemma. To address this problem, this paper proposes an unbalanced accuracy weighted random forest algorithm (UAW_RF) based on adaptive step size artificial bee colony optimization. It combines the ideas of decision tree optimization, sampling selection, and weighted voting to improve the ability of the random forest algorithm to classify biased data. The adaptive step size and the optimal solution were introduced to improve the position updating formula of the artificial bee colony algorithm, and the improved algorithm was then used to iteratively optimize the parameter combination of the random forest algorithm. Experimental results show satisfactory accuracies and indicate that the method can effectively improve the classification accuracy of the random forest algorithm.
Keywords: Random forest algorithm; Artificial bee colony algorithm; unbalanced data; Classification problem
A Nonparametric and Semiparametric Analysis on Inequality and Development: Evidence from OECD and Non-OECD Countries
11
Authors: KUI-WAI LI, XIANBO ZHOU. Economic and Political Studies, 2013, No. 2, pp. 55-79 (25 pages)
This paper studies the relationship between income inequality and economic development using unbalanced panel data of OECD and non-OECD countries (regions) for the period 1962-2003. The nonparametric estimation results show that income inequality in OECD countries is almost on the backside of the inverted-U relationship, while non-OECD countries are approximately on the foreside, except that the relationship in both country groups shows an upturn at a high level of development. Development has an indirect effect on inequality through control variables, but the modes are different in the two country groups. The model specification tests show that the relationship is not necessarily captured by the conventional quadratic function. The cubic and fourth-degree polynomials fit the OECD and non-OECD country groups best, respectively. Our finding is robust regardless of whether the specification uses control variables. Development plays a dominant role in mitigating inequality.
Keywords: Kuznets inverted-U; nonparametric and semiparametric models; unbalanced panel data