In engineering application,there is only one adaptive weights estimated by most of traditional early warning radars for adaptive interference suppression in a pulse reputation interval(PRI).Therefore,if the training s...In engineering application,there is only one adaptive weights estimated by most of traditional early warning radars for adaptive interference suppression in a pulse reputation interval(PRI).Therefore,if the training samples used to calculate the weight vector does not contain the jamming,then the jamming cannot be removed by adaptive spatial filtering.If the weight vector is constantly updated in the range dimension,the training data may contain target echo signals,resulting in signal cancellation effect.To cope with the situation that the training samples are contaminated by target signal,an iterative training sample selection method based on non-homogeneous detector(NHD)is proposed in this paper for updating the weight vector in entire range dimension.The principle is presented,and the validity is proven by simulation results.展开更多
Graph convolutional networks(GCNs)have received significant attention from various research fields due to the excellent performance in learning graph representations.Although GCN performs well compared with other meth...Graph convolutional networks(GCNs)have received significant attention from various research fields due to the excellent performance in learning graph representations.Although GCN performs well compared with other methods,it still faces challenges.Training a GCN model for large-scale graphs in a conventional way requires high computation and storage costs.Therefore,motivated by an urgent need in terms of efficiency and scalability in training GCN,sampling methods have been proposed and achieved a significant effect.In this paper,we categorize sampling methods based on the sampling mechanisms and provide a comprehensive survey of sampling methods for efficient training of GCN.To highlight the characteristics and differences of sampling methods,we present a detailed comparison within each category and further give an overall comparative analysis for the sampling methods in all categories.Finally,we discuss some challenges and future research directions of the sampling methods.展开更多
In view of class imbalance in data-driven modeling for Prognostics and Health Management(PHM),existing classification methods may fail in generating effective fault prediction models for the on-board high-speed train ...In view of class imbalance in data-driven modeling for Prognostics and Health Management(PHM),existing classification methods may fail in generating effective fault prediction models for the on-board high-speed train control equipment.A virtual sample generation solution based on Generative Adversarial Network(GAN)is proposed to overcome this shortcoming.Aiming at augmenting the sample classes with the imbalanced data problem,the GAN-based virtual sample generation strategy is embedded into the establishment of fault prediction models.Under the PHM framework of the on-board train control system,the virtual sample generation principle and the detailed procedures are presented.With the enhanced class-balancing mechanism and the designed sample augmentation logic,the PHM scheme of the on-board train control equipment has powerful data condition adaptability and can effectively predict the fault probability and life cycle status.Practical data from a specific type of on-board train control system is employed for the validation of the presented solution.The comparative results indicate that GAN-based sample augmentation is capable of achieving a desirable sample balancing level and enhancing the performance of correspondingly derived fault prediction models for the Condition-based Maintenance(CBM)operations.展开更多
The accuracy of laser-induced breakdown spectroscopy(LIBS) quantitative method is greatly dependent on the amount of certified standard samples used for training. However, in practical applications, only limited stand...The accuracy of laser-induced breakdown spectroscopy(LIBS) quantitative method is greatly dependent on the amount of certified standard samples used for training. However, in practical applications, only limited standard samples with labeled certified concentrations are available. A novel semi-supervised LIBS quantitative analysis method is proposed, based on co-training regression model with selection of effective unlabeled samples. The main idea of the proposed method is to obtain better regression performance by adding effective unlabeled samples in semisupervised learning. First, effective unlabeled samples are selected according to the testing samples by Euclidean metric. Two original regression models based on least squares support vector machine with different parameters are trained by the labeled samples separately, and then the effective unlabeled samples predicted by the two models are used to enlarge the training dataset based on labeling confidence estimation. The final predictions of the proposed method on the testing samples will be determined by weighted combinations of the predictions of two updated regression models. Chromium concentration analysis experiments of 23 certified standard high-alloy steel samples were carried out, in which 5 samples with labeled concentrations and 11 unlabeled samples were used to train the regression models and the remaining 7 samples were used for testing. With the numbers of effective unlabeled samples increasing, the root mean square error of the proposed method went down from 1.80% to 0.84% and the relative prediction error was reduced from 9.15% to 4.04%.展开更多
The aerodynamic optimization design of high-speed trains(HSTs)is crucial for energy conservation,environmental preservation,operational safety,and speeding up.This study aims to review the current state and progress o...The aerodynamic optimization design of high-speed trains(HSTs)is crucial for energy conservation,environmental preservation,operational safety,and speeding up.This study aims to review the current state and progress of the aerodynamic multi-objective optimization of HSTs.First,the study explores the impact of train nose shape parameters on aerodynamic performance.The parameterization methods involved in the aerodynamic multiobjective optimization ofHSTs are summarized and classified as shape-based and disturbance-based parameterizationmethods.Meanwhile,the advantages and limitations of each parameterizationmethod,aswell as the applicable scope,are briefly discussed.In addition,the NSGA-II algorithm,particle swarm optimization algorithm,standard genetic algorithm,and other commonly used multi-objective optimization algorithms and the improvements in the field of aerodynamic optimization for HSTs are summarized.Second,this study investigates the aerodynamic multi-objective optimization technology for HSTs using the surrogate model,focusing on the Kriging surrogate models,neural network,and support vector regression.Moreover,the construction methods of surrogate models are summarized,and the influence of different sample infill criteria on the efficiency ofmulti-objective optimization is analyzed.Meanwhile,advanced aerodynamic optimization methods in the field of aircraft have been briefly introduced to guide research on the aerodynamic optimization of HSTs.Finally,based on the summary of the research progress of the aerodynamicmulti-objective optimization ofHSTs,future research directions are proposed,such as intelligent recognition technology of characteristic parameters,collaborative optimization of multiple operating environments,and sample infill criterion of the surrogate model.展开更多
The fundamental step to get a Statistical Shape Model (SSM) is to align all the training samples to the same spatial modality. In this paper, we propose a new 3D alignment method for organic training samples matching,...The fundamental step to get a Statistical Shape Model (SSM) is to align all the training samples to the same spatial modality. In this paper, we propose a new 3D alignment method for organic training samples matching, whose modalities are orientable and surface figures could be recognized. It is a feature based alignment method which matches two models depending on the distribution of surface curvature. According to the affine transformation on 2D Gaussian map, the distances between the corresponding parts on surface could be minimized. We applied our proposed method on 5 cases left lung training samples alignment and 4 cases liver training samples alignment. The experiment results were performed on the left lung training samples and the liver training samples. The availability of proposed method was confirmed.展开更多
Training neural network to recognize targets needs a lot of samples.People usually get these samples in a non-systematic way,which can miss or overemphasize some target information.To improve this situation,a new meth...Training neural network to recognize targets needs a lot of samples.People usually get these samples in a non-systematic way,which can miss or overemphasize some target information.To improve this situation,a new method based on virtual model and invariant moments was proposed to generate training samples.The method was composed of the following steps:use computer and simulation software to build target object's virtual model and then simulate the environment,light condition,camera parameter,etc.;rotate the model by spin and nutation of inclination to get the image sequence by virtual camera;preprocess each image and transfer them into binary image;calculate the invariant moments for each image and get a vectors' sequence.The vectors' sequence which was proved to be complete became the training samples together with the target outputs.The simulated results showed that the proposed method could be used to recognize the real targets and improve the accuracy of target recognition effectively when the sampling interval was short enough and the circumstance simulation was close enough.展开更多
Based on the good localization characteristic of the wavelet transform both in time and frequency domain, a de-noising method based on wavelet transform is presented, which can make the extraction of visual evoked pot...Based on the good localization characteristic of the wavelet transform both in time and frequency domain, a de-noising method based on wavelet transform is presented, which can make the extraction of visual evoked potentials in single training sample from the EEG background noise in favor of studying the changes between the single sample response happen. The information is probably related with the different function, appearance and pathologies of the brain. At the same time this method can also be used to remove those signal’s artifacts that do not appear with EP within the same scope of time or frequency. The traditional Fourier filter can hardly attain the similar result. This method is different from other wavelet de-noising methods in which different criteria are employed in choosing wavelet coefficient. It has a biggest virtue of noting the differences among the single training sample and making use of the characteristics of high time frequency resolution to reduce the effect of interference factors to a maximum extent within the time scope that EP appear. The experiment result proves that this method is not restricted by the signal-to-noise ratio of evoked potential and electroencephalograph (EEG) and even can recognize instantaneous event under the condition of lower signal-to-noise ratio, as well as recognize the samples which evoked evident response more easily. Therefore, more evident average evoked response could be achieved by de-nosing the signals obtained through averaging out the samples that can evoke evident responses than de-nosing the average of original signals. In addition, averaging methodology can dramatically reduce the number of record samples needed, thus avoiding the effect of behavior change during the recording process. This methodology pays attention to the differences among single training sample and also accomplishes the extraction of visual evoked potentials from single trainings sample. As a result, system speed and accuracy could be improved to a great extent if this methodology is applied to brain-computer interface system based on evoked responses.展开更多
A current identification method based on optimized variational mode decomposition(VMD)and sample entropy(SampEn)is proposed in order to solve the problem that the main protection of the urban rail transit DC feeder ca...A current identification method based on optimized variational mode decomposition(VMD)and sample entropy(SampEn)is proposed in order to solve the problem that the main protection of the urban rail transit DC feeder cannot distinguish between train charging current and remote short circuit current.This method uses the principle of energy difference to optimize the optimal mode decomposition number k of VMD;the optimal VMD for DC feeder current is decomposed into the intrinsic modal function(IMF)of different frequency bands.The sample entropy algorithm is used to perform feature extraction of each IMF,and then the eigenvalues of the intrinsic modal function of each frequency band of the current signal can be obtained.The recognition feature vector is input into the support vector machine model based on Bayesian hyperparameter optimization for training.After a large number of experimental data are verified,it is found that the optimal VMD_SampEn algorithm to identify the train charging current and remote short circuit current is more accurate than other algorithms.Thus,the algorithm based on optimized VMD_SampEn has certain engineering application value in the fault current identification of the DC traction feeder.展开更多
中文故事结尾生成(SEG)是自然语言处理中的下游任务之一。基于全错误结尾的CLSEG(Contrastive Learning of Story Ending Generation)在故事的一致性方面表现较好。然而,由于错误结尾中也包含与原结尾文本相同的内容,仅使用错误结尾的...中文故事结尾生成(SEG)是自然语言处理中的下游任务之一。基于全错误结尾的CLSEG(Contrastive Learning of Story Ending Generation)在故事的一致性方面表现较好。然而,由于错误结尾中也包含与原结尾文本相同的内容,仅使用错误结尾的对比训练会导致生成文本中原结尾正确的主要部分被剥离。因此,在CLSEG基础上增加正向结尾增强训练,以保留对比训练中损失的正确部分;同时,通过正向结尾的引入,使生成的结尾具有更强的多样性和关联性。基于双向对比训练的中文故事结尾生成模型包含两个主要部分:1)多结尾采样,通过不同的模型方法获取正向增强的结尾和反向对比的错误结尾;2)对比训练,在训练过程中修改损失函数,使生成的结尾接近正向结尾,远离错误结尾。在公开的故事数据集OutGen上的实验结果表明,相较于GPT2. ft和深层逐层隐变量融合(Della)等模型,所提模型的BERTScore、METEOR等指标均取得了较优的结果,生成的结尾具有更强的多样性和关联性。展开更多
基金supported by the National Natural Science Foundation of China(62371049)。
文摘In engineering application,there is only one adaptive weights estimated by most of traditional early warning radars for adaptive interference suppression in a pulse reputation interval(PRI).Therefore,if the training samples used to calculate the weight vector does not contain the jamming,then the jamming cannot be removed by adaptive spatial filtering.If the weight vector is constantly updated in the range dimension,the training data may contain target echo signals,resulting in signal cancellation effect.To cope with the situation that the training samples are contaminated by target signal,an iterative training sample selection method based on non-homogeneous detector(NHD)is proposed in this paper for updating the weight vector in entire range dimension.The principle is presented,and the validity is proven by simulation results.
基金supported by the National Natural Science Foundation of China(61732018,61872335,61802367,61876215)the Strategic Priority Research Program of Chinese Academy of Sciences(XDC05000000)+1 种基金Beijing Academy of Artificial Intelligence(BAAI),the Open Project Program of the State Key Laboratory of Mathematical Engineering and Advanced Computing(2019A07)the Open Project of Zhejiang Laboratory,and a grant from the Institute for Guo Qiang,Tsinghua University.Recommended by Associate Editor Long Chen.
文摘Graph convolutional networks(GCNs)have received significant attention from various research fields due to the excellent performance in learning graph representations.Although GCN performs well compared with other methods,it still faces challenges.Training a GCN model for large-scale graphs in a conventional way requires high computation and storage costs.Therefore,motivated by an urgent need in terms of efficiency and scalability in training GCN,sampling methods have been proposed and achieved a significant effect.In this paper,we categorize sampling methods based on the sampling mechanisms and provide a comprehensive survey of sampling methods for efficient training of GCN.To highlight the characteristics and differences of sampling methods,we present a detailed comparison within each category and further give an overall comparative analysis for the sampling methods in all categories.Finally,we discuss some challenges and future research directions of the sampling methods.
基金supported by National Natural Science Foundation of China(U2268206,T2222015)Beijing Natural Science Foundation(4232031)+1 种基金Key Fields Project of DEGP(2021ZDZX1110)Shenzhen Science and Technology Program(CJGJZD20220517141801004).
文摘In view of class imbalance in data-driven modeling for Prognostics and Health Management(PHM),existing classification methods may fail in generating effective fault prediction models for the on-board high-speed train control equipment.A virtual sample generation solution based on Generative Adversarial Network(GAN)is proposed to overcome this shortcoming.Aiming at augmenting the sample classes with the imbalanced data problem,the GAN-based virtual sample generation strategy is embedded into the establishment of fault prediction models.Under the PHM framework of the on-board train control system,the virtual sample generation principle and the detailed procedures are presented.With the enhanced class-balancing mechanism and the designed sample augmentation logic,the PHM scheme of the on-board train control equipment has powerful data condition adaptability and can effectively predict the fault probability and life cycle status.Practical data from a specific type of on-board train control system is employed for the validation of the presented solution.The comparative results indicate that GAN-based sample augmentation is capable of achieving a desirable sample balancing level and enhancing the performance of correspondingly derived fault prediction models for the Condition-based Maintenance(CBM)operations.
基金supported by National Natural Science Foundation of China (No. 51674032)
文摘The accuracy of laser-induced breakdown spectroscopy(LIBS) quantitative method is greatly dependent on the amount of certified standard samples used for training. However, in practical applications, only limited standard samples with labeled certified concentrations are available. A novel semi-supervised LIBS quantitative analysis method is proposed, based on co-training regression model with selection of effective unlabeled samples. The main idea of the proposed method is to obtain better regression performance by adding effective unlabeled samples in semisupervised learning. First, effective unlabeled samples are selected according to the testing samples by Euclidean metric. Two original regression models based on least squares support vector machine with different parameters are trained by the labeled samples separately, and then the effective unlabeled samples predicted by the two models are used to enlarge the training dataset based on labeling confidence estimation. The final predictions of the proposed method on the testing samples will be determined by weighted combinations of the predictions of two updated regression models. Chromium concentration analysis experiments of 23 certified standard high-alloy steel samples were carried out, in which 5 samples with labeled concentrations and 11 unlabeled samples were used to train the regression models and the remaining 7 samples were used for testing. With the numbers of effective unlabeled samples increasing, the root mean square error of the proposed method went down from 1.80% to 0.84% and the relative prediction error was reduced from 9.15% to 4.04%.
基金supported by the Sichuan Science and Technology Program(2023JDRC0062)National Natural Science Foundation of China(12172308)Project of State Key Laboratory of Traction Power(2023TPL-T05).
文摘The aerodynamic optimization design of high-speed trains(HSTs)is crucial for energy conservation,environmental preservation,operational safety,and speeding up.This study aims to review the current state and progress of the aerodynamic multi-objective optimization of HSTs.First,the study explores the impact of train nose shape parameters on aerodynamic performance.The parameterization methods involved in the aerodynamic multiobjective optimization ofHSTs are summarized and classified as shape-based and disturbance-based parameterizationmethods.Meanwhile,the advantages and limitations of each parameterizationmethod,aswell as the applicable scope,are briefly discussed.In addition,the NSGA-II algorithm,particle swarm optimization algorithm,standard genetic algorithm,and other commonly used multi-objective optimization algorithms and the improvements in the field of aerodynamic optimization for HSTs are summarized.Second,this study investigates the aerodynamic multi-objective optimization technology for HSTs using the surrogate model,focusing on the Kriging surrogate models,neural network,and support vector regression.Moreover,the construction methods of surrogate models are summarized,and the influence of different sample infill criteria on the efficiency ofmulti-objective optimization is analyzed.Meanwhile,advanced aerodynamic optimization methods in the field of aircraft have been briefly introduced to guide research on the aerodynamic optimization of HSTs.Finally,based on the summary of the research progress of the aerodynamicmulti-objective optimization ofHSTs,future research directions are proposed,such as intelligent recognition technology of characteristic parameters,collaborative optimization of multiple operating environments,and sample infill criterion of the surrogate model.
文摘The fundamental step to get a Statistical Shape Model (SSM) is to align all the training samples to the same spatial modality. In this paper, we propose a new 3D alignment method for organic training samples matching, whose modalities are orientable and surface figures could be recognized. It is a feature based alignment method which matches two models depending on the distribution of surface curvature. According to the affine transformation on 2D Gaussian map, the distances between the corresponding parts on surface could be minimized. We applied our proposed method on 5 cases left lung training samples alignment and 4 cases liver training samples alignment. The experiment results were performed on the left lung training samples and the liver training samples. The availability of proposed method was confirmed.
基金Supported by the Ministerial Level Research Foundation(404040401)
文摘Training neural network to recognize targets needs a lot of samples.People usually get these samples in a non-systematic way,which can miss or overemphasize some target information.To improve this situation,a new method based on virtual model and invariant moments was proposed to generate training samples.The method was composed of the following steps:use computer and simulation software to build target object's virtual model and then simulate the environment,light condition,camera parameter,etc.;rotate the model by spin and nutation of inclination to get the image sequence by virtual camera;preprocess each image and transfer them into binary image;calculate the invariant moments for each image and get a vectors' sequence.The vectors' sequence which was proved to be complete became the training samples together with the target outputs.The simulated results showed that the proposed method could be used to recognize the real targets and improve the accuracy of target recognition effectively when the sampling interval was short enough and the circumstance simulation was close enough.
文摘Based on the good localization characteristic of the wavelet transform both in time and frequency domain, a de-noising method based on wavelet transform is presented, which can make the extraction of visual evoked potentials in single training sample from the EEG background noise in favor of studying the changes between the single sample response happen. The information is probably related with the different function, appearance and pathologies of the brain. At the same time this method can also be used to remove those signal’s artifacts that do not appear with EP within the same scope of time or frequency. The traditional Fourier filter can hardly attain the similar result. This method is different from other wavelet de-noising methods in which different criteria are employed in choosing wavelet coefficient. It has a biggest virtue of noting the differences among the single training sample and making use of the characteristics of high time frequency resolution to reduce the effect of interference factors to a maximum extent within the time scope that EP appear. The experiment result proves that this method is not restricted by the signal-to-noise ratio of evoked potential and electroencephalograph (EEG) and even can recognize instantaneous event under the condition of lower signal-to-noise ratio, as well as recognize the samples which evoked evident response more easily. Therefore, more evident average evoked response could be achieved by de-nosing the signals obtained through averaging out the samples that can evoke evident responses than de-nosing the average of original signals. In addition, averaging methodology can dramatically reduce the number of record samples needed, thus avoiding the effect of behavior change during the recording process. This methodology pays attention to the differences among single training sample and also accomplishes the extraction of visual evoked potentials from single trainings sample. As a result, system speed and accuracy could be improved to a great extent if this methodology is applied to brain-computer interface system based on evoked responses.
基金This project supported by The National Natural Science Foundation of China(No.11872253).
文摘A current identification method based on optimized variational mode decomposition(VMD)and sample entropy(SampEn)is proposed in order to solve the problem that the main protection of the urban rail transit DC feeder cannot distinguish between train charging current and remote short circuit current.This method uses the principle of energy difference to optimize the optimal mode decomposition number k of VMD;the optimal VMD for DC feeder current is decomposed into the intrinsic modal function(IMF)of different frequency bands.The sample entropy algorithm is used to perform feature extraction of each IMF,and then the eigenvalues of the intrinsic modal function of each frequency band of the current signal can be obtained.The recognition feature vector is input into the support vector machine model based on Bayesian hyperparameter optimization for training.After a large number of experimental data are verified,it is found that the optimal VMD_SampEn algorithm to identify the train charging current and remote short circuit current is more accurate than other algorithms.Thus,the algorithm based on optimized VMD_SampEn has certain engineering application value in the fault current identification of the DC traction feeder.
文摘中文故事结尾生成(SEG)是自然语言处理中的下游任务之一。基于全错误结尾的CLSEG(Contrastive Learning of Story Ending Generation)在故事的一致性方面表现较好。然而,由于错误结尾中也包含与原结尾文本相同的内容,仅使用错误结尾的对比训练会导致生成文本中原结尾正确的主要部分被剥离。因此,在CLSEG基础上增加正向结尾增强训练,以保留对比训练中损失的正确部分;同时,通过正向结尾的引入,使生成的结尾具有更强的多样性和关联性。基于双向对比训练的中文故事结尾生成模型包含两个主要部分:1)多结尾采样,通过不同的模型方法获取正向增强的结尾和反向对比的错误结尾;2)对比训练,在训练过程中修改损失函数,使生成的结尾接近正向结尾,远离错误结尾。在公开的故事数据集OutGen上的实验结果表明,相较于GPT2. ft和深层逐层隐变量融合(Della)等模型,所提模型的BERTScore、METEOR等指标均取得了较优的结果,生成的结尾具有更强的多样性和关联性。