Near-infrared(NIR)spectral analysis,which has the advantages of rapidness,nondestruction and high-efficiency,is widely used in the detection of feed,food and mineral.In terms of qualitative identification,it can also ...Near-infrared(NIR)spectral analysis,which has the advantages of rapidness,nondestruction and high-efficiency,is widely used in the detection of feed,food and mineral.In terms of qualitative identification,it can also be used for the discriminant analysis of medicines.Long short-term memory(LSTM)neural network,bidirectional long short-term memory(BiLSTM)neural network and gated recurrent unit(GRU)network are variants of the recurrent neural network(RNN).The potential relationship between nonlinear features learned from the sequence by these variants is used to complete the missions infields such as natural language processing,signal classification and video analysis.Since the effect of these variants in drug identification is still to be studied,this paper constructs a multiclassifier of these three variants,using compoundα-keto acid tablets produced by four manufacturers and repaglinide tablets produced by five manufacturers as the research object.Then,the paper analyzes the impacts of seven different preprocessed methods on the drug NIR data by constructing different layers of LSTM,BiLSTM and GRU networks and compares different classification model indicators and training time of each model.When the spectrum data are pre-processed by z-score normalization,the GRU-3 model has the best accuracy in all models.The BiLSTM models are better for analyzing high coincidence data.The method proposed in this paper can be further extended to other NIR spectroscopy data sets.展开更多
Predicting travel trajectory of vehicles can not only provide personalized services to users,but also have a certain effect on traffic guidance and traffic control.In this paper,we build a Bayonet-Corpus based on the ...Predicting travel trajectory of vehicles can not only provide personalized services to users,but also have a certain effect on traffic guidance and traffic control.In this paper,we build a Bayonet-Corpus based on the context of traffic intersections,and use it to model a traffic network.Besides,Bidirectional Gated Recurrent Unit(Bi-GRU)is used to predict the sequence of traffic intersections in one single trajectory.Firstly,considering that real traffic networks are usually complex and disorder and cannot reflect the higher dimensional relationship among traffic intersections,this paper proposes a new traffic network modeling algorithm based on the context of traffic intersections:inspired by the probabilistic language model,a Bayonet-Corpus is constructed from traffic intersections in real trajectory sequence,so the high-dimensional similarity between corpus nodes can be used to measure the semantic relation of real traffic intersections.This algorithm maps vehicle trajectory nodes into a high-dimensional space vector,blocking complex structure of real traffic network and reconstructing the traffic network space.Then,the bayonets sequence in real traffic network is mapped into a matrix.Considering the trajectories sequence is bidirectional,and Bi-GRU can handle information from forward and backward simultaneously,we use Bi-GRU to bidirectionally model the trajectory matrix for the purpose of prediction.展开更多
针对高压直流(high voltage direct current,HVDC)输电线路故障暂态行波具有时序性和强非线性的特点,导致高过渡电阻情况下故障识别率低的问题,提出基于卷积神经网络(convolutional neural networks,CNN)和双向循环门单元(bidirectional...针对高压直流(high voltage direct current,HVDC)输电线路故障暂态行波具有时序性和强非线性的特点,导致高过渡电阻情况下故障识别率低的问题,提出基于卷积神经网络(convolutional neural networks,CNN)和双向循环门单元(bidirectional gate recurrent unit,BiGRU)的HVDC输电线路故障识别方法。首先,采用故障后整流侧的双极暂态电流行波作为特征向量,利用CNN提取全局特征,并从中剔除噪声和不稳定成分,完成对数据的降维处理。然后,采用BiGRU来捕获CNN提取到特征的前后时间信息,进一步提取数据中的时序特征,以实现HVDC输电线路故障识别。仿真结果表明:该方法可在不同故障地点以及不同过渡电阻下对单极接地、双极短路、雷击故障、雷击干扰共四种故障实现准确识别,可靠性高,具有较强的耐受过渡电阻能力,同时具备一定的抗噪性能。展开更多
现有用户画像方法缺乏不同粒度文本信息表示,且特征提取阶段存在噪声,导致构建画像不够准确。针对以上问题,提出一种融合多粒度信息的用户画像生成方法(user profile based on multi-granularity information fusion,UP-MGIF)。首先,该...现有用户画像方法缺乏不同粒度文本信息表示,且特征提取阶段存在噪声,导致构建画像不够准确。针对以上问题,提出一种融合多粒度信息的用户画像生成方法(user profile based on multi-granularity information fusion,UP-MGIF)。首先,该方法在嵌入层融合字粒度、词粒度表示向量以扩充特征内容;其次,在改进双向门控循环单元网络基础上,结合降噪自编码器和注意力机制设计一种特征提取混合模型Bi-GRU-DAE-Attention,实现特征降噪和语义增强;最后,将鲁棒性强的特征向量输入到分类器中实现用户画像生成。实验表明,该用户画像生成方法在医疗和互联网两个画像数据集上的分类准确率高于其他基线方法,并通过消融实验验证了各个模块的有效性。展开更多
基金This research was supported by the Science and Technology Planning Project of Guangdong Province(Grant Nos.2017B020221002,2018B020207008 and 2021B1111610005)Science and Technology Planning Project of Guangzhou,Grant No.201707010410。
文摘Near-infrared(NIR)spectral analysis,which has the advantages of rapidness,nondestruction and high-efficiency,is widely used in the detection of feed,food and mineral.In terms of qualitative identification,it can also be used for the discriminant analysis of medicines.Long short-term memory(LSTM)neural network,bidirectional long short-term memory(BiLSTM)neural network and gated recurrent unit(GRU)network are variants of the recurrent neural network(RNN).The potential relationship between nonlinear features learned from the sequence by these variants is used to complete the missions infields such as natural language processing,signal classification and video analysis.Since the effect of these variants in drug identification is still to be studied,this paper constructs a multiclassifier of these three variants,using compoundα-keto acid tablets produced by four manufacturers and repaglinide tablets produced by five manufacturers as the research object.Then,the paper analyzes the impacts of seven different preprocessed methods on the drug NIR data by constructing different layers of LSTM,BiLSTM and GRU networks and compares different classification model indicators and training time of each model.When the spectrum data are pre-processed by z-score normalization,the GRU-3 model has the best accuracy in all models.The BiLSTM models are better for analyzing high coincidence data.The method proposed in this paper can be further extended to other NIR spectroscopy data sets.
基金This research is partially supported by the National Natural Science Foundation of China(Grant No.61772098)Science and Technology Research Program of Chongqing Municipal Education Commission(Grant No.KJZD K201900603,KJQN201900629)Chongqing Grad-uate Education Teaching Reform Project(No.yjg183081).
文摘Predicting travel trajectory of vehicles can not only provide personalized services to users,but also have a certain effect on traffic guidance and traffic control.In this paper,we build a Bayonet-Corpus based on the context of traffic intersections,and use it to model a traffic network.Besides,Bidirectional Gated Recurrent Unit(Bi-GRU)is used to predict the sequence of traffic intersections in one single trajectory.Firstly,considering that real traffic networks are usually complex and disorder and cannot reflect the higher dimensional relationship among traffic intersections,this paper proposes a new traffic network modeling algorithm based on the context of traffic intersections:inspired by the probabilistic language model,a Bayonet-Corpus is constructed from traffic intersections in real trajectory sequence,so the high-dimensional similarity between corpus nodes can be used to measure the semantic relation of real traffic intersections.This algorithm maps vehicle trajectory nodes into a high-dimensional space vector,blocking complex structure of real traffic network and reconstructing the traffic network space.Then,the bayonets sequence in real traffic network is mapped into a matrix.Considering the trajectories sequence is bidirectional,and Bi-GRU can handle information from forward and backward simultaneously,we use Bi-GRU to bidirectionally model the trajectory matrix for the purpose of prediction.
文摘针对高压直流(high voltage direct current,HVDC)输电线路故障暂态行波具有时序性和强非线性的特点,导致高过渡电阻情况下故障识别率低的问题,提出基于卷积神经网络(convolutional neural networks,CNN)和双向循环门单元(bidirectional gate recurrent unit,BiGRU)的HVDC输电线路故障识别方法。首先,采用故障后整流侧的双极暂态电流行波作为特征向量,利用CNN提取全局特征,并从中剔除噪声和不稳定成分,完成对数据的降维处理。然后,采用BiGRU来捕获CNN提取到特征的前后时间信息,进一步提取数据中的时序特征,以实现HVDC输电线路故障识别。仿真结果表明:该方法可在不同故障地点以及不同过渡电阻下对单极接地、双极短路、雷击故障、雷击干扰共四种故障实现准确识别,可靠性高,具有较强的耐受过渡电阻能力,同时具备一定的抗噪性能。
文摘现有用户画像方法缺乏不同粒度文本信息表示,且特征提取阶段存在噪声,导致构建画像不够准确。针对以上问题,提出一种融合多粒度信息的用户画像生成方法(user profile based on multi-granularity information fusion,UP-MGIF)。首先,该方法在嵌入层融合字粒度、词粒度表示向量以扩充特征内容;其次,在改进双向门控循环单元网络基础上,结合降噪自编码器和注意力机制设计一种特征提取混合模型Bi-GRU-DAE-Attention,实现特征降噪和语义增强;最后,将鲁棒性强的特征向量输入到分类器中实现用户画像生成。实验表明,该用户画像生成方法在医疗和互联网两个画像数据集上的分类准确率高于其他基线方法,并通过消融实验验证了各个模块的有效性。