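Abstract: As one of the most wear-prone core basic components of rotating machinery, bearings are a key monitoring target in mechanical equipment. To address the one-sided mining and low utilization of data information in existing intelligent bearing fault diagnosis models, a deep learning network model is constructed that fuses a bidirectional long short-term memory (BLSTM) structure with a multi-scale convolutional structure. To strengthen the model's classification performance and bring it closer to real engineering conditions, the data set contains unequal amounts of data for the different fault classes. The data set is first passed through the BLSTM structure to obtain symmetric data features, which reduces confusion in the model's memory of preceding and following fault information and improves information utilization. The multi-scale convolutional structure then interprets and exchanges the data features from multiple perspectives, which prevents one-sided feature extraction and also improves the model's noise robustness. Finally, a fully connected network performs the intelligent classification. The proposed model is applied to fault data from deep groove ball bearings and cylindrical roller bearings, and the results show that it achieves high accuracy and practicality.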
Funding: The National Natural Science Foundation of China (No. 61502095).
Abstract: To address relation linking for question answering over knowledge bases, especially multi-relation linking for complex questions, a relation linking approach based on a multi-attention recurrent neural network (RNN) model is proposed that works for both simple and complex questions. First, vector representations of questions are learned by a bidirectional long short-term memory (Bi-LSTM) model at the word and character levels, and named entities in questions are labeled by a conditional random field (CRF) model. Candidate entities are generated from a dictionary, disambiguated with predefined rules, and the named entities mentioned in questions are linked to entities in the knowledge base. Next, questions are classified as simple or complex by a machine learning method. Starting from the identified entities, one-hop relations are collected from the knowledge base as candidate relations for simple questions, and two-hop relations are collected as candidates for complex questions. Finally, the multi-attention Bi-LSTM model encodes the questions and candidate relations, compares their similarity, and returns the candidate relation with the highest similarity as the relation linking result. A Bi-LSTM model with one attention is used for simple questions, and a Bi-LSTM model with two attentions is used for complex questions. The experimental results show that, given effective entity linking, the Bi-LSTM model with the attention mechanism improves relation linking for both simple and complex questions and outperforms existing relation linking methods based on graph algorithms or linguistic understanding.
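The candidate-collection step in the abstract above (one-hop relations for simple questions, two-hop relations for complex ones) can be illustrated with a minimal sketch. The toy triples, the entity names, and the function names `build_index`, `one_hop_relations`, and `two_hop_relations` are hypothetical and are not taken from the paper; the sketch only shows how candidates could be gathered starting from an already linked entity, and it leaves out the Bi-LSTM similarity ranking entirely.

```python
from collections import defaultdict

# Hypothetical toy knowledge base of (subject, relation, object) triples.
TRIPLES = [
    ("Alice", "born_in", "Paris"),
    ("Alice", "works_for", "Acme"),
    ("Paris", "capital_of", "France"),
    ("Acme", "headquartered_in", "Berlin"),
]


def build_index(triples):
    """Map each subject to its outgoing (relation, object) edges."""
    index = defaultdict(list)
    for subj, rel, obj in triples:
        index[subj].append((rel, obj))
    return index


def one_hop_relations(index, entity):
    """Candidates for a simple question: relations leaving the entity."""
    return sorted({rel for rel, _ in index.get(entity, [])})


def two_hop_relations(index, entity):
    """Candidates for a complex question: relation paths (r1, r2)."""
    paths = set()
    for rel1, middle in index.get(entity, []):
        for rel2, _ in index.get(middle, []):
            paths.add((rel1, rel2))
    return sorted(paths)


index = build_index(TRIPLES)
simple_candidates = one_hop_relations(index, "Alice")
complex_candidates = two_hop_relations(index, "Alice")
```

In a full pipeline, each candidate would then be scored against the encoded question and the highest-scoring one returned.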
Funding: Supported by the National Natural Science Foundation of China (No. 61876205), the National Key Research and Development Program of China (No. 2020YFB1005804), and the MOE Project at Center for Linguistics and Applied Linguistics, Guangdong University of Foreign Studies.
Abstract: Emotion classification in textual conversations assigns an emotion to each utterance in a conversation, and it has become one of the most important natural language processing tasks in recent years. The task is challenging for machines because emotions rely heavily on textual context. To address this challenge, we propose DBL, a method for classifying emotion in textual conversations that integrates the advantages of deep learning and broad learning. It aims to capture both local contextual information (i.e., utterance-level) within an utterance and global contextual information (i.e., speaker-level) within a conversation more effectively, based on a Convolutional Neural Network (CNN), Bidirectional Long Short-Term Memory (Bi-LSTM), and broad learning. Extensive experiments on three public textual conversation datasets show that context at both the utterance level and the speaker level consistently benefits emotion classification performance. In addition, the results show that the proposed method outperforms the baseline methods on most of the test datasets in weighted-average F1.
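For readers unfamiliar with the evaluation metric named above, the following is a minimal sketch of weighted-average F1: the per-class F1 scores are averaged with weights proportional to each class's share of the gold labels. The function name `weighted_f1` and the example labels are hypothetical, and the abstract does not state which implementation the authors used.

```python
from collections import Counter


def weighted_f1(y_true, y_pred):
    """Average per-class F1, weighting each class by its support in y_true."""
    support = Counter(y_true)
    total = len(y_true)
    score = 0.0
    for label, count in support.items():
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == label and p == label)
        fp = sum(1 for t, p in pairs if t != label and p == label)
        fn = count - tp
        denom = 2 * tp + fp + fn
        f1 = 2 * tp / denom if denom else 0.0
        score += (count / total) * f1
    return score


# Hypothetical utterance-level emotion labels, for illustration only.
gold = ["happy", "happy", "sad", "angry"]
pred = ["happy", "sad", "sad", "angry"]
score = weighted_f1(gold, pred)
```

Weighting by support keeps frequent emotion classes from being under-represented in the final score, which matters when the label distribution is imbalanced.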
Abstract: Reliable and accurate ultra-short-term prediction of wind power is vital for the operation and optimization of power systems. However, the volatility and intermittency of wind power introduce uncertainty into traditional point prediction, which increases the risk of power system operation. To represent the uncertainty of wind power, this paper proposes a new method for ultra-short-term interval prediction of wind power based on a graph neural network (GNN) and an improved Bootstrap technique. Specifically, adjacent wind farms and local meteorological factors are modeled as a graph from a graph-theoretic perspective. A graph convolutional network (GCN) and bi-directional long short-term memory (Bi-LSTM) are then used to capture spatiotemporal features between nodes in the graph. To obtain high-quality prediction intervals (PIs), an improved Bootstrap technique is designed to increase the coverage percentage and narrow the PIs effectively. Numerical simulations demonstrate that the proposed method captures the spatiotemporal correlations in the graph and that its predictions outperform popular baselines on two real-world datasets, which implies a high potential for practical applications in power systems.
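The two interval-quality goals named in the abstract above, higher coverage and narrower PIs, can be made concrete with a small sketch. The function name `coverage_and_width` and the sample values are hypothetical; the abstract does not say exactly how the authors compute these quantities, and the improved Bootstrap technique itself is not reproduced here.

```python
def coverage_and_width(lower, upper, observed):
    """Return (coverage fraction, mean interval width) for prediction intervals."""
    if not observed or not (len(lower) == len(upper) == len(observed)):
        raise ValueError("inputs must be non-empty and of equal length")
    covered = sum(
        1 for lo, hi, y in zip(lower, upper, observed) if lo <= y <= hi
    )
    mean_width = sum(hi - lo for lo, hi in zip(lower, upper)) / len(observed)
    return covered / len(observed), mean_width


# Hypothetical wind power values and interval bounds, for illustration only.
lower = [1.0, 2.0, 3.0, 4.0]
upper = [2.0, 3.0, 4.0, 5.0]
observed = [1.5, 3.5, 3.2, 4.9]
coverage, width = coverage_and_width(lower, upper, observed)
```

The two numbers pull in opposite directions: widening every interval raises coverage trivially, so a method is only better if it improves one without giving up the other.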
Abstract: Accurate pedestrian trajectory predictions are critical in self-driving systems, as they are fundamental to the response and decision-making of ego vehicles. In this study, we focus on predicting the future trajectory of pedestrians from a first-person perspective. Most existing first-person trajectory prediction methods copy the bird's-eye view approach and neglect the differences between the two views. We therefore clarify these differences and highlight the importance of action-aware trajectory prediction in the first-person view. We propose a new action-aware network based on an encoder-decoder framework, with an action prediction branch and a goal estimation branch at the end of the encoder. In the decoder, bidirectional long short-term memory (Bi-LSTM) blocks generate the final prediction of pedestrians' future trajectories. Our method was evaluated on a public dataset and achieved competitive performance compared with other approaches. An ablation study demonstrates the effectiveness of the action prediction branch.
Funding: This work was supported by the National Key R&D Program of China (No. 2020YFC0832500), the Ministry of Education-China Mobile Research Foundation (No. MCM20170206), the Fundamental Research Funds for the Central Universities (Nos. lzujbky-2022-kb12, lzujbky-2021-sp43, lzujbky-2020-sp02, lzujbky-2019-kb51, and lzujbky-2018-k12), the National Natural Science Foundation of China (No. 61402210), the Science and Technology Plan of Qinghai Province (No. 2020-GX-164), the Google Research Awards and Google Faculty Award, the Provincial Science and Technology Plan (Major Science and Technology Projects-Open Solicitation) (No. 22ZD6GA048), the Gansu Provincial Science and Technology Major Special Innovation Consortium Project (No. 21ZD3GA002), and the Gansu Province Green and Smart Highway Key Technology Research and Demonstration.
Abstract: Sentence Boundary Disambiguation (SBD) is a preprocessing step for natural language processing. Segmenting text into sentences is essential for Deep Learning (DL) and for pretraining language models. Tibetan punctuation marks can be ambiguous about where sentences begin and end. The ambiguous punctuation marks must therefore be distinguished, and the sentence structure must be correctly encoded in language models. This study proposes a component-level Tibetan SBD approach based on DL models. The models reduce the error amplification caused by word segmentation and part-of-speech tagging. Whereas most SBD methods consider only the text on the left side of a punctuation mark, this study considers the text on both sides. A total of 465,669 Tibetan sentences are used, and a Bidirectional Long Short-Term Memory (Bi-LSTM) model performs SBD. The experimental results show that the F1-score of the Bi-LSTM model reached 96%, the best result among the six models. Experiments are also performed on low-resource languages such as Turkish and Romanian and on high-resource languages such as English and German to verify the models' generalization.
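The idea of looking at text on both sides of a punctuation mark can be sketched as a small feature-extraction step. Everything in the sketch is an assumption for illustration: the function name `boundary_contexts`, the window size, the placeholder units standing in for Tibetan components, and the choice of the shad mark (U+0F0D) as the candidate boundary. The abstract does not describe the authors' actual input encoding.

```python
SHAD = "\u0f0d"  # Tibetan mark shad, used here as the assumed candidate mark


def boundary_contexts(units, marks, window=2):
    """For every candidate mark, collect `window` units to its left and right."""
    examples = []
    for i, unit in enumerate(units):
        if unit in marks:
            left = units[max(0, i - window):i]
            right = units[i + 1:i + 1 + window]
            examples.append((left, unit, right))
    return examples


# Placeholder units stand in for Tibetan components, for illustration only.
units = ["a", "b", SHAD, "c", "d", "e", SHAD]
examples = boundary_contexts(units, marks={SHAD}, window=2)
```

Each `(left, mark, right)` triple could then be fed to a classifier that decides whether the mark really ends a sentence; a left-only method would discard the `right` part.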
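Abstract: Bidirectional long short-term memory (BLSTM) is a special kind of recurrent neural network (RNN) that can effectively model the long-range context of speech. This paper proposes a speech-driven facial animation synthesis method based on deep BLSTM. The audio-visual bimodal information of a speaker is used to train a BLSTM-RNN network, an active appearance model (AAM) is used to model the face images, and the AAM parameters serve as the network output. The effects of network structure and of different speech feature inputs on animation synthesis quality are studied. Experimental results on the LIPS2008 standard evaluation corpus show that networks with BLSTM layers clearly outperform feed-forward networks, that the three-layer BLSTM-feed-forward-BLSTM structure with 256 nodes (BFB256) performs best, and that combining FBank, fundamental frequency, and energy features further improves the synthesis quality.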