期刊文献+
共找到4篇文章
< 1 >
每页显示 20 50 100
DPAL-BERT:A Faster and Lighter Question Answering Model
1
作者 Lirong Yin Lei Wang +8 位作者 Zhuohang Cai Siyu Lu Ruiyang Wang Ahmed AlSanad Salman A.AlQahtani Xiaobing Chen Zhengtong Yin Xiaolu Li Wenfeng Zheng 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第10期771-786,共16页
Recent advancements in natural language processing have given rise to numerous pre-training language models in question-answering systems.However,with the constant evolution of algorithms,data,and computing power,the ... Recent advancements in natural language processing have given rise to numerous pre-training language models in question-answering systems.However,with the constant evolution of algorithms,data,and computing power,the increasing size and complexity of these models have led to increased training costs and reduced efficiency.This study aims to minimize the inference time of such models while maintaining computational performance.It also proposes a novel Distillation model for PAL-BERT(DPAL-BERT),specifically,employs knowledge distillation,using the PAL-BERT model as the teacher model to train two student models:DPAL-BERT-Bi and DPAL-BERTC.This research enhances the dataset through techniques such as masking,replacement,and n-gram sampling to optimize knowledge transfer.The experimental results showed that the distilled models greatly outperform models trained from scratch.In addition,although the distilled models exhibit a slight decrease in performance compared to PAL-BERT,they significantly reduce inference time to just 0.25%of the original.This demonstrates the effectiveness of the proposed approach in balancing model performance and efficiency. 展开更多
关键词 DPAL-BERT question answering systems knowledge distillation model compression BERT bi-directional long short-term memory(bilstm) knowledge information transfer PAL-BERT training efficiency natural language processing
下载PDF
Practical Options for Adopting Recurrent Neural Network and Its Variants on Remaining Useful Life Prediction 被引量:2
2
作者 Youdao Wang Yifan Zhao Sri Addepalli 《Chinese Journal of Mechanical Engineering》 SCIE EI CAS CSCD 2021年第3期32-51,共20页
The remaining useful life(RUL)of a system is generally predicted by utilising the data collected from the sensors that continuously monitor different indicators.Recently,different deep learning(DL)techniques have been... The remaining useful life(RUL)of a system is generally predicted by utilising the data collected from the sensors that continuously monitor different indicators.Recently,different deep learning(DL)techniques have been used for RUL prediction and achieved great success.Because the data is often time-sequential,recurrent neural network(RNN)has attracted significant interests due to its efficiency in dealing with such data.This paper systematically reviews RNN and its variants for RUL prediction,with a specific focus on understanding how different components(e.g.,types of optimisers and activation functions)or parameters(e.g.,sequence length,neuron quantities)affect their performance.After that,a case study using the well-studied NASA’s C-MAPSS dataset is presented to quantitatively evaluate the influence of various state-of-the-art RNN structures on the RUL prediction performance.The result suggests that the variant methods usually perform better than the original RNN,and among which,Bi-directional Long Short-Term Memory generally has the best performance in terms of stability,precision and accuracy.Certain model structures may fail to produce valid RUL prediction result due to the gradient vanishing or gradient exploring problem if the parameters are not chosen appropriately.It is concluded that parameter tuning is a crucial step to achieve optimal prediction performance. 展开更多
关键词 Remaining useful life prediction Deep learning Recurrent neural network long short-term memory bi-directional long short-term memory Gated recurrent unit
下载PDF
PRI modulation recognition and sequence search under small sample prerequisite 被引量:2
3
作者 ZHANG Chunjie LIU Yuchen SI Weijian 《Journal of Systems Engineering and Electronics》 SCIE EI CSCD 2023年第3期706-713,共8页
Pulse repetition interval(PRI)modulation recognition and pulse sequence search are significant for effective electronic support measures.In modern electromagnetic environments,different types of inter-pulse slide rada... Pulse repetition interval(PRI)modulation recognition and pulse sequence search are significant for effective electronic support measures.In modern electromagnetic environments,different types of inter-pulse slide radars are highly confusing.There are few available training samples in practical situations,which leads to a low recognition accuracy and poor search effect of the pulse sequence.In this paper,an approach based on bi-directional long short-term memory(BiLSTM)networks and the temporal correlation algorithm for PRI modulation recognition and sequence search under the small sample prerequisite is proposed.The simulation results demonstrate that the proposed algorithm can recognize unilinear,bilinear,sawtooth,and sinusoidal PRI modulation types with 91.43% accuracy and complete the pulse sequence search with 30% missing pulses and 50% spurious pulses under the small sample prerequisite. 展开更多
关键词 inter-pulse slide pulse repetition interval(PRI)modulation type bi-directional long short-term memory(bilstm)network sequence search
下载PDF
Detection and Defense Method Against False Data Injection Attacks for Distributed Load Frequency Control System in Microgrid
4
作者 Zhixun Zhang Jianqiang Hu +3 位作者 Jianquan Lu Jie Yu Jinde Cao Ardak Kashkynbayev 《Journal of Modern Power Systems and Clean Energy》 SCIE EI CSCD 2024年第3期913-924,共12页
In the realm of microgrid(MG),the distributed load frequency control(LFC)system has proven to be highly susceptible to the negative effects of false data injection attacks(FDIAs).Considering the significant responsibi... In the realm of microgrid(MG),the distributed load frequency control(LFC)system has proven to be highly susceptible to the negative effects of false data injection attacks(FDIAs).Considering the significant responsibility of the distributed LFC system for maintaining frequency stability within the MG,this paper proposes a detection and defense method against unobservable FDIAs in the distributed LFC system.Firstly,the method integrates a bi-directional long short-term memory(Bi LSTM)neural network and an improved whale optimization algorithm(IWOA)into the LFC controller to detect and counteract FDIAs.Secondly,to enable the Bi LSTM neural network to proficiently detect multiple types of FDIAs with utmost precision,the model employs a historical MG dataset comprising the frequency and power variances.Finally,the IWOA is utilized to optimize the proportional-integral-derivative(PID)controller parameters to counteract the negative impacts of FDIAs.The proposed detection and defense method is validated by building the distributed LFC system in Simulink. 展开更多
关键词 MICROGRID load frequency control false data injection attack bi-directional long short-term memory(bilstm)neural network improved whale optimization algorithm(IWOA) detection and defense
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部