Journal literature: 76 articles found
End-to-End Joint Multi-Object Detection and Tracking for Intelligent Transportation Systems
1
Authors: Qing Xu, Xuewu Lin, Mengchi Cai, Yu-ang Guo, Chuang Zhang, Kai Li, Keqiang Li, Jianqiang Wang, Dongpu Cao. Chinese Journal of Mechanical Engineering (SCIE, EI, CAS, CSCD), 2023, No. 5, pp. 280-290 (11 pages)
Environment perception is one of the most critical technologies of intelligent transportation systems (ITS). Motion interaction between multiple vehicles in ITS makes it important to perform multi-object tracking (MOT). However, most existing MOT algorithms follow the tracking-by-detection framework, which separates detection and tracking into two independent segments and limits the global efficiency. Recently, a few algorithms have combined feature extraction into one network; however, the tracking portion continues to rely on data association and requires complex post-processing for life cycle management. Those methods do not combine detection and tracking efficiently. This paper presents a novel network to realize joint multi-object detection and tracking in an end-to-end manner for ITS, named the global correlation network (GCNet). Unlike most object detection methods, GCNet introduces a global correlation layer for regression of the absolute size and coordinates of bounding boxes, instead of offset predictions. The pipeline of detection and tracking in GCNet is conceptually simple and does not require complicated tracking strategies such as non-maximum suppression and data association. GCNet was evaluated on a multi-vehicle tracking dataset, UA-DETRAC, demonstrating promising performance compared to state-of-the-art detectors and trackers.
Keywords: Intelligent transportation systems; Joint detection and tracking; Global correlation network; End-to-end tracking
DTHN: Dual-Transformer Head End-to-End Person Search Network
2
Authors: Cheng Feng, Dezhi Han, Chongqing Chen. Computers, Materials & Continua (SCIE, EI), 2023, No. 10, pp. 245-261 (17 pages)
Person search mainly consists of two subtasks, namely person detection and person re-identification (re-ID). Existing approaches are primarily based on Faster R-CNN and Convolutional Neural Networks (CNNs) (e.g., ResNet). While these structures may detect high-quality bounding boxes, they seem to degrade the performance of re-ID. To address this issue, this paper proposes a Dual-Transformer Head Network (DTHN) for end-to-end person search, which contains two independent Transformer heads: a box head for detecting the bounding box and extracting efficient bounding box features, and a re-ID head for capturing high-quality re-ID features for the re-ID task. Specifically, after the image goes through the ResNet backbone network to extract features, the Region Proposal Network (RPN) proposes possible bounding boxes. The box head then extracts more efficient features within these bounding boxes for detection. Following this, the re-ID head computes the occluded attention of the features in these bounding boxes and distinguishes them from other persons or backgrounds. Extensive experiments on two widely used benchmark datasets, CUHK-SYSU and PRW, achieve state-of-the-art performance levels: 94.9 mAP and 95.3 top-1 scores on the CUHK-SYSU dataset, and 51.6 mAP and 87.6 top-1 scores on the PRW dataset, which demonstrates the advantages of this paper's approach. The efficiency comparison also shows that the method is highly efficient in both time and space.
Keywords: Transformer; occluded attention; end-to-end person search; person detection; person re-ID; Dual-Transformer Head
Attention-based neural network for end-to-end music separation
3
Authors: Jing Wang, Hanyue Liu, Haorong Ying, Chuhan Qiu, Jingxin Li, Muhammad Shahid Anwar. CAAI Transactions on Intelligence Technology (SCIE, EI), 2023, No. 2, pp. 355-363 (9 pages)
The end-to-end separation algorithm with superior performance in the field of speech separation has not been effectively used in music separation. Moreover, since music signals are often dual-channel data with a high sampling rate, how to model long-sequence data and make rational use of the relevant information between channels is also an urgent problem to be solved. To solve these problems, the performance of the end-to-end music separation algorithm is enhanced by improving the network structure. Our main contributions include the following: (1) A more reasonable densely connected U-Net is designed to capture the long-term characteristics of music, such as main melody and tone. (2) On this basis, multi-head attention and a dual-path transformer are introduced in the separation module. Channel attention units are applied recursively on the feature map of each layer of the network, enabling the network to perform long-sequence separation. Experimental results show that after the introduction of channel attention, the performance of the proposed algorithm improves stably compared with the baseline system. On the MUSDB18 dataset, the average score of the separated audio exceeds that of the current best-performing music separation algorithm based on the time-frequency domain (T-F domain).
Keywords: channel attention; densely connected network; end-to-end music separation
An End-to-End Machine Learning Framework for Predicting Common Geriatric Diseases
4
Authors: Jian Guo, Yu Han, Fan Xu, Jiru Deng, Zhe Li. Journal of Beijing Institute of Technology (EI, CAS), 2023, No. 2, pp. 209-218 (10 pages)
Interdisciplinary applications between information technology and geriatrics have been accelerated in recent years by the advancement of artificial intelligence, cloud computing, and 5G technology, among others. Meanwhile, applications developed using these technologies make it possible to predict the risk of age-related diseases early, which can give caregivers time to intervene and reduce the risk, potentially improving the health span of the elderly. However, the popularity of these applications is still limited for several reasons. For example, many older people are unable or unwilling to use mobile applications or devices (e.g., smartphones) because operating them is relatively complex or time-consuming for older people. In this work, we design and implement an end-to-end framework and integrate it with the WeChat platform to make it easily accessible to elders. With this framework, multifactorial geriatric assessment data can be collected. Then, stacked machine learning models are trained to assess and predict the incidence of common diseases in the elderly. Experimental results show that our framework not only provides more accurate prediction (precision: 0.8713, recall: 0.8212) for several common elderly diseases, but also has a very low time cost (28.6 s) within a workflow compared to some existing similar applications.
Keywords: predicting geriatric diseases; machine learning; end-to-end framework
End-to-End Auto-Encoder System for Deep Residual Shrinkage Network for AWGN Channels
5
Authors: Wenhao Zhao, Shengbo Hu. Journal of Computer and Communications, 2023, No. 5, pp. 161-176 (16 pages)
With the rapid development of deep learning methods, the data-driven approach has shown powerful advantages over the model-driven one. In this paper, we propose an end-to-end autoencoder communication system based on Deep Residual Shrinkage Networks (DRSNs), where deep neural networks (DNNs) are used to implement the coding, decoding, modulation and demodulation functions of the communication system. Our proposed autoencoder communication system can better reduce signal noise by adding "attention mechanism" and "soft thresholding" modules, and has better performance at various signal-to-noise ratios (SNRs). Also, we have shown through comparative experiments that the system can operate at moderate block lengths and support different throughputs. It has been shown to work efficiently in the AWGN channel. Simulation results show that our model has a higher Bit-Error-Rate (BER) gain and greatly improved decoding performance compared to conventional modulation and classical autoencoder systems at various signal-to-noise ratios.
Keywords: Deep Residual Shrinkage Network; Autoencoder; End-to-end learning; Communication systems
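The "soft thresholding" operation named in the abstract above can be illustrated with a minimal sketch. It shows only the generic shrinkage function, sign(x) * max(|x| - tau, 0), applied to a list of values. The function name `soft_threshold` and the fixed threshold `tau` are assumptions for illustration; the abstract does not say how the threshold is chosen or where the module sits in the network.

```python
def soft_threshold(values, tau):
    """Shrink each value toward zero by tau.

    Values whose magnitude is at most tau are set to 0.0; larger values
    keep their sign and lose tau from their magnitude.
    tau is a hypothetical fixed threshold chosen for illustration.
    """
    if tau < 0:
        raise ValueError("tau must be non-negative")
    shrunk = []
    for x in values:
        magnitude = abs(x) - tau
        if magnitude <= 0:
            shrunk.append(0.0)
        else:
            shrunk.append(magnitude if x > 0 else -magnitude)
    return shrunk


# Small-magnitude entries (treated here as noise) are zeroed out.
noisy = [0.05, -0.2, 1.5, -3.0]
denoised = soft_threshold(noisy, 0.5)
```

With these inputs the two small entries become zero and the two large ones are pulled toward zero by 0.5, which is the noise-suppression behaviour the abstract attributes to the module.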
Redistribution of nerve strain enables end-to-end repair under tension without inhibiting nerve regeneration (Cited by: 2)
6
Authors: Holly M. Howarth, Turki Alaziz, Brogan Nicolds, Shawn O'Connor, Sameer B. Shah. Neural Regeneration Research (SCIE, CAS, CSCD), 2019, No. 7, pp. 1280-1288 (9 pages)
End-to-end repair under no or low tension leads to improved outcomes for transected nerves with short gaps, compared to repairs with a graft. However, grafts are typically used to enable a tension-free repair for moderate to large gaps, as excessive tension can cause repairs to fail and catastrophically impede recovery. In this study, we tested the hypothesis that unloading the repair interface by redistributing tension away from the site of repair is a safe and feasible strategy for end-to-end repair of larger nerve gaps. Further, we tested the hypothesis that such an approach does not adversely affect structural and functional regeneration. We used a rat sciatic nerve injury model to compare the integrity of repair and several regenerative outcomes following end-to-end repairs of nerve gaps of increasing size. In addition, we proposed the use of a novel implantable device to safely perform end-to-end repair of larger nerve gaps by redistributing tension away from the repair interface. Our data suggest that redistribution of tension away from the site of repair enables safe end-to-end repair of larger gap sizes. In addition, structural and functional measures of regeneration were equal or enhanced in nerves repaired under tension (with or without a tension redistribution device) compared to tension-free repairs. Provided that repair integrity is maintained, end-to-end repairs under tension should be considered a reasonable surgical strategy. All animal experiments were performed under the approval of the Institutional Animal Care and Use Committee of University of California, San Diego (Protocol S11274).
Keywords: tension; biomechanics; strain; end-to-end repair; peripheral nerve; nerve regeneration
Data Augmentation Technology Driven By Image Style Transfer in Self-Driving Car Based on End-to-End Learning (Cited by: 2)
7
Authors: Dongjie Liu, Jin Zhao, Axin Xi, Chao Wang, Xinnian Huang, Kuncheng Lai, Chang Liu. Computer Modeling in Engineering & Sciences (SCIE, EI), 2020, No. 2, pp. 593-617 (25 pages)
With the advent of deep learning, self-driving schemes based on deep learning are becoming more and more popular. Robust perception-action models should learn from data with different scenarios and real behaviors, while current end-to-end model learning is generally limited to training on massive data, innovation of deep network architecture, and learning an in-situ model in a simulation environment. Therefore, we introduce a new image style transfer method into data augmentation and improve the diversity of limited data by changing the texture, contrast ratio and color of the image, which extends the data to scenarios the model has not observed before. Inspired by rapid style transfer and artistic style neural algorithms, we propose an arbitrary style generation network architecture, including a style transfer network, a style learning network, a style loss network and a multivariate Gaussian distribution function. The style embedding vector is randomly sampled from the multivariate Gaussian distribution and linearly interpolated with the embedded vector predicted by the style learning network for the input image, which provides a set of normalization constants for the style transfer network and finally realizes the diversity of the image style. To verify the effectiveness of the method, image classification and simulation experiments were performed separately. Finally, we built a small-sized smart car experiment platform and applied the data augmentation technology driven by image style transfer to an automatic driving experiment for the first time. The experimental results show that: (1) the proposed scheme can improve the prediction accuracy of the end-to-end model and reduce the model's error accumulation; (2) the method based on image style transfer provides a new scheme for data augmentation technology, and also provides a solution to the high cost incurred because many deep models rely heavily on large amounts of labeled data.
Keywords: Deep learning; self-driving; end-to-end learning; style transfer; data augmentation
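The sampling-and-interpolation step described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the Gaussian is taken to have a diagonal covariance (independent dimensions), and the names `sample_style_embedding`, `interpolate_embeddings` and the mixing weight `alpha` are hypothetical; the abstract does not give the covariance structure or the interpolation weight.

```python
import random


def sample_style_embedding(mean, std, rng):
    """Draw one style embedding from a Gaussian with diagonal covariance.

    mean and std are per-dimension lists (assumed, not from the paper).
    """
    return [rng.gauss(m, s) for m, s in zip(mean, std)]


def interpolate_embeddings(sampled, predicted, alpha):
    """Linearly interpolate: alpha * sampled + (1 - alpha) * predicted.

    alpha = 0 keeps the embedding predicted for the input image;
    alpha = 1 uses the randomly sampled style only.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    if len(sampled) != len(predicted):
        raise ValueError("embeddings must have the same length")
    return [alpha * s + (1.0 - alpha) * p for s, p in zip(sampled, predicted)]


rng = random.Random(0)  # seeded so the sketch is repeatable
sampled = sample_style_embedding([0.0, 0.0, 0.0], [1.0, 1.0, 1.0], rng)
predicted = [0.2, -0.1, 0.4]  # stand-in for the style learning network output
mixed = interpolate_embeddings(sampled, predicted, 0.3)
```

Varying `alpha` per training image would give a range of styles between the original appearance and a random one, which is one plausible reading of how the method diversifies limited data.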
Internet end-to-end delay dynamics (Cited by: 2)
8
Authors: Zhu Changhua, Pei Changxing, Li Jiandong, Chen Nan, Yi Yunhui. Journal of Systems Engineering and Electronics (SCIE, EI, CSCD), 2006, No. 3, pp. 685-691 (7 pages)
End-to-end delay is one of the most important characteristics of Internet end-to-end packet dynamics, and can be applied to quality of service (QoS) management, service level agreement (SLA) management, congestion control algorithm development, etc. Nonstationarity and nonlinearity are found by analyzing various delay series measured from different links. It is also found that different types of links have different degrees of self-similarity. By constructing an appropriate network architecture and neural functions, functional networks can be used to model the Internet end-to-end nonlinear delay time series. Furthermore, by using an adaptive parameter learning algorithm, the nonstationarity can also be well modeled. The numerical results show that the provided functional network architecture and adaptive algorithm can precisely characterize the Internet end-to-end delay dynamics.
Keywords: Internet; end-to-end delay; functional network; nonlinear system
End-to-end Delay Analysis for Mixed-criticality WirelessHART Networks (Cited by: 2)
9
Authors: Xi Jin, Jintao Wang, Peng Zeng. IEEE/CAA Journal of Automatica Sinica (SCIE, EI), 2015, No. 3, pp. 282-289 (8 pages)
WirelessHART, as a robust and reliable wireless protocol, has been widely used in industrial wireless sensor-actuator networks. Its real-time performance has been extensively studied, but only for the single-criticality case. Many advanced applications have mixed-criticality communications, where different data flows come with different levels of importance or criticality. Hence, in this paper, we study real-time mixed-criticality communication using the WirelessHART protocol, and propose an end-to-end delay analysis approach based on fixed priority scheduling. To the best of our knowledge, this is the first work that introduces the concept of mixed-criticality into wireless sensor-actuator networks. Evaluation results show the effectiveness and efficacy of our approach.
Keywords: WirelessHART networks; mixed-criticality; end-to-end delay; delay analysis
End-to-End Encryption in Messaging Services and National Security—Case of WhatsApp Messenger (Cited by: 1)
10
Authors: Robert E. Endeley. Journal of Information Security, 2018, No. 1, pp. 95-99 (5 pages)
The ubiquity of instant messaging services on mobile devices and their use of end-to-end encryption in safeguarding the privacy of their users have become a concern for some governments. The WhatsApp messaging service has emerged as the most popular messaging app on mobile devices today. It uses end-to-end encryption, which makes government and secret service efforts to combat organized crime, terrorists, and child pornographers technically impossible. Governments would like a "backdoor" into such apps to use in accessing messages, and have emphasized that they will only use the "backdoor" if there is a credible threat to national security. Users of WhatsApp have, however, argued against a "backdoor"; they claim a "backdoor" would not only be an infringement of their privacy, but that hackers could also take advantage of it. In light of this security and privacy conflict between the end users of WhatsApp and governments' need to access messages in order to thwart potential terror attacks, this paper presents the advantages of maintaining E2EE in WhatsApp and why governments should not be allowed a "backdoor" to access users' messages. This research presents the benefits encryption has for consumer security and privacy, and also the challenges it poses to public safety and national security.
Keywords: Instant messaging; WhatsApp; End-to-end encryption; National security; Privacy
Generating Questions Based on Semi-Automated and End-to-End Neural Network
11
Authors: Tianci Xia, Yuan Sun, Xiaobing Zhao, Wei Song, Yumiao Guo. Computers, Materials & Continua (SCIE, EI), 2019, No. 8, pp. 617-628 (12 pages)
With the emergence of large-scale knowledge bases, how to use triple information to generate natural questions is a key technology in question answering systems. The traditional way of generating questions requires a lot of manual intervention and produces a lot of noise. To solve these problems, we propose a joint model based on a semi-automated model and an end-to-end neural network to automatically generate questions. The semi-automated model can generate question templates and real questions by combining the knowledge base and center graph. The end-to-end neural network directly sends the knowledge base and real questions to a BiLSTM network. Meanwhile, the attention mechanism is utilized in the decoding layer, which makes the triples and generated questions more relevant. Finally, the experimental results on SimpleQuestions demonstrate the effectiveness of the proposed approach.
Keywords: Generating questions; semi-automated model; end-to-end neural network; question answering
End-to-end verifiable electronic voting scheme of blockchain based on random linear block code
12
Authors: 刘霆, Cui Zhe, Pu Hongquan, Peng Xingyi. High Technology Letters (EI, CAS), 2020, No. 1, pp. 25-33 (9 pages)
Blockchain is an emerging decentralized technology for electronic voting. The current main consensus protocols are not flexible enough to manage the distributed blockchain nodes to achieve high consensus efficiency. For practical implementation, a consensus based on random linear block code (RLBC) is proposed and applied to a blockchain voting scheme. Along with achieving record correctness and consistency among all nodes, the consensus method indicates the active and inactive consensus nodes. This ability can assist the management of consensus nodes and restrain the generation of chain forks. To achieve end-to-end verifiability, cast-or-audit and randomized partial checking (RPC) are used in the proposed scheme. The voter can verify, with high probability, the correctness of ballot encryption and decryption. The experiments illustrate that the efficiency of the proposed consensus is suitable for blockchain. The proposed electronic voting scheme is suited to practical implementation of voting.
Keywords: random linear block code (RLBC); electronic voting (e-voting); blockchain; consensus; end-to-end verifiable
Techniques for Avoiding Saccular Extended Obstruction at the Anastomotic Site of Functional End-to-End Anastomosis
13
Authors: Koichi Yoshida, Kenichi Ietsugu, Jyunpei Okamoto, Saki Hayashi, Yoshihide Asaumi, Naohiro Ota, Hiroyuki Sugawara, Satoshi Tabata, Masahiro Kaneki, Mitsuaki Sakatoku, Kaoru Kiyohara. Surgical Science, 2017, No. 7, pp. 305-311 (7 pages)
Saccular extended obstruction is generated when the anastomotic site of functional end-to-end anastomosis is extended saccularly and blocked by intestinal contents. This is a specific complication of functional end-to-end anastomosis. Saccular extended obstruction of the anastomotic site of functional end-to-end anastomosis causes postoperative intestinal obstruction. Saccular extended obstruction places a heavy burden on patients because surgery is necessary to treat the resulting intestinal obstruction. However, saccular extended obstruction is not a commonly recognized complication. The greatest factor contributing to the development of saccular extended obstruction is an acute angle between the portions of the intestinal tract oral and aboral to the anastomotic site. When this angle approaches an obtuse angle, preferably close to a straight line, stagnation of the intestinal contents does not occur at the anastomotic site of functional end-to-end anastomosis and saccular extended obstruction is avoided. To make the angle of the anastomotic intestinal tracts obtuse or straight, it may be effective to close the entry hole of the stapling suture instrument that creates the anastomotic stoma perpendicular to the intestinal axis.
Keywords: Functional end-to-end anastomosis; Stapling anastomosis; Anastomosis postoperative complication; Intestinal obstruction
Tibetan Multi-Dialect Speech Recognition Using Latent Regression Bayesian Network and End-To-End Mode
14
Authors: Yue Zhao, Jianjian Yue, Wei Song, Xiaona Xu, Xiali Li, Licheng Wu, Qiang Ji. Journal on Internet of Things, 2019, No. 1, pp. 17-23 (7 pages)
We proposed a method using a latent regression Bayesian network (LRBN) to extract the shared speech feature for the input of an end-to-end speech recognition model. The structure of LRBN is compact and its parameter learning is fast. Compared with a Convolutional Neural Network, it has a simpler and more understandable structure and fewer parameters to learn. Experimental results show the advantage of the hybrid LRBN/Bidirectional Long Short-Term Memory-Connectionist Temporal Classification architecture for Tibetan multi-dialect speech recognition, and demonstrate that the LRBN helps to differentiate among multiple language speech sets.
Keywords: Multi-dialect speech recognition; Tibetan language; latent regression Bayesian network; end-to-end model
Adaptive Cache Allocation with Prefetching Policy over End-to-End Data Processing
15
Authors: Hang Qin, Li Zhu. Journal of Signal and Information Processing, 2017, No. 3, pp. 152-160 (9 pages)
With the speed gap between storage system access and processor computing, end-to-end data processing has become a bottleneck to improving the total performance of computer systems over the Internet. Based on an analysis of data processing behavior, an adaptive cache organization scheme with fast address calculation is proposed. This scheme can make full use of the characteristics of stack space data access, adopt a fast address calculation strategy, and reduce the hit time of stack access. Adaptively, the stack cache can be turned off entirely when a stack overflow occurs, to avoid the effect of stack switching on processor performance. Also, based on the failure behavior of the instruction cache and the data cache, a prefetching policy is developed, which is combined with the data capture of the failover queue state. Finally, the proposed method can maintain the order of instruction and data access, which facilitates the extraction of prefetching in end-to-end data processing.
Keywords: End-to-end data processing; Storage system; Cache; Prefetching
Effect of swirling flow on platelet concentration distribution in small-caliber artificial grafts and end-to-end anastomoses
16
Authors: Fan Zhan, Yu-Bo Fan, Xiao-Yan Deng. Acta Mechanica Sinica (SCIE, EI, CAS, CSCD), 2011, No. 5, pp. 833-839 (7 pages)
Platelet concentration near the blood vessel wall is one of the major factors in the adhesion of platelets to the wall. In our previous studies, it was found that swirling flows could suppress platelet adhesion in small-caliber artificial grafts and end-to-end anastomoses. To better understand the beneficial effect of the swirling flow, we numerically analyzed the near-wall concentration distribution of platelets in a straight tube and a sudden tubular expansion tube under both swirling flow and normal flow conditions. The numerical models were created based on our previous experimental studies. The simulation results revealed that, compared with the normal flow, the swirling flow could significantly reduce the near-wall concentration of platelets in both the straight tube and the expansion tube. The present numerical study therefore indicates that the reduction in platelet adhesion under swirling flow conditions in small-caliber arterial grafts, or in end-to-end anastomoses as observed in our previous experimental study, was possibly through a mechanism of platelet transport, in which the swirling flow reduced the near-wall concentration of platelets.
Keywords: Platelet near-wall concentration; Swirling flow; Small-caliber arterial grafts; End-to-end anastomosis
Pairwise tagging framework for end-to-end emotion-cause pair extraction
17
Authors: Zhen WU, Xinyu DAI, Rui XIA. Frontiers of Computer Science (SCIE, EI, CSCD), 2023, No. 2, pp. 111-120 (10 pages)
Emotion-cause pair extraction (ECPE) aims to extract all the pairs of emotions and corresponding causes in a document. It generally contains three subtasks: emotion extraction, cause extraction, and detection of causal relations between emotions and causes. Existing works adopt pipelined approaches or multi-task learning to address the ECPE task. However, the pipelined approaches easily suffer from error propagation in real-world scenarios. Typical multi-task learning cannot optimize all tasks globally and may lead to suboptimal extraction results. To address these issues, we propose a novel framework, the Pairwise Tagging Framework (PTF), which tackles complete emotion-cause pair extraction in one unified tagging task. Unlike prior works, PTF transforms all subtasks of ECPE into one unified clause-pair tagging task. Through this unified tagging task, we can optimize the ECPE task globally and extract more accurate emotion-cause pairs. To validate the feasibility and effectiveness of PTF, we design an end-to-end PTF-based neural network and conduct experiments on the ECPE benchmark dataset. The experimental results show that our method outperforms pipelined approaches significantly, and also outperforms typical multi-task learning approaches.
Keywords: emotion-cause pair extraction; pairwise tagging framework; end-to-end neural network
Unsupervised Dialogue State Tracking for End-to-End Task-Oriented Dialogue with a Multi-Span Prediction Network
18
Authors: 刘庆斌, 何世柱, 刘操, 刘康, 赵军. Journal of Computer Science & Technology (SCIE, EI, CSCD), 2023, No. 4, pp. 834-852 (19 pages)
This paper focuses on end-to-end task-oriented dialogue systems, which jointly handle dialogue state tracking (DST) and response generation. Traditional methods usually adopt a supervised paradigm to learn DST from a manually labeled corpus. However, annotating the corpus is costly and time-consuming, and cannot cover the wide range of domains in the real world. To solve this problem, we propose a multi-span prediction network (MSPN) that performs unsupervised DST for end-to-end task-oriented dialogue. Specifically, MSPN contains a novel split-merge copy mechanism that captures long-term dependencies in dialogues to automatically extract multiple text spans as keywords. Based on these keywords, MSPN uses a clustering approach based on semantic distance to obtain the values of each slot. In addition, we propose an ontology-based reinforcement learning approach, which employs the values of each slot to train MSPN to generate relevant values. Experimental results on single-domain and multi-domain task-oriented dialogue datasets show that MSPN achieves state-of-the-art performance with significant improvements. Besides, we construct a new Chinese dialogue dataset, MeDial, in the low-resource medical domain, which further demonstrates the adaptability of MSPN.
Keywords: end-to-end task-oriented dialogue; dialogue state tracking (DST); unsupervised learning; reinforcement learning
Generalized End-to-End Loss for Forensic Speaker Verification
19
Authors: Huapeng WANG, Fangzhou HE, Lianquan WU. Journal of Systems Science and Information (CSCD), 2023, No. 2, pp. 264-276 (13 pages)
In recent years, various speech embedding methods based on deep learning have been proposed and have shown better performance in speaker verification. Those new technologies will inevitably promote the development of forensic speaker verification. We propose a new forensic speaker verification method based on embeddings trained with a loss function called generalized end-to-end (GE2E) loss. First, a long short-term memory (LSTM) based deep neural network (DNN) is trained as the embedding extractor. Then the cosine similarity scores between embeddings from same-speaker comparison pairs and different-speaker comparison pairs are used to train the within-speaker model and the between-speaker model, respectively. Finally, the cosine similarity scores between the questioned embeddings and enrolled embeddings are evaluated in these two models to get the likelihood ratio (LR) value. On the subset of LibriSpeech, test-other-500, we achieve a new state of the art. All same-speaker comparison pairs and all different-speaker comparison pairs yield correct results and can provide considerably strong evidence for courts.
Keywords: speaker verification; generalized end-to-end loss; LSTM; likelihood ratio; cosine similarity
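The scoring pipeline in the abstract above (cosine similarity score, then a within-speaker and a between-speaker score model, then a likelihood ratio) can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each score model is a single Gaussian fitted to calibration scores, which the abstract does not state, and all function names and the sample scores are hypothetical.

```python
import math
from statistics import mean, stdev


def cosine_similarity(a, b):
    """Cosine of the angle between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def gaussian_pdf(x, mu, sigma):
    """Density of a normal distribution N(mu, sigma^2) at x."""
    z = (x - mu) / sigma
    return math.exp(-0.5 * z * z) / (sigma * math.sqrt(2.0 * math.pi))


def likelihood_ratio(score, same_speaker_scores, diff_speaker_scores):
    """LR = p(score | same speaker) / p(score | different speakers).

    Each score distribution is modelled as one Gaussian (an assumption
    made for this sketch only).
    """
    p_same = gaussian_pdf(score, mean(same_speaker_scores), stdev(same_speaker_scores))
    p_diff = gaussian_pdf(score, mean(diff_speaker_scores), stdev(diff_speaker_scores))
    return p_same / p_diff


# Hypothetical calibration scores from known same/different speaker pairs.
same_scores = [0.80, 0.90, 0.85, 0.95]
diff_scores = [0.10, 0.20, 0.00, 0.30]

# Score between a questioned embedding and an enrolled embedding.
score = cosine_similarity([0.9, 0.1, 0.4], [0.8, 0.2, 0.5])
lr = likelihood_ratio(score, same_scores, diff_scores)
```

An LR above 1 supports the same-speaker hypothesis and an LR below 1 supports the different-speaker hypothesis; the magnitude is what is reported as strength of evidence.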
Density Map Guided Region Localization for End-to-End Small Object Detection
20
Authors: Bo LI, Kai HUANG, Junhui LI, Yufu LIAO. Journal of Systems Science and Information (CSCD), 2023, No. 6, pp. 776-794 (19 pages)
With the advancement of society and of science and technology, the demand for detecting small objects in practical scenarios is growing. Such objects are represented by only a relatively small number of pixels, and their features are degraded severely after being extracted by a deep convolutional neural network, which is detrimental to detection performance for small objects. Therefore, an intuitive solution is to increase the resolution of small objects by cropping the original image. In this paper, we propose a simple but effective object density map guided region localization module (DMGRL) to locate and crop the regions of interest where small objects may exist. Firstly, the density map of the objects is estimated by an object density map estimation network, and then the coordinates of the small object regions are calculated. Secondly, a continuous differentiable affine transformation is utilized to crop these regions so that the detector with DMGRL can be trained end-to-end instead of in two stages. Finally, all prediction results for the input image and the cropped region images are merged to output the final detection results by non-maximum suppression (NMS). Extensive experiments demonstrate the superior performance of the detector incorporating DMGRL.
Keywords: small object detection; density map estimation; end-to-end training; affine transformation
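The final merging step in the abstract above (combining predictions from the full image and the cropped regions, then applying NMS) can be sketched as below. This is a generic greedy NMS under simplifying assumptions: crop detections are mapped back to full-image coordinates by a pure translation (the paper's affine transformation may also rescale), and the function names, box format `(x1, y1, x2, y2)` and IoU threshold are hypothetical.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def nms(detections, iou_threshold=0.5):
    """Greedy NMS over (box, score) pairs; the highest score wins on overlap."""
    kept = []
    for box, score in sorted(detections, key=lambda d: d[1], reverse=True):
        if all(iou(box, kept_box) <= iou_threshold for kept_box, _ in kept):
            kept.append((box, score))
    return kept


def merge_detections(full_image_dets, crops, iou_threshold=0.5):
    """Shift crop detections into full-image coordinates, then run NMS.

    crops is a list of ((dx, dy), detections), where (dx, dy) is the
    top-left corner of the crop in the full image (translation only,
    an assumption of this sketch).
    """
    merged = list(full_image_dets)
    for (dx, dy), dets in crops:
        for (x1, y1, x2, y2), score in dets:
            merged.append(((x1 + dx, y1 + dy, x2 + dx, y2 + dy), score))
    return nms(merged, iou_threshold)


full = [((10, 10, 20, 20), 0.6)]
crops = [((8, 8), [((2, 2, 12, 12), 0.9), ((30, 30, 40, 40), 0.7)])]
final = merge_detections(full, crops)
```

In this toy input the first crop detection lands exactly on the full-image detection, so only the higher-scoring copy survives, while the second crop detection is kept as a separate object.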