Light field tomography,an optical combustion diagnostic technology,has recently attracted extensive attention due to its easy implementation and non-intrusion.However,the conventional iterative methods are high data t...Light field tomography,an optical combustion diagnostic technology,has recently attracted extensive attention due to its easy implementation and non-intrusion.However,the conventional iterative methods are high data throughput,low efficiency and time-consuming,and the existing machine learning models use the radiation spectrum information of the flame to realize the parameter field measurement at the current time.It is still an offline measurement and cannot realize the online prediction of the instantaneous structure of the actual turbulent combustion field.In this work,a novel online prediction model of flame temperature instantaneous structure based on deep convolutional neural network and long short-term memory(CNN-LSTM)is proposed.The method uses the characteristics of local perception,shared weight,and pooling of CNN to extract the threedimensional(3D)features of flame temperature and outgoing radiation images.Moreover,the LSTM is used to comprehensively utilize the ten historical time series information of high dynamic combustion flame to accurately predict 3D temperature at three future moments.A chaotic time-series dataset based on the flame radiation forward model is built to train and validate the performance of the proposed CNN-LSTM model.It is proven that the CNN-LSTM prediction model can successfully learn the evolution pattern of combustion flame and make accurate predictions.展开更多
Compared with manual sorting of citrus fruit,vision-based sorting solutions can help achieve higher accuracy and efficiency.In this study,we present a vision system based on CNN-LSTM,which can cooperate with robotic g...Compared with manual sorting of citrus fruit,vision-based sorting solutions can help achieve higher accuracy and efficiency.In this study,we present a vision system based on CNN-LSTM,which can cooperate with robotic grippers for real-time sorting and is readily applicable to various citrus processing plants.A CNN-based detector was adopted to detect the defective oranges in view and temporarily classify them into corresponding types,and an LSTM-based predictor was used to predict the position of the oranges in a future frame based on image sequential data.The fusion of CNN and LSTM networks enabled the system to track defective ones during rotation and identify their true types,and their future path was also predicted which is vital for predictive control of visually guided robotic grasping.High detection accuracy of 94.1%was obtained based on experimental results,and the error for path prediction was within 4.33 pixels 40 frames later.The average time to process a frame was between 28 and 62 frames per second,which also satisfied real-time performance.The results proved the potential of the proposed system for automated citrus sorting with good precision and efficiency,and it can be readily extended to other fruit crops featuring high versatility.展开更多
In recent years,e-sports has rapidly developed,and the industry has produced large amounts of data with specifications,and these data are easily to be obtained.Due to the above characteristics,data mining and deep lea...In recent years,e-sports has rapidly developed,and the industry has produced large amounts of data with specifications,and these data are easily to be obtained.Due to the above characteristics,data mining and deep learning methods can be used to guide players and develop appropriate strategies to win games.As one of the world’s most famous e-sports events,Dota2 has a large audience base and a good game system.A victory in a game is often associated with a hero’s match,and players are often unable to pick the best lineup to compete.To solve this problem,in this paper,we present an improved bidirectional Long Short-Term Memory(LSTM)neural network model for Dota2 lineup recommendations.The model uses the Continuous Bag Of Words(CBOW)model in the Word2 vec model to generate hero vectors.The CBOW model can predict the context of a word in a sentence.Accordingly,a word is transformed into a hero,a sentence into a lineup,and a word vector into a hero vector,the model applied in this article recommends the last hero according to the first four heroes selected first,thereby solving a series of recommendation problems.展开更多
As a fundamental task in computer vision,visual object tracking has received much attention in recent years.Most studies focus on short-term visual tracking which addresses shorter videos and always-visible targets.Ho...As a fundamental task in computer vision,visual object tracking has received much attention in recent years.Most studies focus on short-term visual tracking which addresses shorter videos and always-visible targets.However,long-term visual tracking is much closer to practical applications with more complicated challenges.There exists a longer duration such as minute-level or even hour-level in the long-term tracking task,and the task also needs to handle more frequent target disappearance and reappearance.In this paper,we provide a thorough review of long-term tracking,summarizing long-term tracking algorithms from two perspectives:framework architectures and utilization of intermediate tracking results.Then we provide a detailed description of existing benchmarks and corresponding evaluation protocols.Furthermore,we conduct extensive experiments and analyse the performance of trackers on six benchmarks:VOTLT2018,VOTLT2019(2020/2021),OxUvA,LaSOT,TLP and the long-term subset of VTUAV-V.Finally,we discuss the future prospects from multiple perspectives,including algorithm design and benchmark construction.To our knowledge,this is the first comprehensive survey for long-term visual object tracking.The relevant content is available at https://github.com/wangdongdut/Long-term-Visual-Tracking.展开更多
基金This work was supported by the National Natural Science Foundation of China(Grant No.51976044,and 52227813)the Foundation for Heilongjiang Touyan Innovation Team Program。
文摘Light field tomography,an optical combustion diagnostic technology,has recently attracted extensive attention due to its easy implementation and non-intrusion.However,the conventional iterative methods are high data throughput,low efficiency and time-consuming,and the existing machine learning models use the radiation spectrum information of the flame to realize the parameter field measurement at the current time.It is still an offline measurement and cannot realize the online prediction of the instantaneous structure of the actual turbulent combustion field.In this work,a novel online prediction model of flame temperature instantaneous structure based on deep convolutional neural network and long short-term memory(CNN-LSTM)is proposed.The method uses the characteristics of local perception,shared weight,and pooling of CNN to extract the threedimensional(3D)features of flame temperature and outgoing radiation images.Moreover,the LSTM is used to comprehensively utilize the ten historical time series information of high dynamic combustion flame to accurately predict 3D temperature at three future moments.A chaotic time-series dataset based on the flame radiation forward model is built to train and validate the performance of the proposed CNN-LSTM model.It is proven that the CNN-LSTM prediction model can successfully learn the evolution pattern of combustion flame and make accurate predictions.
基金National Key R&D Program(2020YFD1000101)and Special Funds for the Construction of Industrial Technology System of Modern Agriculture(Citrus)(CARS-26)Construction Project of Citrus Whole Course Mechanized Scientifific Research Base(Agricultural Development Facility 297[2017]19),Hubei Agricultural Science and Technology Innovation Action Project.
文摘Compared with manual sorting of citrus fruit,vision-based sorting solutions can help achieve higher accuracy and efficiency.In this study,we present a vision system based on CNN-LSTM,which can cooperate with robotic grippers for real-time sorting and is readily applicable to various citrus processing plants.A CNN-based detector was adopted to detect the defective oranges in view and temporarily classify them into corresponding types,and an LSTM-based predictor was used to predict the position of the oranges in a future frame based on image sequential data.The fusion of CNN and LSTM networks enabled the system to track defective ones during rotation and identify their true types,and their future path was also predicted which is vital for predictive control of visually guided robotic grasping.High detection accuracy of 94.1%was obtained based on experimental results,and the error for path prediction was within 4.33 pixels 40 frames later.The average time to process a frame was between 28 and 62 frames per second,which also satisfied real-time performance.The results proved the potential of the proposed system for automated citrus sorting with good precision and efficiency,and it can be readily extended to other fruit crops featuring high versatility.
基金the Guangdong Province Key Research and Development Plan(No.2019B010137004)the National Natural Science Foundation of China(Nos.61402149 and 61871140)+3 种基金the Scientific and Technological Project of Henan Province(Nos.182102110065,182102210238,and 202102310340)the Natural Science Foundation of Henan Educational Committee(No.17B520006)Guangdong Province Universities and Colleges Pearl River Scholar Funded Scheme(2019)Foundation of University Young Key Teacher of Henan Province(No.2019GGJS040)。
文摘In recent years,e-sports has rapidly developed,and the industry has produced large amounts of data with specifications,and these data are easily to be obtained.Due to the above characteristics,data mining and deep learning methods can be used to guide players and develop appropriate strategies to win games.As one of the world’s most famous e-sports events,Dota2 has a large audience base and a good game system.A victory in a game is often associated with a hero’s match,and players are often unable to pick the best lineup to compete.To solve this problem,in this paper,we present an improved bidirectional Long Short-Term Memory(LSTM)neural network model for Dota2 lineup recommendations.The model uses the Continuous Bag Of Words(CBOW)model in the Word2 vec model to generate hero vectors.The CBOW model can predict the context of a word in a sentence.Accordingly,a word is transformed into a hero,a sentence into a lineup,and a word vector into a hero vector,the model applied in this article recommends the last hero according to the first four heroes selected first,thereby solving a series of recommendation problems.
基金supported by National Natural Science Foundation of China(Nos.62176041 and 62022021)Joint Fund of Ministry of Education for Equipment Preresearch,China(No.8091B032155)+1 种基金the Science and Technology Innovation Foundation of Dalian,China(No.2020 JJ26GX036)the Fundamental Research Funds for the Central Universities,China(No.DUT21LAB127).
文摘As a fundamental task in computer vision,visual object tracking has received much attention in recent years.Most studies focus on short-term visual tracking which addresses shorter videos and always-visible targets.However,long-term visual tracking is much closer to practical applications with more complicated challenges.There exists a longer duration such as minute-level or even hour-level in the long-term tracking task,and the task also needs to handle more frequent target disappearance and reappearance.In this paper,we provide a thorough review of long-term tracking,summarizing long-term tracking algorithms from two perspectives:framework architectures and utilization of intermediate tracking results.Then we provide a detailed description of existing benchmarks and corresponding evaluation protocols.Furthermore,we conduct extensive experiments and analyse the performance of trackers on six benchmarks:VOTLT2018,VOTLT2019(2020/2021),OxUvA,LaSOT,TLP and the long-term subset of VTUAV-V.Finally,we discuss the future prospects from multiple perspectives,including algorithm design and benchmark construction.To our knowledge,this is the first comprehensive survey for long-term visual object tracking.The relevant content is available at https://github.com/wangdongdut/Long-term-Visual-Tracking.