With the intensifying aging of the population,the phenomenon of the elderly living alone is also increasing.Therefore,using modern internet of things technology to monitor the daily behavior of the elderly in indoors ...With the intensifying aging of the population,the phenomenon of the elderly living alone is also increasing.Therefore,using modern internet of things technology to monitor the daily behavior of the elderly in indoors is a meaningful study.Video-based action recognition tasks are easily affected by object occlusion and weak ambient light,resulting in poor recognition performance.Therefore,this paper proposes an indoor human behavior recognition method based on wireless fidelity(Wi-Fi)perception and video feature fusion by utilizing the ability of Wi-Fi signals to carry environmental information during the propagation process.This paper uses the public WiFi-based activity recognition dataset(WIAR)containing Wi-Fi channel state information and essential action videos,and then extracts video feature vectors and Wi-Fi signal feature vectors in the datasets through the two-stream convolutional neural network and standard statistical algorithms,respectively.Then the two sets of feature vectors are fused,and finally,the action classification and recognition are performed by the support vector machine(SVM).The experiments in this paper contrast experiments between the two-stream network model and the methods in this paper under three different environments.And the accuracy of action recognition after adding Wi-Fi signal feature fusion is improved by 10%on average.展开更多
Recently,human healthcare from body sensor data has gained considerable interest from a wide variety of human-computer communication and pattern analysis research owing to their real-time applications namely smart hea...Recently,human healthcare from body sensor data has gained considerable interest from a wide variety of human-computer communication and pattern analysis research owing to their real-time applications namely smart healthcare systems.Even though there are various forms of utilizing distributed sensors to monitor the behavior of people and vital signs,physical human action recognition(HAR)through body sensors gives useful information about the lifestyle and functionality of an individual.This article concentrates on the design of an Improved Transient Search Optimization with Machine Learning based BehaviorRecognition(ITSOMLBR)technique using body sensor data.The presented ITSOML-BR technique collects data from different body sensors namely electrocardiography(ECG),accelerometer,and magnetometer.In addition,the ITSOML-BR technique extract features like variance,mean,skewness,and standard deviation.Moreover,the presented ITSOML-BR technique executes a micro neural network(MNN)which can be employed for long term healthcare monitoring and classification.Furthermore,the parameters related to the MNN model are optimally selected via the ITSO algorithm.The experimental result analysis of the ITSOML-BR technique is tested on the MHEALTH dataset.The comprehensive comparison study reported a higher result for the ITSOMLBR approach over other existing approaches with maximum accuracy of 99.60%.展开更多
Compared with RGB videos and images,human bone data is less vulnerable to external factors and has stronger robustness.Therefore,behavior recognition methods based on skeletons are widely studied.Because graph convolu...Compared with RGB videos and images,human bone data is less vulnerable to external factors and has stronger robustness.Therefore,behavior recognition methods based on skeletons are widely studied.Because graph convolution network(GCN)can deal with the irregular topology data of hu-man skeletons very well,more and more researchers apply GCN to human behavior recognition.Tra-ditional graph convolution methods only consider the joints with physical connectivity or the same type when building the behavior recognition model based on human skeletons structure,which cannot capture higher-order information better.To solve this problem,Motif-GCN is used in this paper to ex-tract spatial features.The relationship between the joints with natural connection in the human body is encoded by the first Motif-GCN,and the possible relationship between the unconnected joints in the human skeleton is encoded by the second Motif-GCN.In this way,the relationship between non-physical joints can be strengthened.Then a two stream framework combining joint and bone informa-tion is used to capture more action information.Finally,experiments are conducted on two subdata-sets X-Sub and X-View of NTU-RGB+D,and the accuracy shown in Top-1 classification results is 89.5%and 95.4%respectively.The experimental results are 1.0%and 0.3%higher than those of the 2S-AGCN model respectively.The superiority of this method is also proved by the experimental results.展开更多
Building indoor dangerous behavior recognition is a specific application in the field of abnormal human recognition.A human dangerous behavior recognition method based on LSTM-GCN with attention mechanism(GLA)model wa...Building indoor dangerous behavior recognition is a specific application in the field of abnormal human recognition.A human dangerous behavior recognition method based on LSTM-GCN with attention mechanism(GLA)model was proposed aiming at the problem that the existing human skeleton-based action recognition methods cannot fully extract the temporal and spatial features.The network connects GCN and LSTMnetwork in series,and inputs the skeleton sequence extracted by GCN that contains spatial information into the LSTM layer for time sequence feature extraction,which fully excavates the temporal and spatial features of the skeleton sequence.Finally,an attention layer is designed to enhance the features of key bone points,and Softmax is used to classify and identify dangerous behaviors.The dangerous behavior datasets are derived from NTU-RGB+D and Kinetics data sets.Experimental results show that the proposed method can effectively identify some dangerous behaviors in the building,and its accuracy is higher than those of other similar methods.展开更多
Communication behavior recognition is an issue with increasingly importance in the antiterrorism and national defense area.However,the sensing data obtained in actual environment is often not sufficient to accurately ...Communication behavior recognition is an issue with increasingly importance in the antiterrorism and national defense area.However,the sensing data obtained in actual environment is often not sufficient to accurately analyze the communication behavior.Traditional means can hardly utilize the scarce and crude spectrum sensing data captured in a real scene.Thus,communication behavior recognition using raw sensing data under smallsample condition has become a new challenge.In this paper,a data enhanced communication behavior recognition(DECBR)scheme is proposed to meet this challenge.Firstly,a preprocessing method is designed to make the raw spectrum data suitable for the proposed scheme.Then,an adaptive convolutional neural network structure is exploited to carry out communication behavior recognition.Moreover,DCGAN is applied to support data enhancement,which realize communication behavior recognition under small-sample condition.Finally,the scheme is verified by experiments under different data size.The results show that the DECBR scheme can greatly improve the accuracy and efficiency of behavior recognition under smallsample condition.展开更多
In order to effectively solve the problems of low accuracy,large amount of computation and complex logic of deep learning algorithms in behavior recognition,a kind of behavior recognition based on the fusion of 3 dime...In order to effectively solve the problems of low accuracy,large amount of computation and complex logic of deep learning algorithms in behavior recognition,a kind of behavior recognition based on the fusion of 3 dimensional batch normalization visual geometry group(3D-BN-VGG)and long short-term memory(LSTM)network is designed.In this network,3D convolutional layer is used to extract the spatial domain features and time domain features of video sequence at the same time,multiple small convolution kernels are stacked to replace large convolution kernels,thus the depth of neural network is deepened and the number of network parameters is reduced.In addition,the latest batch normalization algorithm is added to the 3-dimensional convolutional network to improve the training speed.Then the output of the full connection layer is sent to LSTM network as the feature vectors to extract the sequence information.This method,which directly uses the output of the whole base level without passing through the full connection layer,reduces the parameters of the whole fusion network to 15324485,nearly twice as much as those of 3D-BN-VGG.Finally,it reveals that the proposed network achieves 96.5%and 74.9%accuracy in the UCF-101 and HMDB-51 respectively,and the algorithm has a calculation speed of 1066 fps and an acceleration ratio of 1,which has a significant predominance in velocity.展开更多
In order to effectively solve the problems of low accuracy and large amount of calculation of current human behavior recognition,a behavior recognition algorithm based on squeeze-and-excitation network(SENet) combined...In order to effectively solve the problems of low accuracy and large amount of calculation of current human behavior recognition,a behavior recognition algorithm based on squeeze-and-excitation network(SENet) combined with 3 D Inception network(I3 D) and gated recurrent unit(GRU) network is proposed.The algorithm first expands the Inception module to three-dimensional,and builds a network based on the three-dimensional module,and expands SENet to three-dimensional,making it an attention mechanism that can pay attention to the three-dimensional channel.Then SENet is introduced into the 13 D network,named SE-I3 D,and SENet is introduced into the CRU network,named SE-GRU.And,SE-13 D and SE-GRU are merged,named SE-13 D-GRU.Finally,the network uses Softmax to classify the results in the UCF-101 dataset.The experimental results show that the SE-I3 D-GRU network achieves a recognition rate of 93.2% on the UCF-101 dataset.展开更多
Because behavior recognition is based on video frame sequences,this paper proposes a behavior recognition algorithm that combines 3D residual convolutional neural network(R3D)and long short-term memory(LSTM).First,the...Because behavior recognition is based on video frame sequences,this paper proposes a behavior recognition algorithm that combines 3D residual convolutional neural network(R3D)and long short-term memory(LSTM).First,the residual module is extended to three dimensions,which can extract features in the time and space domain at the same time.Second,by changing the size of the pooling layer window the integrity of the time domain features is preserved,at the same time,in order to overcome the difficulty of network training and over-fitting problems,the batch normalization(BN)layer and the dropout layer are added.After that,because the global average pooling layer(GAP)is affected by the size of the feature map,the network cannot be further deepened,so the convolution layer and maxpool layer are added to the R3D network.Finally,because LSTM has the ability to memorize information and can extract more abstract timing features,the LSTM network is introduced into the R3D network.Experimental results show that the R3D+LSTM network achieves 91%recognition rate on the UCF-101 dataset.展开更多
In the process of human behavior recognition, the traditional dense optical flow method has too many pixels and too much overhead, which limits the running speed. This paper proposed a method combing YOLOv3 (You Only ...In the process of human behavior recognition, the traditional dense optical flow method has too many pixels and too much overhead, which limits the running speed. This paper proposed a method combing YOLOv3 (You Only Look Once v3) and local optical flow method. Based on the dense optical flow method, the optical flow modulus of the area where the human target is detected is calculated to reduce the amount of computation and save the cost in terms of time. And then, a threshold value is set to complete the human behavior identification. Through design algorithm, experimental verification and other steps, the walking, running and falling state of human body in real life indoor sports video was identified. Experimental results show that this algorithm is more advantageous for jogging behavior recognition.展开更多
User behavior prediction has become a core element to Internet of Things(IoT)and received promising attention in the related fields.Many existing IoT systems(e.g.smart home systems)have been deployed various sensors a...User behavior prediction has become a core element to Internet of Things(IoT)and received promising attention in the related fields.Many existing IoT systems(e.g.smart home systems)have been deployed various sensors and the user’s behavior can be predicted through the sensor data.However,most of the existing sensor-based systems use the annotated behavior data which requires human intervention to achieve the behavior prediction.Therefore,it is a challenge to provide an automatic behavior prediction model based on the original sensor data.To solve the problem,this paper proposed a novel automatic annotated user behavior prediction(AAUBP)model.The proposed AAUBP model combined the Discontinuous Solving Order Sequence Mining(DVSM)behavior recognition model and behavior prediction model based on the Long Short Term Memory(LSTM)network.To evaluate the model,we performed several experiments on a real-world dataset tuning the parameters.The results showed that the AAUBP model can effectively recognize behaviors and had a good performance for behavior prediction.展开更多
This paper proposes the research on human body behavior recognition based on vision. Behavior based on high-level human structure can describe behavior more accurately, but it is dif? cult to extract the behavioral c...This paper proposes the research on human body behavior recognition based on vision. Behavior based on high-level human structure can describe behavior more accurately, but it is dif? cult to extract the behavioral characteristics while often relying on the accuracy of the human pose estimation. Moving object extraction of the moving targets in video analysis as the main content, research based on the image sequence robust, fast moving target extraction, motion estimation and target description algorithm, and the correlation between motion detection is to use frame, frame by comparing the difference between for change and not change area. The model is proposed based on the probability theory, and the future research will be focused on the simulation.展开更多
Effective collection,recognition,and analysis of sports information is the key to intelligent sports,which can help athletes to improve their skills and formulate scientific training plans and competition strategies.A...Effective collection,recognition,and analysis of sports information is the key to intelligent sports,which can help athletes to improve their skills and formulate scientific training plans and competition strategies.At present,wearable electronic devices used for movement monitoring still have some limitations,such as high cost and energy consumption,incompatibility of suitable flexibility and personalized spatial structure,dissatisfactory data analysis methods,etc.In this work,a novel three-dimensionalprinted thermoplastic polyurethane is introduced as the elastic shell and friction layer,and it endows the proposed customizable and flexible triboelectric nanogenerator(CF-TENG)with personalized spatial structure and robust correlation to external pressure.In practical application,it exhibits highly sensitive responses to the joint-bending motion of the finger,wrist,or elbow.Furthermore,a pressure-sensing insole and smart ski pole based on CF-TENG are manufactured to build a comprehensive sports monitoring system to transmit the athletes’motion information from feet and hands through the plantar pressure distribution and ski pole action.To recognize the movement status,the self-developed automatic peak recognition algorithm(P-Find)and machine learning algorithm(subspace K-Nearest Neighbors)were introduced to accurately distinguish the four typical motion behaviors and three primary sub-techniques of cross-country skiing,with accuracy rates of 98.2%and 100%.This work provides a novel strategy to promote the personalized applications of TENGs in intelligent sports.展开更多
The core technology in an intelligent video surveillance system is that detecting and recognizing abnormal behaviors timely and accurately.The key breakthrough point in recognizing abnormal behaviors is how to obtain ...The core technology in an intelligent video surveillance system is that detecting and recognizing abnormal behaviors timely and accurately.The key breakthrough point in recognizing abnormal behaviors is how to obtain the effective features of the picture,so as to solve the problem of recognizing them.In response to this difficulty,this paper introduces an adjustable jump link coefficients model based on the residual network.The effective coefficients for each layer of the network can be set after using this model to further improving the recognition accuracy of abnormal behavior.A convolution kernel of 1×1 size is added to reduce the number of parameters for the purpose of improving the speed of the model in this paper.In order to reduce the noise of the data edge,and at the same time,improve the accuracy of the data and speed up the training,a BN(Batch Normalization)layer is added before the activation function in this network.This paper trains this network model on the public ImageNet dataset,and then uses the transfer learning method to recognize these abnormal behaviors of human in the UTI behavior dataset processed by the YOLO_v3 target detection network.Under the same experimental conditions,compared with the original ResNet-50 model,the improved model in this paper has a 2.8%higher accuracy in recognition of abnormal behaviors on the public UTI dataset.展开更多
Aiming at the problem of automatic detection of normal operation behavior in self-service business management,with improved motion history image as input,a recognition method of convolutional neural network is propose...Aiming at the problem of automatic detection of normal operation behavior in self-service business management,with improved motion history image as input,a recognition method of convolutional neural network is proposed to timely judge the occurrence of anomie behavior.Firstly,the key frame sequence was extracted from the self-service operation video based on the method of uniform energy down-sampling.Secondly,combined with the timing information of key frames to adaptively estimate the decay parameters of the motion history image,adding information contrast to generating a logic matrix can improve the calculation speed of the improved motion history image.Finally,the formed motion history image was input into the established convolutional neural network to obtain the class of self-service behavior and distinguish anomie behavior.In real scenarios of self-service baggage check-in for civil aviation passengers,the typical check-in behavior data set is established and tested in actual self-service baggage check-in system of the airport.The results show that the method proposed can effectively identify typical anomie behaviors and has high practical value.展开更多
With the increasing number of digital devices generating a vast amount of video data,the recognition of abnormal image patterns has become more important.Accordingly,it is necessary to develop a method that achieves t...With the increasing number of digital devices generating a vast amount of video data,the recognition of abnormal image patterns has become more important.Accordingly,it is necessary to develop a method that achieves this task using object and behavior information within video data.Existing methods for detecting abnormal behaviors only focus on simple motions,therefore they cannot determine the overall behavior occurring throughout a video.In this study,an abnormal behavior detection method that uses deep learning(DL)-based video-data structuring is proposed.Objects and motions are first extracted from continuous images by combining existing DL-based image analysis models.The weight of the continuous data pattern is then analyzed through data structuring to classify the overall video.The performance of the proposed method was evaluated using varying parameter settings,such as the size of the action clip and interval between action clips.The model achieved an accuracy of 0.9817,indicating excellent performance.Therefore,we conclude that the proposed data structuring method is useful in detecting and classifying abnormal behaviors.展开更多
With the development of wireless technology, Frequency-Modulated Continuous Wave (FMCW) radar has increased sensing capability and can be used to recognize human activity. These applications have gained wide-spread at...With the development of wireless technology, Frequency-Modulated Continuous Wave (FMCW) radar has increased sensing capability and can be used to recognize human activity. These applications have gained wide-spread attention and become a hot research area. FMCW signals reflected by target activity can be collected, and human activity can be recognized based on the measurements. This paper focused on human activity recognition based on FMCW and DenseNet. We collected point clouds from FMCW and analyzed them to recognize human activity because different activities could lead to unique point cloud features. We built and trained the neural network to implement human activities using a FMCW signal. Firstly, this paper presented recent reviews about human activity recognition using wireless signals. Then, it introduced the basic concepts of FMCW radar and described the fundamental principles of the system using FMCW radar. We also provided the system framework, experiment scenario, and DenseNet neural network structure. Finally, we presented the experimental results and analyzed the accuracy of different neural network models. The system achieved recognition accuracy of 100 percent for five activities using the DenseNet. We concluded the paper by discussing the current issues and future research directions.展开更多
Due to the increasing demand for security, the development of intelligent surveillance systems has attracted considerable attention in recent years. This study aims to develop a system that is able to identify whether...Due to the increasing demand for security, the development of intelligent surveillance systems has attracted considerable attention in recent years. This study aims to develop a system that is able to identify whether or not the people need help in a public place. Different from previous work, our work considers not only the behaviors of the target person but also the interaction between him and nearby people. In the paper, we propose an event alarm system which can detect the human behaviors and recognize the happening event through integrating the results generated from the single and group behavior analysis. Several new effective features are proposed in the study. Besides, a mechanism capable of extracting one-to-one and multiple-to-one relations is also developed. Experimental results show that the proposed approach can correctly detect human behaviors and provide the alarm messages when emergency events occur.展开更多
Safety production is of great significance to the development of enterprises and society.Accidents often cause great losses because of the particularity environment of electric power.Therefore,it is important to impro...Safety production is of great significance to the development of enterprises and society.Accidents often cause great losses because of the particularity environment of electric power.Therefore,it is important to improve the safety supervision and protection in the electric power environment.In this paper,we simulate the actual electric power operation scenario by monitoring equipment and propose a real-time detection method of illegal actions based on human body key points to ensure safety behavior in real time.In this method,the human body key points in video frames were first extracted by the high-resolution network,and then classified in real time by spatial-temporal graph convolutional network.Experimental results show that this method can effectively detect illegal actions in the simulated scene.展开更多
With the advantages of real-time analysis and visual evaluation results,intelligent technology-enabled teaching behavior evaluation has gradually become a powerful means to help teachers adjust teaching behaviors and ...With the advantages of real-time analysis and visual evaluation results,intelligent technology-enabled teaching behavior evaluation has gradually become a powerful means to help teachers adjust teaching behaviors and improve teaching quality.However,at present,the evaluation of intelligent teachers’behaviors is still in the preliminary exploration stage,and the application research is not deep enough.This paper analyzes the application of intelligent technology in the evaluation of teachers’classroom teaching behaviors from the perspectives of evaluation data,methods,and results.Voice print recognition technology is used to recognize the teachers’identities and track the speech in the classroom videos,and the videos are segmented.Then,the evaluation framework of teachers’classroom teaching behaviors is constructed using three dimensions of emotion,posture,and position preference.Finally,evaluation results are presented to teachers in a more intuitive and easy-to-understand visual way,to help teachers reflect on teaching.This paper aims to promote the transformation of teachers’classroom teaching behavior evaluation toward an intelligent,efficient,and sustainable direction through current research.展开更多
When a human body moves within the coverage range of Wi-Fi signals,the reflected Wi-Fi signals by the various parts of the human body change the propagation path,so analysis of the channel state data can achieve the p...When a human body moves within the coverage range of Wi-Fi signals,the reflected Wi-Fi signals by the various parts of the human body change the propagation path,so analysis of the channel state data can achieve the perception of the human motion.By extracting the Channel State Information(CSI)related to human motion from the Wi-Fi signals and analyzing it with the introduced machine learning classification algorithm,the human motion in the spatial environment can be perceived.On the basis of this theory,this paper proposed an algorithm of human behavior recognition based on CSI wireless sensing to realize deviceless and over-the-air slide turning.This algorithm collects the environmental information containing upward or downward wave in a conference room scene,uses the local outlier factor detection algorithm to segment the actions,and then the time domain features are extracted to train Support Vector Machine(SVM)and eXtreme Gradient Boosting(XGBoost)classification modules.The experimental results show that the average accuracy of the XGBoost module sensing slide flipping can reach 94%,and the SVM module can reach 89%,so the module could be extended to the field of smart classroom and significantly improve speech efficiency.展开更多
基金supported by the National Natural Science Foundation of China(No.62006135)the Natural Science Foundation of Shandong Province(No.ZR2020QF116)。
文摘With the intensifying aging of the population,the phenomenon of the elderly living alone is also increasing.Therefore,using modern internet of things technology to monitor the daily behavior of the elderly in indoors is a meaningful study.Video-based action recognition tasks are easily affected by object occlusion and weak ambient light,resulting in poor recognition performance.Therefore,this paper proposes an indoor human behavior recognition method based on wireless fidelity(Wi-Fi)perception and video feature fusion by utilizing the ability of Wi-Fi signals to carry environmental information during the propagation process.This paper uses the public WiFi-based activity recognition dataset(WIAR)containing Wi-Fi channel state information and essential action videos,and then extracts video feature vectors and Wi-Fi signal feature vectors in the datasets through the two-stream convolutional neural network and standard statistical algorithms,respectively.Then the two sets of feature vectors are fused,and finally,the action classification and recognition are performed by the support vector machine(SVM).The experiments in this paper contrast experiments between the two-stream network model and the methods in this paper under three different environments.And the accuracy of action recognition after adding Wi-Fi signal feature fusion is improved by 10%on average.
文摘Recently,human healthcare from body sensor data has gained considerable interest from a wide variety of human-computer communication and pattern analysis research owing to their real-time applications namely smart healthcare systems.Even though there are various forms of utilizing distributed sensors to monitor the behavior of people and vital signs,physical human action recognition(HAR)through body sensors gives useful information about the lifestyle and functionality of an individual.This article concentrates on the design of an Improved Transient Search Optimization with Machine Learning based BehaviorRecognition(ITSOMLBR)technique using body sensor data.The presented ITSOML-BR technique collects data from different body sensors namely electrocardiography(ECG),accelerometer,and magnetometer.In addition,the ITSOML-BR technique extract features like variance,mean,skewness,and standard deviation.Moreover,the presented ITSOML-BR technique executes a micro neural network(MNN)which can be employed for long term healthcare monitoring and classification.Furthermore,the parameters related to the MNN model are optimally selected via the ITSO algorithm.The experimental result analysis of the ITSOML-BR technique is tested on the MHEALTH dataset.The comprehensive comparison study reported a higher result for the ITSOMLBR approach over other existing approaches with maximum accuracy of 99.60%.
基金the National Natural Science Foundation of China(No.61834005,61772417,61802304)the Shaanxi Province Key Research and Development Project(2021GY280).
文摘Compared with RGB videos and images,human bone data is less vulnerable to external factors and has stronger robustness.Therefore,behavior recognition methods based on skeletons are widely studied.Because graph convolution network(GCN)can deal with the irregular topology data of hu-man skeletons very well,more and more researchers apply GCN to human behavior recognition.Tra-ditional graph convolution methods only consider the joints with physical connectivity or the same type when building the behavior recognition model based on human skeletons structure,which cannot capture higher-order information better.To solve this problem,Motif-GCN is used in this paper to ex-tract spatial features.The relationship between the joints with natural connection in the human body is encoded by the first Motif-GCN,and the possible relationship between the unconnected joints in the human skeleton is encoded by the second Motif-GCN.In this way,the relationship between non-physical joints can be strengthened.Then a two stream framework combining joint and bone informa-tion is used to capture more action information.Finally,experiments are conducted on two subdata-sets X-Sub and X-View of NTU-RGB+D,and the accuracy shown in Top-1 classification results is 89.5%and 95.4%respectively.The experimental results are 1.0%and 0.3%higher than those of the 2S-AGCN model respectively.The superiority of this method is also proved by the experimental results.
文摘Building indoor dangerous behavior recognition is a specific application in the field of abnormal human recognition.A human dangerous behavior recognition method based on LSTM-GCN with attention mechanism(GLA)model was proposed aiming at the problem that the existing human skeleton-based action recognition methods cannot fully extract the temporal and spatial features.The network connects GCN and LSTMnetwork in series,and inputs the skeleton sequence extracted by GCN that contains spatial information into the LSTM layer for time sequence feature extraction,which fully excavates the temporal and spatial features of the skeleton sequence.Finally,an attention layer is designed to enhance the features of key bone points,and Softmax is used to classify and identify dangerous behaviors.The dangerous behavior datasets are derived from NTU-RGB+D and Kinetics data sets.Experimental results show that the proposed method can effectively identify some dangerous behaviors in the building,and its accuracy is higher than those of other similar methods.
基金supported by the National Natural Science Foundation of China(No.61971439 and No.61702543)the Natural Science Foundation of the Jiangsu Province of China(No.BK20191329)+1 种基金the China Postdoctoral Science Foundation Project(No.2019T120987)the Startup Foundation for Introducing Talent of NUIST(No.2020r100).
文摘Communication behavior recognition is an issue with increasingly importance in the antiterrorism and national defense area.However,the sensing data obtained in actual environment is often not sufficient to accurately analyze the communication behavior.Traditional means can hardly utilize the scarce and crude spectrum sensing data captured in a real scene.Thus,communication behavior recognition using raw sensing data under smallsample condition has become a new challenge.In this paper,a data enhanced communication behavior recognition(DECBR)scheme is proposed to meet this challenge.Firstly,a preprocessing method is designed to make the raw spectrum data suitable for the proposed scheme.Then,an adaptive convolutional neural network structure is exploited to carry out communication behavior recognition.Moreover,DCGAN is applied to support data enhancement,which realize communication behavior recognition under small-sample condition.Finally,the scheme is verified by experiments under different data size.The results show that the DECBR scheme can greatly improve the accuracy and efficiency of behavior recognition under smallsample condition.
基金the National Natural Science Foundation of China(No.61772417,61634004,61602377)Key R&D Program Projects in Shaanxi Province(No.2017GY-060)Shaanxi Natural Science Basic Research Project(No.2018JM4018).
文摘In order to effectively solve the problems of low accuracy,large amount of computation and complex logic of deep learning algorithms in behavior recognition,a kind of behavior recognition based on the fusion of 3 dimensional batch normalization visual geometry group(3D-BN-VGG)and long short-term memory(LSTM)network is designed.In this network,3D convolutional layer is used to extract the spatial domain features and time domain features of video sequence at the same time,multiple small convolution kernels are stacked to replace large convolution kernels,thus the depth of neural network is deepened and the number of network parameters is reduced.In addition,the latest batch normalization algorithm is added to the 3-dimensional convolutional network to improve the training speed.Then the output of the full connection layer is sent to LSTM network as the feature vectors to extract the sequence information.This method,which directly uses the output of the whole base level without passing through the full connection layer,reduces the parameters of the whole fusion network to 15324485,nearly twice as much as those of 3D-BN-VGG.Finally,it reveals that the proposed network achieves 96.5%and 74.9%accuracy in the UCF-101 and HMDB-51 respectively,and the algorithm has a calculation speed of 1066 fps and an acceleration ratio of 1,which has a significant predominance in velocity.
基金Supported by the Shaanxi Province Key Research and Development Project(No.2021 GY-280)the Natural Science Foundation of Shaanxi Province(No.2021JM-459)the National Natural Science Foundation of China(No.61772417,61634004,61602377).
文摘In order to effectively solve the problems of low accuracy and large amount of calculation of current human behavior recognition,a behavior recognition algorithm based on squeeze-and-excitation network(SENet) combined with 3 D Inception network(I3 D) and gated recurrent unit(GRU) network is proposed.The algorithm first expands the Inception module to three-dimensional,and builds a network based on the three-dimensional module,and expands SENet to three-dimensional,making it an attention mechanism that can pay attention to the three-dimensional channel.Then SENet is introduced into the 13 D network,named SE-I3 D,and SENet is introduced into the CRU network,named SE-GRU.And,SE-13 D and SE-GRU are merged,named SE-13 D-GRU.Finally,the network uses Softmax to classify the results in the UCF-101 dataset.The experimental results show that the SE-I3 D-GRU network achieves a recognition rate of 93.2% on the UCF-101 dataset.
基金Supported by the Shaanxi Province Key Research and Development Project (No. 2021GY-280)Shaanxi Province Natural Science Basic Research Program (No. 2021JM-459)the National Natural Science Foundation of China (No. 61772417)
文摘Because behavior recognition is based on video frame sequences,this paper proposes a behavior recognition algorithm that combines 3D residual convolutional neural network(R3D)and long short-term memory(LSTM).First,the residual module is extended to three dimensions,which can extract features in the time and space domain at the same time.Second,by changing the size of the pooling layer window the integrity of the time domain features is preserved,at the same time,in order to overcome the difficulty of network training and over-fitting problems,the batch normalization(BN)layer and the dropout layer are added.After that,because the global average pooling layer(GAP)is affected by the size of the feature map,the network cannot be further deepened,so the convolution layer and maxpool layer are added to the R3D network.Finally,because LSTM has the ability to memorize information and can extract more abstract timing features,the LSTM network is introduced into the R3D network.Experimental results show that the R3D+LSTM network achieves 91%recognition rate on the UCF-101 dataset.
文摘In the process of human behavior recognition, the traditional dense optical flow method has too many pixels and too much overhead, which limits the running speed. This paper proposed a method combing YOLOv3 (You Only Look Once v3) and local optical flow method. Based on the dense optical flow method, the optical flow modulus of the area where the human target is detected is calculated to reduce the amount of computation and save the cost in terms of time. And then, a threshold value is set to complete the human behavior identification. Through design algorithm, experimental verification and other steps, the walking, running and falling state of human body in real life indoor sports video was identified. Experimental results show that this algorithm is more advantageous for jogging behavior recognition.
基金supported by the National Natural Science Foundation of China(62071069)。
文摘User behavior prediction has become a core element to Internet of Things(IoT)and received promising attention in the related fields.Many existing IoT systems(e.g.smart home systems)have been deployed various sensors and the user’s behavior can be predicted through the sensor data.However,most of the existing sensor-based systems use the annotated behavior data which requires human intervention to achieve the behavior prediction.Therefore,it is a challenge to provide an automatic behavior prediction model based on the original sensor data.To solve the problem,this paper proposed a novel automatic annotated user behavior prediction(AAUBP)model.The proposed AAUBP model combined the Discontinuous Solving Order Sequence Mining(DVSM)behavior recognition model and behavior prediction model based on the Long Short Term Memory(LSTM)network.To evaluate the model,we performed several experiments on a real-world dataset tuning the parameters.The results showed that the AAUBP model can effectively recognize behaviors and had a good performance for behavior prediction.
文摘This paper proposes the research on human body behavior recognition based on vision. Behavior based on high-level human structure can describe behavior more accurately, but it is dif? cult to extract the behavioral characteristics while often relying on the accuracy of the human pose estimation. Moving object extraction of the moving targets in video analysis as the main content, research based on the image sequence robust, fast moving target extraction, motion estimation and target description algorithm, and the correlation between motion detection is to use frame, frame by comparing the difference between for change and not change area. The model is proposed based on the probability theory, and the future research will be focused on the simulation.
基金supported by the National Key R&D Program of China(Grant Nos. 2019YFF0301802, 2019YFB2004802, and 2018YFF0300605)National Natural Science Foundation of China (Grant Nos. 51975541 and51975542)+1 种基金Applied Fundamental Research Program of Shanxi Province(Grant No. 201901D211281)National Defense Fundamental Research Project and Program for the Innovative Talents of Higher Education Institutions of Shanxi
文摘Effective collection,recognition,and analysis of sports information is the key to intelligent sports,which can help athletes to improve their skills and formulate scientific training plans and competition strategies.At present,wearable electronic devices used for movement monitoring still have some limitations,such as high cost and energy consumption,incompatibility of suitable flexibility and personalized spatial structure,dissatisfactory data analysis methods,etc.In this work,a novel three-dimensionalprinted thermoplastic polyurethane is introduced as the elastic shell and friction layer,and it endows the proposed customizable and flexible triboelectric nanogenerator(CF-TENG)with personalized spatial structure and robust correlation to external pressure.In practical application,it exhibits highly sensitive responses to the joint-bending motion of the finger,wrist,or elbow.Furthermore,a pressure-sensing insole and smart ski pole based on CF-TENG are manufactured to build a comprehensive sports monitoring system to transmit the athletes’motion information from feet and hands through the plantar pressure distribution and ski pole action.To recognize the movement status,the self-developed automatic peak recognition algorithm(P-Find)and machine learning algorithm(subspace K-Nearest Neighbors)were introduced to accurately distinguish the four typical motion behaviors and three primary sub-techniques of cross-country skiing,with accuracy rates of 98.2%and 100%.This work provides a novel strategy to promote the personalized applications of TENGs in intelligent sports.
基金This research was funded by the Science and Technology Department of Shaanxi Province,China,Grant Number 2019GY-036.
文摘The core technology in an intelligent video surveillance system is that detecting and recognizing abnormal behaviors timely and accurately.The key breakthrough point in recognizing abnormal behaviors is how to obtain the effective features of the picture,so as to solve the problem of recognizing them.In response to this difficulty,this paper introduces an adjustable jump link coefficients model based on the residual network.The effective coefficients for each layer of the network can be set after using this model to further improving the recognition accuracy of abnormal behavior.A convolution kernel of 1×1 size is added to reduce the number of parameters for the purpose of improving the speed of the model in this paper.In order to reduce the noise of the data edge,and at the same time,improve the accuracy of the data and speed up the training,a BN(Batch Normalization)layer is added before the activation function in this network.This paper trains this network model on the public ImageNet dataset,and then uses the transfer learning method to recognize these abnormal behaviors of human in the UTI behavior dataset processed by the YOLO_v3 target detection network.Under the same experimental conditions,compared with the original ResNet-50 model,the improved model in this paper has a 2.8%higher accuracy in recognition of abnormal behaviors on the public UTI dataset.
文摘Aiming at the problem of automatic detection of normal operation behavior in self-service business management,with improved motion history image as input,a recognition method of convolutional neural network is proposed to timely judge the occurrence of anomie behavior.Firstly,the key frame sequence was extracted from the self-service operation video based on the method of uniform energy down-sampling.Secondly,combined with the timing information of key frames to adaptively estimate the decay parameters of the motion history image,adding information contrast to generating a logic matrix can improve the calculation speed of the improved motion history image.Finally,the formed motion history image was input into the established convolutional neural network to obtain the class of self-service behavior and distinguish anomie behavior.In real scenarios of self-service baggage check-in for civil aviation passengers,the typical check-in behavior data set is established and tested in actual self-service baggage check-in system of the airport.The results show that the method proposed can effectively identify typical anomie behaviors and has high practical value.
基金supported by Basic Science Research Program through the NationalResearch Foundation of Korea (NRF)funded by the Ministry of Education (2020R1A6A1A03040583).
文摘With the increasing number of digital devices generating a vast amount of video data,the recognition of abnormal image patterns has become more important.Accordingly,it is necessary to develop a method that achieves this task using object and behavior information within video data.Existing methods for detecting abnormal behaviors only focus on simple motions,therefore they cannot determine the overall behavior occurring throughout a video.In this study,an abnormal behavior detection method that uses deep learning(DL)-based video-data structuring is proposed.Objects and motions are first extracted from continuous images by combining existing DL-based image analysis models.The weight of the continuous data pattern is then analyzed through data structuring to classify the overall video.The performance of the proposed method was evaluated using varying parameter settings,such as the size of the action clip and interval between action clips.The model achieved an accuracy of 0.9817,indicating excellent performance.Therefore,we conclude that the proposed data structuring method is useful in detecting and classifying abnormal behaviors.
文摘With the development of wireless technology, Frequency-Modulated Continuous Wave (FMCW) radar has increased sensing capability and can be used to recognize human activity. These applications have gained wide-spread attention and become a hot research area. FMCW signals reflected by target activity can be collected, and human activity can be recognized based on the measurements. This paper focused on human activity recognition based on FMCW and DenseNet. We collected point clouds from FMCW and analyzed them to recognize human activity because different activities could lead to unique point cloud features. We built and trained the neural network to implement human activities using a FMCW signal. Firstly, this paper presented recent reviews about human activity recognition using wireless signals. Then, it introduced the basic concepts of FMCW radar and described the fundamental principles of the system using FMCW radar. We also provided the system framework, experiment scenario, and DenseNet neural network structure. Finally, we presented the experimental results and analyzed the accuracy of different neural network models. The system achieved recognition accuracy of 100 percent for five activities using the DenseNet. We concluded the paper by discussing the current issues and future research directions.
基金supported by the“MOST”under Grant No.104-2221-E-259-024-MY2
文摘Due to the increasing demand for security, the development of intelligent surveillance systems has attracted considerable attention in recent years. This study aims to develop a system that is able to identify whether or not the people need help in a public place. Different from previous work, our work considers not only the behaviors of the target person but also the interaction between him and nearby people. In the paper, we propose an event alarm system which can detect the human behaviors and recognize the happening event through integrating the results generated from the single and group behavior analysis. Several new effective features are proposed in the study. Besides, a mechanism capable of extracting one-to-one and multiple-to-one relations is also developed. Experimental results show that the proposed approach can correctly detect human behaviors and provide the alarm messages when emergency events occur.
基金the Science and Technology Program of State Grid Corporation of China(No.5211TZ1900S6)。
文摘Safety production is of great significance to the development of enterprises and society.Accidents often cause great losses because of the particularity environment of electric power.Therefore,it is important to improve the safety supervision and protection in the electric power environment.In this paper,we simulate the actual electric power operation scenario by monitoring equipment and propose a real-time detection method of illegal actions based on human body key points to ensure safety behavior in real time.In this method,the human body key points in video frames were first extracted by the high-resolution network,and then classified in real time by spatial-temporal graph convolutional network.Experimental results show that this method can effectively detect illegal actions in the simulated scene.
文摘With the advantages of real-time analysis and visual evaluation results,intelligent technology-enabled teaching behavior evaluation has gradually become a powerful means to help teachers adjust teaching behaviors and improve teaching quality.However,at present,the evaluation of intelligent teachers’behaviors is still in the preliminary exploration stage,and the application research is not deep enough.This paper analyzes the application of intelligent technology in the evaluation of teachers’classroom teaching behaviors from the perspectives of evaluation data,methods,and results.Voice print recognition technology is used to recognize the teachers’identities and track the speech in the classroom videos,and the videos are segmented.Then,the evaluation framework of teachers’classroom teaching behaviors is constructed using three dimensions of emotion,posture,and position preference.Finally,evaluation results are presented to teachers in a more intuitive and easy-to-understand visual way,to help teachers reflect on teaching.This paper aims to promote the transformation of teachers’classroom teaching behavior evaluation toward an intelligent,efficient,and sustainable direction through current research.
基金supported by the Special Zone Project of National Defense Innovation.
文摘When a human body moves within the coverage range of Wi-Fi signals,the reflected Wi-Fi signals by the various parts of the human body change the propagation path,so analysis of the channel state data can achieve the perception of the human motion.By extracting the Channel State Information(CSI)related to human motion from the Wi-Fi signals and analyzing it with the introduced machine learning classification algorithm,the human motion in the spatial environment can be perceived.On the basis of this theory,this paper proposed an algorithm of human behavior recognition based on CSI wireless sensing to realize deviceless and over-the-air slide turning.This algorithm collects the environmental information containing upward or downward wave in a conference room scene,uses the local outlier factor detection algorithm to segment the actions,and then the time domain features are extracted to train Support Vector Machine(SVM)and eXtreme Gradient Boosting(XGBoost)classification modules.The experimental results show that the average accuracy of the XGBoost module sensing slide flipping can reach 94%,and the SVM module can reach 89%,so the module could be extended to the field of smart classroom and significantly improve speech efficiency.