Funding: Supported by the State Grid Science & Technology Project of China (5400-202224153A-1-1-ZN).
Abstract: Expanding photovoltaic (PV) resources in rural-grid areas is an essential means of increasing the share of solar energy in the energy landscape, in line with the “carbon peaking and carbon neutrality” objectives. However, rural power grids often lack digitalization; thus, the load distribution within these areas is not fully known. This hinders the calculation of the available PV capacity and the deduction of node voltages. This study proposes a load-distribution modeling approach based on remote-sensing image recognition, in pursuit of a scientific framework for developing distributed PV resources in rural grid areas. First, houses in remote-sensing images are accurately recognized using deep-learning techniques based on the YOLOv5 model. The distribution of the houses is then used to estimate the load distribution in the grid area. Next, equally spaced and clustered distribution models are used to adaptively determine the location of the nodes and the load power in the distribution lines. Finally, by calculating the connectivity matrix of the nodes, a minimum spanning tree is extracted, the topology of the network is constructed, and the node parameters of the load-distribution model are calculated. The proposed scheme is implemented in a software package, and its efficacy is demonstrated by analyzing typical remote-sensing images of rural grid areas. The results underscore the ability of the proposed approach to discern the distribution-line structure and compute the node parameters, thereby offering vital support for determining PV access capability.
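The final step of the abstract above (connectivity matrix, then minimum spanning tree) can be sketched as follows. This is only an illustration, not the authors' software: it assumes the connectivity matrix holds pairwise Euclidean distances and uses Prim's algorithm, neither of which the abstract specifies. The node coordinates and the function names `distance_matrix` and `minimum_spanning_tree` are hypothetical.

```python
import math

def distance_matrix(points):
    """Pairwise Euclidean distances between node coordinates (assumed edge weights)."""
    return [[math.dist(p, q) for q in points] for p in points]

def minimum_spanning_tree(weights):
    """Prim's algorithm on a dense symmetric weight matrix.

    Returns a list of (parent, child) index pairs that connect every node.
    """
    n = len(weights)
    if n == 0:
        return []
    in_tree = [False] * n
    best_cost = [math.inf] * n   # cheapest known edge into each node
    best_parent = [-1] * n       # tree node that offers that edge
    best_cost[0] = 0.0           # start growing the tree from node 0
    edges = []
    for _ in range(n):
        # Pick the cheapest node that is not yet part of the tree.
        u = min((i for i in range(n) if not in_tree[i]), key=lambda i: best_cost[i])
        in_tree[u] = True
        if best_parent[u] >= 0:
            edges.append((best_parent[u], u))
        # Relax the edges from the newly added node.
        for v in range(n):
            if not in_tree[v] and weights[u][v] < best_cost[v]:
                best_cost[v] = weights[u][v]
                best_parent[v] = u
    return edges

# Hypothetical node positions in metres, e.g. cluster centres of detected houses.
nodes = [(0.0, 0.0), (10.0, 0.0), (20.0, 0.0), (10.0, 8.0)]
tree = minimum_spanning_tree(distance_matrix(nodes))
```

For these four made-up nodes the tree has three edges, all meeting at node 1, which is the kind of trunk-and-branch line structure such a topology step would be expected to produce.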
Funding: This work is supported by the Natural Science Foundation of China (Grant No. 61903056); the Major Project of Science and Technology Research Program of Chongqing Education Commission of China (Grant No. KJZDM201900601); the Chongqing Research Program of Basic Research and Frontier Technology (Grant Nos. cstc2019jcyj-msxmX0681, cstc2021jcyj-msxmX0530, and cstc2021jcyjmsxmX0761); the Chongqing Municipal Key Laboratory of Institutions of Higher Education (Grant No. cqupt-mct-201901); the Chongqing Key Laboratory of Mobile Communications Technology (Grant No. cqupt-mct-202002); and the Engineering Research Center of Mobile Communications, Ministry of Education (Grant No. cqupt-mct202006).
Abstract: Video-oriented facial expression recognition has always been an important issue in emotion perception. At present, the key challenge in most existing methods is how to effectively extract robust features that characterize the changes in facial appearance and geometry caused by facial motions. On this basis, the video in this paper is divided into multiple segments, each of which is simultaneously described by optical flow and facial landmark trajectory. To mine the emotional information in these two representations, we propose a Deep Spatiotemporal Network with Dual-flow Fusion (DSN-DF), which highlights the region and strength of expressions through spatiotemporal appearance features and the speed of change through spatiotemporal geometry features. Finally, experiments on the CK+ and MMI datasets demonstrate the superiority of the proposed method.
Funding: Supported by the “Human Resources Program in Energy Technology” of the Korea Institute of Energy Technology Evaluation and Planning (KETEP), granted financial resources from the Ministry of Trade, Industry & Energy, Republic of Korea (No. 20204010600090).
Abstract: Gait recognition is an active research area that uses a person's walking pattern to identify the subject correctly. Human Gait Recognition (HGR) is performed without any cooperation from the individual. However, in practice, it remains a challenging task under diverse walking sequences because of covariant factors such as normal walking and walking while wearing a coat. Over the years, researchers have successfully identified subjects using different techniques, but there is still room for improvement in accuracy because of these covariant factors. This paper proposes an automated model-free framework for human gait recognition. The proposed method has four critical steps. First, optical flow-based motion region estimation and dynamic coordinates-based cropping are performed. The second step fine-tunes a pre-trained MobileNetV2 model on both the original and the optical flow cropped frames; the training is conducted using static hyperparameters. The third step applies a fusion technique known as normal distribution serially fusion. In the fourth step, an improved optimization algorithm is applied to select the best features, which are then classified using a Bi-Layered neural network. Three publicly available datasets, CASIA A, CASIA B, and CASIA C, were used in the experiments, yielding average accuracies of 99.6%, 91.6%, and 95.02%, respectively. The proposed framework achieves improved accuracy compared with other methods.
Abstract: In human behavior recognition, the traditional dense optical flow method processes too many pixels and incurs too much overhead, which limits the running speed. This paper proposes a method combining YOLOv3 (You Only Look Once v3) with a local optical flow method. Building on the dense optical flow method, the optical flow modulus is calculated only in the area where the human target is detected, which reduces the amount of computation and saves time. A threshold value is then set to complete the human behavior identification. Through algorithm design, experimental verification, and other steps, the walking, running, and falling states of the human body in real-life indoor sports video were identified. Experimental results show that this algorithm is more advantageous for jogging behavior recognition.
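The core saving described in the abstract above, computing the optical flow modulus only inside the detected region and then thresholding it, might look roughly like this. This is a sketch under assumptions, not the paper's code: the flow field is taken as an already computed grid of `(dx, dy)` displacements, the bounding box stands in for a detector output, and the threshold value and the labels `"fast"` and `"slow"` are invented for illustration.

```python
import math

def mean_flow_modulus(flow, box):
    """Average optical-flow magnitude inside a bounding box.

    flow: 2D grid (list of rows) of (dx, dy) displacement tuples, one per pixel.
    box:  (x0, y0, x1, y1) half-open pixel range, e.g. from a person detector.
    """
    x0, y0, x1, y1 = box
    total, count = 0.0, 0
    for row in flow[y0:y1]:
        for dx, dy in row[x0:x1]:
            total += math.hypot(dx, dy)
            count += 1
    return total / count if count else 0.0

def label_motion(modulus, threshold):
    """Hypothetical two-way decision; the abstract does not give thresholds or rules."""
    return "fast" if modulus >= threshold else "slow"

# Toy 4x4 flow field: motion only in the 2x2 block at rows 1-2, columns 1-2.
flow = [[(0.0, 0.0)] * 4 for _ in range(4)]
for y in (1, 2):
    for x in (1, 2):
        flow[y][x] = (3.0, 4.0)  # each moving pixel has magnitude 5

box = (1, 1, 3, 3)  # assumed detector output covering the moving block
modulus = mean_flow_modulus(flow, box)
label = label_motion(modulus, threshold=2.0)
```

Restricting the average to the box touches 4 pixels instead of 16 here, and it also keeps the static background from diluting the motion estimate.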
Funding: Supported by the National Science and Technology Support Program (2011BAK12B00); the International Cooperation Project of the Department of Science and Technology of Sichuan Province (2009HH0005); and the Project of the Department of Science and Technology of Sichuan Province (2015JY0235).
Abstract: Low-frequency infrasonic waves are emitted during the formation and movement of debris flows and are detectable within a radius of several kilometers, which is the precondition for monitoring debris flows remotely. However, false alarms often arise from simple alarm mechanisms under ambient noise interference. To improve the accuracy of infrasound monitoring for early warning of debris flows, the monitoring data must be analyzed to identify the infrasonic signals characteristic of debris flows. Therefore, a large number of debris flow infrasound and ambient noise recordings were collected from different sources and analyzed to summarize their frequency spectra, sound pressures, waveforms, durations, and other correlated characteristics. From these, the key characteristic parameters of the different sound sources were specified, and a recognition system was developed that identifies debris flow infrasonic signals in the monitoring signals. Simulation tests and long-term in-situ monitoring of debris flows in Jiangjia Gully, Dongchuan, China verified that the system has high accuracy and applicability. The recognition system can provide the local government and residents with accurate precautionary information about debris flows, supporting disaster mitigation and minimizing the loss of life and property.
Funding: National Natural Science Foundations of China (No. 61074140, No. 60974094); Young Teacher Development Support Project of Shandong University of Technology, China.
Abstract: A method for recognizing traffic flow change points is put forward based on traffic flow theory and the statistical change point analysis of multiple linear regressions. The method was calibrated and tested with field data from Liantong Road in Zibo city to verify the validity and feasibility of the theory. The results show that the change point method of multiple linear regression identifies the pattern of quantitative changes in traffic flow more accurately than ordinary methods. The change point method can therefore be applied more effectively in traffic information management systems.
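To make the idea of a regression change point concrete, here is a simplified sketch. It is not the method of the abstract above, which uses multiple linear regression: this version uses a single predictor (time) and picks the split that minimises the total squared error of two separate least-squares lines. The data and the names `fit_sse` and `find_change_point` are made up.

```python
def fit_sse(xs, ys):
    """Sum of squared residuals of the ordinary least-squares line through (xs, ys)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx if sxx else 0.0
    intercept = mean_y - slope * mean_x
    return sum((y - (slope * x + intercept)) ** 2 for x, y in zip(xs, ys))

def find_change_point(xs, ys, min_size=2):
    """Index i such that fitting xs[:i] and xs[i:] separately gives the lowest total SSE."""
    best_index, best_sse = None, float("inf")
    for i in range(min_size, len(xs) - min_size + 1):
        sse = fit_sse(xs[:i], ys[:i]) + fit_sse(xs[i:], ys[i:])
        if sse < best_sse:
            best_index, best_sse = i, sse
    return best_index

# Synthetic flow counts: a rising regime followed by a falling one.
times = list(range(10))
flows = [10, 12, 14, 16, 18, 40, 37, 34, 31, 28]
change_index = find_change_point(times, flows)
```

With this synthetic series the regime switches at index 5, where both segments are exactly linear, so that split has the lowest error.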
Abstract: Open-set recognition (OSR) is a realistic problem in wireless signal recognition: during the inference phase, unknown classes that were not seen in the training phase may appear. The method of intra-class splitting (ICS), which splits samples of known classes to imitate unknown classes, has achieved great performance. However, this approach relies too heavily on a predefined splitting ratio and may suffer severe performance degradation in a new environment. In this paper, we train a multi-task learning (MTL) network based on the characteristics of wireless signals to improve performance in new scenes. In addition, we provide a dynamic method that decides the splitting ratio per class to obtain more precise outer samples. Specifically, we perturb a sample from the center of one class toward its adversarial direction, and the change point of the confidence scores during this process is used as the splitting threshold. We conduct several experiments on one wireless signal dataset collected in the 2.4 GHz ISM band by LimeSDR and on one open modulation recognition dataset, and the analytical results demonstrate the effectiveness of the proposed method.
Funding: This work is supported by the National Natural Science Foundation of China (Grant Nos. 60773093, 60873209, and 60970107), the Key Program for Basic Research of Shanghai (Grant Nos. 09JC1407900, 09510701600, and 10511500100), IBM SUR Funding and IBM Research-China JP Funding, and the Key Lab of Information Network Security, Ministry of Public Security.
Funding: This work was supported by the National Natural Science Foundation of China (Nos. 62072405 and 62276233) and the Key Research Project of Zhejiang Province (No. 2023C01048).
Abstract: Accurate and efficient urban traffic flow prediction can help drivers identify road traffic conditions in real time, helping them avoid congestion and accidents to a certain extent. However, existing methods for real-time urban traffic flow prediction focus on improving the prediction accuracy or efficiency of the model while ignoring training efficiency, which results in a prediction system that lacks the scalability to integrate real-time traffic flow into the training procedure. To conduct accurate, real-time urban traffic flow prediction while considering the latest historical data and avoiding time-consuming online retraining, we propose a scalable system for Predicting short-term URban traffic flow in real-time based on license Plate recognition data (PURP). First, to ensure prediction accuracy, PURP constructs the spatio-temporal contexts of traffic flow prediction from License Plate Recognition (LPR) data as effective characteristics. Subsequently, to utilize recent data without retraining the model online, PURP uses the nonparametric k-Nearest Neighbor (KNN) method as the prediction framework, because KNN can efficiently identify the top-k most similar spatio-temporal contexts and make predictions based on them without time-consuming online retraining. The experimental results show that PURP retains strong prediction efficiency as the prediction period increases.
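The reason KNN avoids online retraining, as the abstract above argues, is that new observations are simply appended to the stored contexts. A minimal sketch of that idea follows. It is not PURP itself: the context layout (two recent flow counts), the Euclidean distance, the plain average over neighbours, and the name `knn_predict` are all assumptions made for illustration.

```python
import math

def knn_predict(history, query, k=3):
    """Predict the next flow as the mean outcome of the k most similar contexts.

    history: list of (context_vector, next_flow) pairs.
    query:   context vector for the moment to predict.
    """
    if not history:
        raise ValueError("history is empty")
    # Rank stored contexts by Euclidean distance to the query and keep the top k.
    nearest = sorted(history, key=lambda item: math.dist(item[0], query))[:k]
    return sum(flow for _, flow in nearest) / len(nearest)

# Hypothetical contexts: (flow two intervals ago, flow one interval ago) -> next flow.
history = [
    ((100.0, 110.0), 120.0),
    ((102.0, 108.0), 118.0),
    ((98.0, 112.0), 122.0),
    ((300.0, 310.0), 320.0),
]

# Folding in a fresh observation is just an append; no model is retrained.
history.append(((101.0, 109.0), 119.0))

prediction = knn_predict(history, (100.0, 109.0), k=3)
```

A production system would need an index structure to keep the neighbour search fast as the history grows; the linear scan here is only for clarity.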
Funding: This work was supported by the Scientific Research Fund of Hunan Provincial Education Department of China (Project No. 17A007) and the Teaching Reform and Research Project of Hunan Province of China (Project No. JG1615).
Abstract: The two-stream convolutional neural network exhibits excellent performance in video action recognition. Its core idea is to use the frames clipped from the videos and the optical flow images pre-extracted from those frames to train one model each, and finally to integrate the outputs of the two models. Nevertheless, the reliance on pre-extracting the optical flow impedes the efficiency of action recognition, and the temporal and spatial streams are simply fused at the end, so one stream may fail while the other succeeds. We propose a novel hidden two-stream collaborative (HTSC) learning network that hides the optical flow extraction steps inside the network and greatly speeds up action recognition. Building on the two-stream method, the two-stream collaborative learning model captures the interaction of the temporal and spatial features to greatly enhance recognition accuracy. Our proposed method achieves a good balance of efficiency and precision on large-scale video action recognition datasets.
Funding: Supported by the National Key R&D Program of China (2017YFB0802900).
Abstract: Correctly acquiring appropriate features is a primary problem in the field of network protocol recognition. To avoid the trouble of manually extracting features, as traditional methods require, and to improve recognition accuracy, a network protocol recognition method based on a Convolutional Neural Network (CNN) is proposed. The method uses deep learning and processes network flows automatically. First, the intercepted network flows are normalized and mapped into a two-dimensional matrix that is used as the input of the CNN. Then, an improved classification model named Ptr CNN is built, which can automatically extract the appropriate features of network protocols. Finally, the classification model is trained to recognize the network protocols. The proposed approach is compared with several machine learning methods. Experimental results show that the tailored CNN not only improves protocol recognition accuracy but also ensures fast convergence of the classification model and reduces the classification time.
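The preprocessing step in the abstract above, normalizing a flow and mapping it into a two-dimensional matrix, could be sketched as below. The abstract does not give the matrix size, the padding rule, or the normalization formula, so this example assumes a square matrix, zero padding or truncation to a fixed length, and byte values scaled to the range 0 to 1. The function name `flow_to_matrix` is hypothetical.

```python
def flow_to_matrix(payload: bytes, side: int = 4):
    """Map raw flow bytes to a side x side matrix of floats in [0, 1].

    Assumed scheme: truncate or zero-pad to side*side bytes, divide each
    byte by 255, then reshape in row-major order.
    """
    size = side * side
    padded = payload[:size].ljust(size, b"\x00")
    values = [b / 255.0 for b in padded]
    return [values[r * side:(r + 1) * side] for r in range(side)]

# The first 16 bytes of a made-up HTTP request become a 4x4 input matrix.
matrix = flow_to_matrix(b"GET / HTTP/1.1\r\n", side=4)
```

A fixed input shape is what lets a CNN consume flows of varying length; in practice the side length would be chosen much larger than 4.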
Abstract: Over the past decade, automatic traffic accident recognition has become a prominent objective in machine vision and pattern recognition because of its immense application potential in developing autonomous Intelligent Transportation Systems (ITS). In this paper, we present a new framework for real-time automated recognition of traffic accidents based on the Histogram of Flow Gradient (HFG) and statistical logistic regression analysis. First, optical flow is estimated and the HFG is constructed from video shots. Then, vehicle patterns are clustered based on the HFG features. The classifier model is generated by using logistic regression analysis to fit the data to logistic curves. Finally, the trajectory of the vehicle that caused the accident is determined and recorded. Experimental results on real video sequences demonstrate the efficiency and applicability of the framework and show that it is more robust and can comfortably provide latency guarantees for real-time surveillance and traffic monitoring applications.