Although sentiment analysis is pivotal to understanding user preferences,existing models face significant challenges in handling context-dependent sentiments,sarcasm,and nuanced emotions.This study addresses these cha...Although sentiment analysis is pivotal to understanding user preferences,existing models face significant challenges in handling context-dependent sentiments,sarcasm,and nuanced emotions.This study addresses these challenges by integrating ontology-based methods with deep learning models,thereby enhancing sentiment analysis accuracy in complex domains such as film reviews and restaurant feedback.The framework comprises explicit topic recognition,followed by implicit topic identification to mitigate topic interference in subsequent sentiment analysis.In the context of sentiment analysis,we develop an expanded sentiment lexicon based on domainspecific corpora by leveraging techniques such as word-frequency analysis and word embedding.Furthermore,we introduce a sentiment recognition method based on both ontology-derived sentiment features and sentiment lexicons.We evaluate the performance of our system using a dataset of 10,500 restaurant reviews,focusing on sentiment classification accuracy.The incorporation of specialized lexicons and ontology structures enables the framework to discern subtle sentiment variations and context-specific expressions,thereby improving the overall sentiment-analysis performance.Experimental results demonstrate that the integration of ontology-based methods and deep learning models significantly improves sentiment analysis accuracy.展开更多
In recent years,more and more directors of culture and tourism have taken part in the promotion of local cultural tourism by cross-dressing,talent shows,and pushing their limits on self-media platforms.This study inve...In recent years,more and more directors of culture and tourism have taken part in the promotion of local cultural tourism by cross-dressing,talent shows,and pushing their limits on self-media platforms.This study investigates short videos of Lingnan culture promoted by directors general and deputy directors general of the Culture,Radio,Television,Tourism,and Sports Bureau of counties and cities in Guangdong Province on social media by the method of multimodal critical discourse analysis.The analysis of 33 videos shows that Lingnan culture is a domineering and confident culture,historical culture,graceful and elegant culture,and vibrant and active culture.Domineering and confident culture is embedded in the utterances and behaviors of the directors general or deputy directors general in the video.Historical culture is realized through the conversation with historical figures through time travel.Graceful and elegant culture is constructed in the depiction of sceneries and the depiction of characters’manners.Vibrant and active culture is represented in the depiction of the characters’actional process and analytical process.展开更多
In today’s information age,video data,as an important carrier of information,is growing explosively in terms of production volume.The quick and accurate extraction of useful information from massive video data has be...In today’s information age,video data,as an important carrier of information,is growing explosively in terms of production volume.The quick and accurate extraction of useful information from massive video data has become a focus of research in the field of computer vision.AI dynamic recognition technology has become one of the key technologies to address this issue due to its powerful data processing capabilities and intelligent recognition functions.Based on this,this paper first elaborates on the development of intelligent video AI dynamic recognition technology,then proposes several optimization strategies for intelligent video AI dynamic recognition technology,and finally analyzes the performance of intelligent video AI dynamic recognition technology for reference.展开更多
The Chinese microblog text is short,full of noise data and emoticons,and the words are often irregularly.For these characteristics,we proposed a fine-grained emotion analysis method.Combined with TF-IDF and variance s...The Chinese microblog text is short,full of noise data and emoticons,and the words are often irregularly.For these characteristics,we proposed a fine-grained emotion analysis method.Combined with TF-IDF and variance statistics,we realized a method of calculating multi-class feature selection.We judge the text whether it is positive or negative first,then choose the fine-grained emotional tendency.And we get good result with the test using COAE data set.Compared with other method for feature selection and other emotional library,we did better.展开更多
Several initiatives have been launched to help prevention of traffic accidents and near-accidents across the European Union. To aid the overall goal of reducing deaths and injuries related to traffic, one must underst...Several initiatives have been launched to help prevention of traffic accidents and near-accidents across the European Union. To aid the overall goal of reducing deaths and injuries related to traffic, one must understand the causation of the traffic accidents in order to prevent them. Rather than deploying a person to physically monitor a location, the task is eased by camera equipment installed in existing infrastructure, e.g. poles, and buildings, etc. In rural areas there is however a very limited infrastructure available which complicates the data acquisition. But even if there is infrastructure available in either the rural area or the urban area, this might not serve as an ideal position to capture video data from. In this work, we survey and provide an overview of available and relevant portable poles setups with respect to capturing data in both urban areas and rural areas. The conclusion of the survey shows a lack of a mobile, lightweight, compact, and easy deployable portable pole. We therefore design and develop a new portable pole meeting these requirements. The new proposed portable pole can be deployed by 2 persons in 2 hours in both rural areas as well as urban areas due to its compactness. The deployment and usage of the new portable pole is a complimentary tool, which may improve the camera capturing angle in case existing infrastructure is insufficient. This ultimately improves the traffic monitoring opportunities. Further, the survey of selected portable poles provides an excellent overview and can aid multiple applications within road traffic.展开更多
Objective:Saccades accompanied by normal gain in video head impulse tests(vHIT)are often observed in patients with vestibular migraine(VM).However,they are not considered as an independent indicator,reducing their uti...Objective:Saccades accompanied by normal gain in video head impulse tests(vHIT)are often observed in patients with vestibular migraine(VM).However,they are not considered as an independent indicator,reducing their utility in diagnosing VM.To better understand clinical features of VM,it is necessary to understand raw saccades data.Methods:Fourteen patients with confirmed VM,45 patients with probable VM(p-VM)and 14 agematched healthy volunteers were included in this study.Clinical findings related to spontaneous nystagmus(SN),positional nystagmus(PN),head-shaking nystagmus(HSN),caloric test and vHIT were recorded.Raw saccades data were exported and numbered by their sequences,and their features analyzed.Results:VM patients showed no SN,PN or HSN,and less than half of them showed unilateral weakness(UW)on caloric test.The first saccades from lateral semicircular canal stimulation were the most predominant for both left and right sides.Neither velocity nor time parameters were significantly different when compared between the two sides.Most VM patients(86%)exhibited small saccades,around 35%of the head peak velocity,with a latency of 200e400 ms.Characteristics of saccades were similar in patients with p-VM.Only four normal subjects showed saccades,all unilateral and seemingly random.Conclusions:Small saccades involving bilateral semicircular canals with a scattered distribution pattern are common in patients with VM and p-VM.展开更多
Semantic video analysis plays an important role in the field of machine intelligence and pattern recognition. In this paper, based on the Hidden Markov Model (HMM), a semantic recognition framework on compressed video...Semantic video analysis plays an important role in the field of machine intelligence and pattern recognition. In this paper, based on the Hidden Markov Model (HMM), a semantic recognition framework on compressed videos is proposed to analyze the video events according to six low-level features. After the detailed analysis of video events, the pattern of global motion and five features in foreground—the principal parts of videos, are employed as the observations of the Hidden Markov Model to classify events in videos. The applications of the proposed framework in some video event detections demonstrate the promising success of the proposed framework on semantic video analysis.展开更多
Foreground detection methods can be applied to efficiently distinguish foreground objects including moving or static objects from back- ground which is very important in the application of video analysis, especially v...Foreground detection methods can be applied to efficiently distinguish foreground objects including moving or static objects from back- ground which is very important in the application of video analysis, especially video surveillance. An excellent background model can obtain a good foreground detection results. A lot of background modeling methods had been proposed, but few comprehensive evaluations of them are available. These methods suffer from various challenges such as illumination changes and dynamic background. This paper first analyzed advantages and disadvantages of various background modeling methods in video analysis applications and then compared their performance in terms of quality and the computational cost. The Change detection.Net (CDnet2014) dataset and another video dataset with different envi- ronmental conditions (indoor, outdoor, snow) were used to test each method. The experimental results sufficiently demonstrated the strengths and drawbacks of traditional and recently proposed state-of-the-art background modeling methods. This work is helpful for both researchers and engineering practitioners. Codes of background modeling methods evaluated in this paper are available atwww.yongxu.org/lunwen.html.展开更多
With the rapid development of science and technology,video has become a very important way to demonstrate the city itself.This paper,based on Fairclough's three-dimensional conception of discourse to the video of ...With the rapid development of science and technology,video has become a very important way to demonstrate the city itself.This paper,based on Fairclough's three-dimensional conception of discourse to the video of Xitang,analyzes the on-screen titles of this video to reveal the ideology behind it.Through the analysis,it points out that this video,while protecting Xitang's own unique traditional characteristics,it tries to cater for the needs not only Chinese people but also foreigners.To some extent,it tries to promote Xitang itself.展开更多
The main research motive is to analysis and to veiny the inherent nonlinear character of MPEG-4 video. The power spectral density estimation of the video trafiic describes its 1/f^β and periodic characteristics.The p...The main research motive is to analysis and to veiny the inherent nonlinear character of MPEG-4 video. The power spectral density estimation of the video trafiic describes its 1/f^β and periodic characteristics.The priraeipal compohems analysis of the reconstructed space dimension shows only several principal components can be the representation of all dimensions. The correlation dimension analysis proves its fractal characteristic. To accurately compute the largest Lyapunov exponent, the video traffic is divided into many parts.So the largest Lyapunov exponent spectrum is separately calculated using the small data sets method. The largest Lyapunov exponent spectrum shows there exists abundant nonlinear chaos in MPEG-4 video traffic. The conclusion can be made that MPEG-4 video traffic have complex nonlinear be havior and can be characterized by its power spectral density,principal components, correlation dimension and the largest Lyapunov exponent besides its common statistics.展开更多
During the past decade, feature extraction and knowledge acquisition based on video analysis have been extensively researched and tested on many applications such as closed-circuit television (CCTV) data analysis, l...During the past decade, feature extraction and knowledge acquisition based on video analysis have been extensively researched and tested on many applications such as closed-circuit television (CCTV) data analysis, large-scale public event control, and other daily security monitoring and surveillance operations with various degrees of success. However, since the actual video process is a multi-phased one and encompasses extensive theories and techniques ranging from fundamental image processing, computational geometry and graphics, and machine vision, to advanced artificial intelligence, pattern analysis, and even cognitive science, there are still many important problems to resolve before it can be widely applied. Among them, video event identification and detection are two prominent ones. Comparing with the most popular frame-to-frame processing mode of most of today's approaches and systems, this project reorganizes video data as a 3D volume structure that provides the hybrid spatial and temporal information in a unified space. This paper reports an innovative technique to transform original video frames to 3D volume structures denoted by spatial and temporal features. It then highlights the volume array structure in a so-called "pre-suspicion" mechanism for a later process. The focus of this report is the development of an effective and efficient voxel-based segmentation technique suitable to the volumetric nature of video events and ready for deployment in 3D clustering operations. The paper is concluded with a performance evaluation of the devised technique and discussion on the future work for accelerating the pre-processing of the original video data.展开更多
:Strabismus is a medical condition that is defined as the lack of coordination between the eyes.When Strabismus is detected at an early age,the chances of curing it are higher.The methods used to detect strabismus and...:Strabismus is a medical condition that is defined as the lack of coordination between the eyes.When Strabismus is detected at an early age,the chances of curing it are higher.The methods used to detect strabismus and measure its degree of deviation are complex and time-consuming,and they always require the presence of a physician.In this paper,we present a method of detecting strabismus and measuring its degree of deviation using videos of the patient’s eye region under a cover test.Our method involves extracting features from a set of training videos(training corpora)and using them to build a classifier.A decision tree(ID3)is built using labeled cases from actual strabismus diagnosis.Patterns are extracted from the corresponding videos of patients,and an association between the extracted features and actual diagnoses is established.Matching Rules from the correlation plot are used to predict diagnoses for future patients.The classifier was tested using a set of testing videos(testing corpora).The results showed 95.9%accuracy,4.1%were light cases and could not be detected correctly from the videos,half of them were false positive and the other half was false negative.展开更多
The increasing use of digital video everyday in a multitude of electronic devices, including mobile phones, tablets and laptops, poses the need for quick development of cross-platform video software. However current a...The increasing use of digital video everyday in a multitude of electronic devices, including mobile phones, tablets and laptops, poses the need for quick development of cross-platform video software. However current approaches to this direction usually require a long learning curve, and their development lacks standardization. This results in software components that are difficult to reuse, and hard to maintain or extend. In order to overcome such issues, we propose a novel object-oriented framework for efficient development of software systems for video analysis. It consists of a set of four abstract components, suitable for the implementation of independent plug-in modules for video acquisition, preprocessing, analysis and output handling. The extensibility of each module can be facilitated by sub-modules specifying additional functionalities. This architecture enables quick responses to changes and re-configurability;thus conforming to the requirements of agile software development practices. Considering the need for platform independency, the proposed Java Video Analysis (JVA) framework is implemented in Java. It is publicly available through the web as open-access software, supported by a growing collection of implemented modules. Its efficiency is empirically validated for the development of a representative video analysis system.展开更多
Trials is a specialty of off-road cycling in which the rider has to face obstacle courses without resting feet on the ground. Technique in this sport has a great importance, since it reduces the risk of committing pen...Trials is a specialty of off-road cycling in which the rider has to face obstacle courses without resting feet on the ground. Technique in this sport has a great importance, since it reduces the risk of committing penalties and allows more efficient execution of the gesture. To improve technique, the motion analysis allows to study the gesture both qualitatively and quantitatively. In this work video analysis was used to study the side hop from rear wheel technique. Two different executions of this technique were analyzed. The primary purpose is the identification of the phases that make up the technical gesture. It was given an explanation to the movement strategies adopted in the execution of the jump in the two different situations.展开更多
From WeChat,QQ to Weibo,Tik Tok this series of online social networking applications,especially with the increasingly maturity of online shopping in recent years,short video and mobile intelligence have been developin...From WeChat,QQ to Weibo,Tik Tok this series of online social networking applications,especially with the increasingly maturity of online shopping in recent years,short video and mobile intelligence have been developing rapidly,which have strongly influenced many people’s living habits and shopping habits.Their development not only promotes the social and economic development,but also produces a new form of visual communication.This paper discusses the characteristics of short video,analyzes the shift of public visual consumption behind the popularity of short video,and hopes to further build a healthy development path of short video.展开更多
This paper describes an approach to identify epicyclic and tricyclic motion during projectile flight caused by mass asymmetries in spinstabilized projectiles. Flight video was captured following projectile launch of s...This paper describes an approach to identify epicyclic and tricyclic motion during projectile flight caused by mass asymmetries in spinstabilized projectiles. Flight video was captured following projectile launch of several M110A2E1 155 mm artillery projectiles. These videos were then analyzed using the automated flight video analysis method to attain their initial position and orientation histories.Examination of the pitch and yaw histories clearly indicates that in addition to epicyclic motion's nutation and precession oscillations, an even faster wobble amplitude is present during each spin revolution, even though some of the amplitudes of the oscillation are smaller than 0.02 degree.The results are compared to a sequence of shots where little appreciable mass asymmetries were present, and only nutation and precession frequencies are predominantly apparent in the motion history results. Magnitudes of the wobble motion are estimated and compared to product of inertia measurements of the asymmetric projectiles.展开更多
Joint Multimodal Aspect-based Sentiment Analysis(JMASA)is a significant task in the research of multimodal fine-grained sentiment analysis,which combines two subtasks:Multimodal Aspect Term Extraction(MATE)and Multimo...Joint Multimodal Aspect-based Sentiment Analysis(JMASA)is a significant task in the research of multimodal fine-grained sentiment analysis,which combines two subtasks:Multimodal Aspect Term Extraction(MATE)and Multimodal Aspect-oriented Sentiment Classification(MASC).Currently,most existing models for JMASA only perform text and image feature encoding from a basic level,but often neglect the in-depth analysis of unimodal intrinsic features,which may lead to the low accuracy of aspect term extraction and the poor ability of sentiment prediction due to the insufficient learning of intra-modal features.Given this problem,we propose a Text-Image Feature Fine-grained Learning(TIFFL)model for JMASA.First,we construct an enhanced adjacency matrix of word dependencies and adopt graph convolutional network to learn the syntactic structure features for text,which addresses the context interference problem of identifying different aspect terms.Then,the adjective-noun pairs extracted from image are introduced to enable the semantic representation of visual features more intuitive,which addresses the ambiguous semantic extraction problem during image feature learning.Thereby,the model performance of aspect term extraction and sentiment polarity prediction can be further optimized and enhanced.Experiments on two Twitter benchmark datasets demonstrate that TIFFL achieves competitive results for JMASA,MATE and MASC,thus validating the effectiveness of our proposed methods.展开更多
A comparison between deep learning and standalone models in predicting the compaction parameters of soil is presented in this research.One hundred and ninety and fifty-three soil samples were randomly picked up from t...A comparison between deep learning and standalone models in predicting the compaction parameters of soil is presented in this research.One hundred and ninety and fifty-three soil samples were randomly picked up from two hundred and forty-three soil samples to create training and validation datasets,respectively.The performance and accuracy of the models were measured by root mean square error(RMSE),coefficient of determination(R2),Pearson product-moment correlation coefficient(r),mean absolute error(MAE),variance accounted for(VAF),mean absolute percentage error(MAPE),weighted mean absolute percentage error(WMAPE),a20-index,index of scatter(IOS),and index of agreement(IOA).Comparisons between standalone models demonstrate that the model MD 29 in Gaussian process regression(GPR)and model MD 101 in support vector machine(SVM)can achieve over 96%of accuracy in predicting the optimum moisture content(OMC)and maximum dry density(MDD)of soil,and outperformed other standalone models.The comparison between deep learning models shows that the models MD 46 and MD 146 in long short-term memory(LSTM)predict OMC and MDD with higher accuracy than ANN models.However,the LSTM models outperformed the GPR models in predicting the compaction parameters.The sensitivity analysis illustrates that fine content(FC),specific gravity(SG),and liquid limit(LL)highly influence the prediction of compaction parameters.展开更多
The advent of the COVID-19 pandemic has adversely affected the entire world and has put forth high demand for techniques that remotely manage crowd-related tasks.Video surveillance and crowd management using video ana...The advent of the COVID-19 pandemic has adversely affected the entire world and has put forth high demand for techniques that remotely manage crowd-related tasks.Video surveillance and crowd management using video analysis techniques have significantly impacted today’s research,and numerous applications have been developed in this domain.This research proposed an anomaly detection technique applied to Umrah videos in Kaaba during the COVID-19 pandemic through sparse crowd analysis.Managing theKaaba rituals is crucial since the crowd gathers from around the world and requires proper analysis during these days of the pandemic.The Umrah videos are analyzed,and a system is devised that can track and monitor the crowd flow in Kaaba.The crowd in these videos is sparse due to the pandemic,and we have developed a technique to track the maximum crowd flow and detect any object(person)moving in the direction unlikely of the major flow.We have detected abnormal movement by creating the histograms for the vertical and horizontal flows and applying thresholds to identify the non-majority flow.Our algorithm aims to analyze the crowd through video surveillance and timely detect any abnormal activity tomaintain a smooth crowd flowinKaaba during the pandemic.展开更多
基金supported by the BK21 FOUR Program of the National Research Foundation of Korea funded by the Ministry of Education(NRF5199991014091)Seok-Won Lee’s work was supported by Institute of Information&Communications Technology Planning&Evaluation(IITP)under the Artificial Intelligence Convergence Innovation Human Resources Development(IITP-2024-RS-2023-00255968)grant funded by the Korea government(MSIT).
文摘Although sentiment analysis is pivotal to understanding user preferences,existing models face significant challenges in handling context-dependent sentiments,sarcasm,and nuanced emotions.This study addresses these challenges by integrating ontology-based methods with deep learning models,thereby enhancing sentiment analysis accuracy in complex domains such as film reviews and restaurant feedback.The framework comprises explicit topic recognition,followed by implicit topic identification to mitigate topic interference in subsequent sentiment analysis.In the context of sentiment analysis,we develop an expanded sentiment lexicon based on domainspecific corpora by leveraging techniques such as word-frequency analysis and word embedding.Furthermore,we introduce a sentiment recognition method based on both ontology-derived sentiment features and sentiment lexicons.We evaluate the performance of our system using a dataset of 10,500 restaurant reviews,focusing on sentiment classification accuracy.The incorporation of specialized lexicons and ontology structures enables the framework to discern subtle sentiment variations and context-specific expressions,thereby improving the overall sentiment-analysis performance.Experimental results demonstrate that the integration of ontology-based methods and deep learning models significantly improves sentiment analysis accuracy.
基金Guangzhou Municipality’s Philosophy and Social Sciences Development“14th Five-Year Plan”2021 Annual Young Scholars Research Project(2021GZQN15)。
文摘In recent years,more and more directors of culture and tourism have taken part in the promotion of local cultural tourism by cross-dressing,talent shows,and pushing their limits on self-media platforms.This study investigates short videos of Lingnan culture promoted by directors general and deputy directors general of the Culture,Radio,Television,Tourism,and Sports Bureau of counties and cities in Guangdong Province on social media by the method of multimodal critical discourse analysis.The analysis of 33 videos shows that Lingnan culture is a domineering and confident culture,historical culture,graceful and elegant culture,and vibrant and active culture.Domineering and confident culture is embedded in the utterances and behaviors of the directors general or deputy directors general in the video.Historical culture is realized through the conversation with historical figures through time travel.Graceful and elegant culture is constructed in the depiction of sceneries and the depiction of characters’manners.Vibrant and active culture is represented in the depiction of the characters’actional process and analytical process.
文摘In today’s information age,video data,as an important carrier of information,is growing explosively in terms of production volume.The quick and accurate extraction of useful information from massive video data has become a focus of research in the field of computer vision.AI dynamic recognition technology has become one of the key technologies to address this issue due to its powerful data processing capabilities and intelligent recognition functions.Based on this,this paper first elaborates on the development of intelligent video AI dynamic recognition technology,then proposes several optimization strategies for intelligent video AI dynamic recognition technology,and finally analyzes the performance of intelligent video AI dynamic recognition technology for reference.
文摘The Chinese microblog text is short,full of noise data and emoticons,and the words are often irregularly.For these characteristics,we proposed a fine-grained emotion analysis method.Combined with TF-IDF and variance statistics,we realized a method of calculating multi-class feature selection.We judge the text whether it is positive or negative first,then choose the fine-grained emotional tendency.And we get good result with the test using COAE data set.Compared with other method for feature selection and other emotional library,we did better.
文摘Several initiatives have been launched to help prevention of traffic accidents and near-accidents across the European Union. To aid the overall goal of reducing deaths and injuries related to traffic, one must understand the causation of the traffic accidents in order to prevent them. Rather than deploying a person to physically monitor a location, the task is eased by camera equipment installed in existing infrastructure, e.g. poles, and buildings, etc. In rural areas there is however a very limited infrastructure available which complicates the data acquisition. But even if there is infrastructure available in either the rural area or the urban area, this might not serve as an ideal position to capture video data from. In this work, we survey and provide an overview of available and relevant portable poles setups with respect to capturing data in both urban areas and rural areas. The conclusion of the survey shows a lack of a mobile, lightweight, compact, and easy deployable portable pole. We therefore design and develop a new portable pole meeting these requirements. The new proposed portable pole can be deployed by 2 persons in 2 hours in both rural areas as well as urban areas due to its compactness. The deployment and usage of the new portable pole is a complimentary tool, which may improve the camera capturing angle in case existing infrastructure is insufficient. This ultimately improves the traffic monitoring opportunities. Further, the survey of selected portable poles provides an excellent overview and can aid multiple applications within road traffic.
文摘Objective:Saccades accompanied by normal gain in video head impulse tests(vHIT)are often observed in patients with vestibular migraine(VM).However,they are not considered as an independent indicator,reducing their utility in diagnosing VM.To better understand clinical features of VM,it is necessary to understand raw saccades data.Methods:Fourteen patients with confirmed VM,45 patients with probable VM(p-VM)and 14 agematched healthy volunteers were included in this study.Clinical findings related to spontaneous nystagmus(SN),positional nystagmus(PN),head-shaking nystagmus(HSN),caloric test and vHIT were recorded.Raw saccades data were exported and numbered by their sequences,and their features analyzed.Results:VM patients showed no SN,PN or HSN,and less than half of them showed unilateral weakness(UW)on caloric test.The first saccades from lateral semicircular canal stimulation were the most predominant for both left and right sides.Neither velocity nor time parameters were significantly different when compared between the two sides.Most VM patients(86%)exhibited small saccades,around 35%of the head peak velocity,with a latency of 200e400 ms.Characteristics of saccades were similar in patients with p-VM.Only four normal subjects showed saccades,all unilateral and seemingly random.Conclusions:Small saccades involving bilateral semicircular canals with a scattered distribution pattern are common in patients with VM and p-VM.
基金Supported in part by the National Natural Science Foundation of China (No. 60572045)the Ministry of Education of China Ph.D. Program Foundation (No.20050698033)Cooperation Project (2005.7-2007.6) with Microsoft Research Asia.
文摘Semantic video analysis plays an important role in the field of machine intelligence and pattern recognition. In this paper, based on the Hidden Markov Model (HMM), a semantic recognition framework on compressed videos is proposed to analyze the video events according to six low-level features. After the detailed analysis of video events, the pattern of global motion and five features in foreground—the principal parts of videos, are employed as the observations of the Hidden Markov Model to classify events in videos. The applications of the proposed framework in some video event detections demonstrate the promising success of the proposed framework on semantic video analysis.
文摘Foreground detection methods can be applied to efficiently distinguish foreground objects including moving or static objects from back- ground which is very important in the application of video analysis, especially video surveillance. An excellent background model can obtain a good foreground detection results. A lot of background modeling methods had been proposed, but few comprehensive evaluations of them are available. These methods suffer from various challenges such as illumination changes and dynamic background. This paper first analyzed advantages and disadvantages of various background modeling methods in video analysis applications and then compared their performance in terms of quality and the computational cost. The Change detection.Net (CDnet2014) dataset and another video dataset with different envi- ronmental conditions (indoor, outdoor, snow) were used to test each method. The experimental results sufficiently demonstrated the strengths and drawbacks of traditional and recently proposed state-of-the-art background modeling methods. This work is helpful for both researchers and engineering practitioners. Codes of background modeling methods evaluated in this paper are available atwww.yongxu.org/lunwen.html.
文摘With the rapid development of science and technology,video has become a very important way to demonstrate the city itself.This paper,based on Fairclough's three-dimensional conception of discourse to the video of Xitang,analyzes the on-screen titles of this video to reveal the ideology behind it.Through the analysis,it points out that this video,while protecting Xitang's own unique traditional characteristics,it tries to cater for the needs not only Chinese people but also foreigners.To some extent,it tries to promote Xitang itself.
基金Supported by the National Natural Science Founda-tion of China (60132030)
文摘The main research motive is to analysis and to veiny the inherent nonlinear character of MPEG-4 video. The power spectral density estimation of the video trafiic describes its 1/f^β and periodic characteristics.The priraeipal compohems analysis of the reconstructed space dimension shows only several principal components can be the representation of all dimensions. The correlation dimension analysis proves its fractal characteristic. To accurately compute the largest Lyapunov exponent, the video traffic is divided into many parts.So the largest Lyapunov exponent spectrum is separately calculated using the small data sets method. The largest Lyapunov exponent spectrum shows there exists abundant nonlinear chaos in MPEG-4 video traffic. The conclusion can be made that MPEG-4 video traffic have complex nonlinear be havior and can be characterized by its power spectral density,principal components, correlation dimension and the largest Lyapunov exponent besides its common statistics.
文摘During the past decade, feature extraction and knowledge acquisition based on video analysis have been extensively researched and tested on many applications such as closed-circuit television (CCTV) data analysis, large-scale public event control, and other daily security monitoring and surveillance operations with various degrees of success. However, since the actual video process is a multi-phased one and encompasses extensive theories and techniques ranging from fundamental image processing, computational geometry and graphics, and machine vision, to advanced artificial intelligence, pattern analysis, and even cognitive science, there are still many important problems to resolve before it can be widely applied. Among them, video event identification and detection are two prominent ones. Comparing with the most popular frame-to-frame processing mode of most of today's approaches and systems, this project reorganizes video data as a 3D volume structure that provides the hybrid spatial and temporal information in a unified space. This paper reports an innovative technique to transform original video frames to 3D volume structures denoted by spatial and temporal features. It then highlights the volume array structure in a so-called "pre-suspicion" mechanism for a later process. The focus of this report is the development of an effective and efficient voxel-based segmentation technique suitable to the volumetric nature of video events and ready for deployment in 3D clustering operations. The paper is concluded with a performance evaluation of the devised technique and discussion on the future work for accelerating the pre-processing of the original video data.
基金funded by the Deanship of Scientific Research at Princess Nourah bint Abdulrahman University,through the Research Funding Program(Grand No.FRP-1440-32).
文摘:Strabismus is a medical condition that is defined as the lack of coordination between the eyes.When Strabismus is detected at an early age,the chances of curing it are higher.The methods used to detect strabismus and measure its degree of deviation are complex and time-consuming,and they always require the presence of a physician.In this paper,we present a method of detecting strabismus and measuring its degree of deviation using videos of the patient’s eye region under a cover test.Our method involves extracting features from a set of training videos(training corpora)and using them to build a classifier.A decision tree(ID3)is built using labeled cases from actual strabismus diagnosis.Patterns are extracted from the corresponding videos of patients,and an association between the extracted features and actual diagnoses is established.Matching Rules from the correlation plot are used to predict diagnoses for future patients.The classifier was tested using a set of testing videos(testing corpora).The results showed 95.9%accuracy,4.1%were light cases and could not be detected correctly from the videos,half of them were false positive and the other half was false negative.
文摘The increasing use of digital video everyday in a multitude of electronic devices, including mobile phones, tablets and laptops, poses the need for quick development of cross-platform video software. However current approaches to this direction usually require a long learning curve, and their development lacks standardization. This results in software components that are difficult to reuse, and hard to maintain or extend. In order to overcome such issues, we propose a novel object-oriented framework for efficient development of software systems for video analysis. It consists of a set of four abstract components, suitable for the implementation of independent plug-in modules for video acquisition, preprocessing, analysis and output handling. The extensibility of each module can be facilitated by sub-modules specifying additional functionalities. This architecture enables quick responses to changes and re-configurability;thus conforming to the requirements of agile software development practices. Considering the need for platform independency, the proposed Java Video Analysis (JVA) framework is implemented in Java. It is publicly available through the web as open-access software, supported by a growing collection of implemented modules. Its efficiency is empirically validated for the development of a representative video analysis system.
文摘Trials is a specialty of off-road cycling in which the rider has to face obstacle courses without resting feet on the ground. Technique in this sport has a great importance, since it reduces the risk of committing penalties and allows more efficient execution of the gesture. To improve technique, the motion analysis allows to study the gesture both qualitatively and quantitatively. In this work video analysis was used to study the side hop from rear wheel technique. Two different executions of this technique were analyzed. The primary purpose is the identification of the phases that make up the technical gesture. It was given an explanation to the movement strategies adopted in the execution of the jump in the two different situations.
基金Academic research project of Huashang College of Guangdong University of Finance and Economics(No.2018HSXS15).
文摘From WeChat,QQ to Weibo,Tik Tok this series of online social networking applications,especially with the increasingly maturity of online shopping in recent years,short video and mobile intelligence have been developing rapidly,which have strongly influenced many people’s living habits and shopping habits.Their development not only promotes the social and economic development,but also produces a new form of visual communication.This paper discusses the characteristics of short video,analyzes the shift of public visual consumption behind the popularity of short video,and hopes to further build a healthy development path of short video.
文摘This paper describes an approach to identify epicyclic and tricyclic motion during projectile flight caused by mass asymmetries in spinstabilized projectiles. Flight video was captured following projectile launch of several M110A2E1 155 mm artillery projectiles. These videos were then analyzed using the automated flight video analysis method to attain their initial position and orientation histories.Examination of the pitch and yaw histories clearly indicates that in addition to epicyclic motion's nutation and precession oscillations, an even faster wobble amplitude is present during each spin revolution, even though some of the amplitudes of the oscillation are smaller than 0.02 degree.The results are compared to a sequence of shots where little appreciable mass asymmetries were present, and only nutation and precession frequencies are predominantly apparent in the motion history results. Magnitudes of the wobble motion are estimated and compared to product of inertia measurements of the asymmetric projectiles.
基金supported by the Science and Technology Project of Henan Province(No.222102210081).
文摘Joint Multimodal Aspect-based Sentiment Analysis(JMASA)is a significant task in the research of multimodal fine-grained sentiment analysis,which combines two subtasks:Multimodal Aspect Term Extraction(MATE)and Multimodal Aspect-oriented Sentiment Classification(MASC).Currently,most existing models for JMASA only perform text and image feature encoding from a basic level,but often neglect the in-depth analysis of unimodal intrinsic features,which may lead to the low accuracy of aspect term extraction and the poor ability of sentiment prediction due to the insufficient learning of intra-modal features.Given this problem,we propose a Text-Image Feature Fine-grained Learning(TIFFL)model for JMASA.First,we construct an enhanced adjacency matrix of word dependencies and adopt graph convolutional network to learn the syntactic structure features for text,which addresses the context interference problem of identifying different aspect terms.Then,the adjective-noun pairs extracted from image are introduced to enable the semantic representation of visual features more intuitive,which addresses the ambiguous semantic extraction problem during image feature learning.Thereby,the model performance of aspect term extraction and sentiment polarity prediction can be further optimized and enhanced.Experiments on two Twitter benchmark datasets demonstrate that TIFFL achieves competitive results for JMASA,MATE and MASC,thus validating the effectiveness of our proposed methods.
文摘A comparison between deep learning and standalone models in predicting the compaction parameters of soil is presented in this research.One hundred and ninety and fifty-three soil samples were randomly picked up from two hundred and forty-three soil samples to create training and validation datasets,respectively.The performance and accuracy of the models were measured by root mean square error(RMSE),coefficient of determination(R2),Pearson product-moment correlation coefficient(r),mean absolute error(MAE),variance accounted for(VAF),mean absolute percentage error(MAPE),weighted mean absolute percentage error(WMAPE),a20-index,index of scatter(IOS),and index of agreement(IOA).Comparisons between standalone models demonstrate that the model MD 29 in Gaussian process regression(GPR)and model MD 101 in support vector machine(SVM)can achieve over 96%of accuracy in predicting the optimum moisture content(OMC)and maximum dry density(MDD)of soil,and outperformed other standalone models.The comparison between deep learning models shows that the models MD 46 and MD 146 in long short-term memory(LSTM)predict OMC and MDD with higher accuracy than ANN models.However,the LSTM models outperformed the GPR models in predicting the compaction parameters.The sensitivity analysis illustrates that fine content(FC),specific gravity(SG),and liquid limit(LL)highly influence the prediction of compaction parameters.
基金The authors extend their appreciation to the Deputyship for Research and Innovation,Ministry of Education in Saudi Arabia for funding this research work through the Project Number QURDO001Project title:Intelligent Real-Time Crowd Monitoring System Using Unmanned Aerial Vehicle(UAV)Video and Global Positioning Systems(GPS)Data。
文摘The advent of the COVID-19 pandemic has adversely affected the entire world and has put forth high demand for techniques that remotely manage crowd-related tasks.Video surveillance and crowd management using video analysis techniques have significantly impacted today’s research,and numerous applications have been developed in this domain.This research proposed an anomaly detection technique applied to Umrah videos in Kaaba during the COVID-19 pandemic through sparse crowd analysis.Managing theKaaba rituals is crucial since the crowd gathers from around the world and requires proper analysis during these days of the pandemic.The Umrah videos are analyzed,and a system is devised that can track and monitor the crowd flow in Kaaba.The crowd in these videos is sparse due to the pandemic,and we have developed a technique to track the maximum crowd flow and detect any object(person)moving in the direction unlikely of the major flow.We have detected abnormal movement by creating the histograms for the vertical and horizontal flows and applying thresholds to identify the non-majority flow.Our algorithm aims to analyze the crowd through video surveillance and timely detect any abnormal activity tomaintain a smooth crowd flowinKaaba during the pandemic.