期刊文献+
共找到74,367篇文章
< 1 2 250 >
每页显示 20 50 100
Trends in Event Understanding and Caption Generation/Reconstruction in Dense Video:A Review
1
作者 Ekanayake Mudiyanselage Chulabhaya Lankanatha Ekanayake Abubakar Sulaiman Gezawa Yunqi Lei 《Computers, Materials & Continua》 SCIE EI 2024年第3期2941-2965,共25页
Video description generates natural language sentences that describe the subject,verb,and objects of the targeted Video.The video description has been used to help visually impaired people to understand the content.It... Video description generates natural language sentences that describe the subject,verb,and objects of the targeted Video.The video description has been used to help visually impaired people to understand the content.It is also playing an essential role in devolving human-robot interaction.The dense video description is more difficult when compared with simple Video captioning because of the object’s interactions and event overlapping.Deep learning is changing the shape of computer vision(CV)technologies and natural language processing(NLP).There are hundreds of deep learning models,datasets,and evaluations that can improve the gaps in current research.This article filled this gap by evaluating some state-of-the-art approaches,especially focusing on deep learning and machine learning for video caption in a dense environment.In this article,some classic techniques concerning the existing machine learning were reviewed.And provides deep learning models,a detail of benchmark datasets with their respective domains.This paper reviews various evaluation metrics,including Bilingual EvaluationUnderstudy(BLEU),Metric for Evaluation of Translation with Explicit Ordering(METEOR),WordMover’s Distance(WMD),and Recall-Oriented Understudy for Gisting Evaluation(ROUGE)with their pros and cons.Finally,this article listed some future directions and proposed work for context enhancement using key scene extraction with object detection in a particular frame.Especially,how to improve the context of video description by analyzing key frames detection through morphological image analysis.Additionally,the paper discusses a novel approach involving sentence reconstruction and context improvement through key frame object detection,which incorporates the fusion of large languagemodels for refining results.The ultimate results arise fromenhancing the generated text of the proposedmodel by improving the predicted text and isolating objects using various keyframes.These keyframes identify dense events occurring in the video sequence. 展开更多
关键词 video description video to text video caption sentence reconstruction
下载PDF
Stepwise Ecological Restoration:A framework for improving restoration outcomes
2
作者 Junguo Liu Yuehan Dou He Chen 《Geography and Sustainability》 CSCD 2024年第2期160-166,共7页
Ecosystem degradation is one of the critical constraints for the sustainable development of our planet.However,recovering an ecosystem to a pre-impairment condition is often not practical.The International Restoration... Ecosystem degradation is one of the critical constraints for the sustainable development of our planet.However,recovering an ecosystem to a pre-impairment condition is often not practical.The International Restoration Standards provide the first framework for practical guidance on what constitutes the process of ecological repair and how this repair process can be influenced to improve net ecological benefits.In these Standards,Restorative Continuum is highlighted and it recognises that many do not,yet there is still value in aspiring to improvements to the highest extent possible,with some sites potentially being able to be improved in a stepwise manner.Here we elaborate on these Standards by providing a cross-ecosystem theoretical framework of Stepwise Ecological Restoration(STERE)for promoting higher environmental benefits.STERE allows the selection of suitable restorative modes by considering the degree of degradation while encouraging a transition to a higher state.These models include environmental remediation for completely modified and degraded ecosystems,ecological rehabilitation for highly modified and degraded ecosystems,and ecological restoration for degraded native ecosystems.STERE requires selecting tailored restorative modes,setting clear restorative targets and reference ecosystems,applying a systematic-thinking approach,and implementing a continuous monitoring program at all process stages to achieve a resilient trajectory.STERE allows adaptive management in the context of climate change,and when the evidence is available,to“adapt to the future”to ensure climate resilience.The STERE framework could assist in initiating and implementing restoration projects worldwide,especially in developing countries. 展开更多
关键词 Ecological restoration Reference ecosystem Restorative modes Sustainable development
下载PDF
Cloud‐based video streaming services:Trends,challenges,and opportunities
3
作者 Tajinder Kumar Purushottam Sharma +5 位作者 Jaswinder Tanwar Hisham Alsghier Shashi Bhushan Hesham Alhumyani Vivek Sharma Ahmed I.Alutaibi 《CAAI Transactions on Intelligence Technology》 SCIE EI 2024年第2期265-285,共21页
Cloud computing has drastically changed the delivery and consumption of live streaming content.The designs,challenges,and possible uses of cloud computing for live streaming are studied.A comprehensive overview of the... Cloud computing has drastically changed the delivery and consumption of live streaming content.The designs,challenges,and possible uses of cloud computing for live streaming are studied.A comprehensive overview of the technical and business issues surrounding cloudbased live streaming is provided,including the benefits of cloud computing,the various live streaming architectures,and the challenges that live streaming service providers face in delivering high‐quality,real‐time services.The different techniques used to improve the performance of video streaming,such as adaptive bit‐rate streaming,multicast distribution,and edge computing are discussed and the necessity of low‐latency and high‐quality video transmission in cloud‐based live streaming is underlined.Issues such as improving user experience and live streaming service performance using cutting‐edge technology,like artificial intelligence and machine learning are discussed.In addition,the legal and regulatory implications of cloud‐based live streaming,including issues with network neutrality,data privacy,and content moderation are addressed.The future of cloud computing for live streaming is examined in the section that follows,and it looks at the most likely new developments in terms of trends and technology.For technology vendors,live streaming service providers,and regulators,the findings have major policy‐relevant implications.Suggestions on how stakeholders should address these concerns and take advantage of the potential presented by this rapidly evolving sector,as well as insights into the key challenges and opportunities associated with cloud‐based live streaming are provided. 展开更多
关键词 cloud computing video analysis video coding
下载PDF
Lactate is a potentially harmful substitute for brain glucose fuel:consequences for metabolic restoration of neurotransmission
4
作者 Oliver Kann Lennart Soder Babak Khodaie 《Neural Regeneration Research》 SCIE CAS 2025年第5期1403-1404,共2页
The metabolite lactate (L-lactate) can be generated and released by diverse brain cells,including neurons,astrocytes,and oligodendrocytes (Kann,2023;Rae et al.,2024).Lactate production usually requires the degradation... The metabolite lactate (L-lactate) can be generated and released by diverse brain cells,including neurons,astrocytes,and oligodendrocytes (Kann,2023;Rae et al.,2024).Lactate production usually requires the degradation of glucose (D-glucose)-and glycogen in astrocytes-to pyruvate by glycolysis and subsequent conversion of pyruvate to lactate by the enzyme lactate dehydrogenase(Figure 1A;Dienel,2019;Rae et al.,2024). 展开更多
关键词 consequences FIGURE restoration
下载PDF
Restoration or Rehabilitation of the Faleme River Affected by Mining Activities: What Methods?
5
作者 Mor Diop Ibrahima Mall +3 位作者 Elhadji Mamadou Sonko Tidiane Diop Birane Niane Cheikh Mbow 《Journal of Water Resource and Protection》 CAS 2024年第4期233-263,共31页
The Faleme River, a West Africa long transboundary stream (625 km) and abundant flow (>1100 million m<sup>3</sup>) is affected by severe erosion because of mining activities that takes place throughout ... The Faleme River, a West Africa long transboundary stream (625 km) and abundant flow (>1100 million m<sup>3</sup>) is affected by severe erosion because of mining activities that takes place throughout the riverbed. To preserve this important watercourse and ensure the sustainability of its services, selecting and implementing appropriates restorations techniques is vital. In this context, the purpose of this paper was to present an overview of the actions and techniques that can be implemented for the restoration/rehabilitation of the Faleme. The methodological approach includes field investigation, water sampling, literature review with cases studies and SWOT analysis of the four methods presented: river dredging, constructed wetlands, floating treatment wetlands and chemical precipitation (coagulation and flocculation). The study confirmed the pollution of the river by suspended solids (TSS > 1100 mg/L) and heavy metals such as iron, zinc, aluminium, and arsenic. For the restoration methods, it was illustrated through description of their mode of operation and through some case studies presented, that all the four methods have proven their effectiveness in treating rivers but have differences in their costs, their sustainability (detrimental to living organisms or causing a second pollution) and social acceptance. They also have weaknesses and issues that must be addressed to ensure success of rehabilitation. For the case of the Faleme river, after analysis, floating treatment wetlands are highly recommended for their low cost, good removal efficiency if the vulnerability of the raft and buoyancy to strong waves and flow is under control. 展开更多
关键词 Faleme River River restoration Constructed Wetlands DREDGING Floating Treatment Wetlands COAGULATION-FLOCCULATION
下载PDF
A HEVC Video Steganalysis Method Using the Optimality of Motion Vector Prediction
6
作者 Jun Li Minqing Zhang +2 位作者 Ke Niu Yingnan Zhang Xiaoyuan Yang 《Computers, Materials & Continua》 SCIE EI 2024年第5期2085-2103,共19页
Among steganalysis techniques,detection against MV(motion vector)domain-based video steganography in the HEVC(High Efficiency Video Coding)standard remains a challenging issue.For the purpose of improving the detectio... Among steganalysis techniques,detection against MV(motion vector)domain-based video steganography in the HEVC(High Efficiency Video Coding)standard remains a challenging issue.For the purpose of improving the detection performance,this paper proposes a steganalysis method that can perfectly detectMV-based steganography in HEVC.Firstly,we define the local optimality of MVP(Motion Vector Prediction)based on the technology of AMVP(Advanced Motion Vector Prediction).Secondly,we analyze that in HEVC video,message embedding either usingMVP index orMVD(Motion Vector Difference)may destroy the above optimality of MVP.And then,we define the optimal rate of MVP as a steganalysis feature.Finally,we conduct steganalysis detection experiments on two general datasets for three popular steganographymethods and compare the performance with four state-ofthe-art steganalysis methods.The experimental results demonstrate the effectiveness of the proposed feature set.Furthermore,our method stands out for its practical applicability,requiring no model training and exhibiting low computational complexity,making it a viable solution for real-world scenarios. 展开更多
关键词 video steganography video steganalysis motion vector prediction motion vector difference advanced motion vector prediction local optimality
下载PDF
Customized Convolutional Neural Network for Accurate Detection of Deep Fake Images in Video Collections
7
作者 Dmitry Gura Bo Dong +1 位作者 Duaa Mehiar Nidal Al Said 《Computers, Materials & Continua》 SCIE EI 2024年第5期1995-2014,共20页
The motivation for this study is that the quality of deep fakes is constantly improving,which leads to the need to develop new methods for their detection.The proposed Customized Convolutional Neural Network method in... The motivation for this study is that the quality of deep fakes is constantly improving,which leads to the need to develop new methods for their detection.The proposed Customized Convolutional Neural Network method involves extracting structured data from video frames using facial landmark detection,which is then used as input to the CNN.The customized Convolutional Neural Network method is the date augmented-based CNN model to generate‘fake data’or‘fake images’.This study was carried out using Python and its libraries.We used 242 films from the dataset gathered by the Deep Fake Detection Challenge,of which 199 were made up and the remaining 53 were real.Ten seconds were allotted for each video.There were 318 videos used in all,199 of which were fake and 119 of which were real.Our proposedmethod achieved a testing accuracy of 91.47%,loss of 0.342,and AUC score of 0.92,outperforming two alternative approaches,CNN and MLP-CNN.Furthermore,our method succeeded in greater accuracy than contemporary models such as XceptionNet,Meso-4,EfficientNet-BO,MesoInception-4,VGG-16,and DST-Net.The novelty of this investigation is the development of a new Convolutional Neural Network(CNN)learning model that can accurately detect deep fake face photos. 展开更多
关键词 Deep fake detection video analysis convolutional neural network machine learning video dataset collection facial landmark prediction accuracy models
下载PDF
Phytoremediation Strategies for Heavy Metal Contamination: A Review on Sustainable Approach for Environmental Restoration
8
作者 Mariam Salifu Matthew Abu John +3 位作者 Murtala Abubakar Ibukunoluwa Abimbola Bankole Nneka Damola Ajayi Olawumi Amusan 《Journal of Environmental Protection》 2024年第4期450-474,共25页
Current globalization trends and important breakthroughs globally need a complete study of heavy metal contamination, its causes, its impacts on human and environmental health, and different remediation strategies. He... Current globalization trends and important breakthroughs globally need a complete study of heavy metal contamination, its causes, its impacts on human and environmental health, and different remediation strategies. Heavy metal pollution is mostly produced by urbanization and industry, which threatens ecosystems and human health. Herein, we discuss a sustainable environmental restoration strategy employing phytoremediation for heavy metal pollution, the carcinogenic, mutagenic, and cytotoxic effects of heavy metals such as cadmium, copper, mercury, selenium, zinc, arsenic, chromium, lead, nickel, and silver, which may be fatal. Phytoremediation, which was prioritized, uses plants to remove, accumulate, and depollute pollutants. This eco-friendly method may safely collect, accumulate, and detoxify toxins using plants, making it popular. This study covers phytostabilization, phytodegradation, rhizodegradation, phytoextraction, phytovolatilization, and rhizofiltration. A phytoremediation process’s efficiency in varied environmental circumstances depends on these components’ complex interplay. This paper also introduces developing phytoremediation approaches including microbe-assisted, chemical-assisted, and organic or bio-char use. These advancements attempt to overcome conventional phytoremediation’s limitations, such as limited suitable plant species, location problems, and sluggish remediation. Current research includes machine learning techniques and computer modeling, biostimulation, genetic engineering, bioaugmentation, and hybrid remediation. These front-line solutions show that phytoremediation research is developing towards transdisciplinary efficiency enhancement. We acknowledge phytoremediation’s promise but also its drawbacks, such as site-specific variables, biomass buildup, and sluggish remediation, as well as ongoing research to address them. In conclusion, heavy metal pollution threatens the ecology and public health and must be reduced. Phytoremediation treats heavy metal pollution in different ways. Over time, phytoremediation systems have developed unique ways that improve efficiency. Despite difficulties like site-specificity, sluggish remediation, and biomass buildup potential, phytoremediation is still a vital tool for environmental sustainability. 展开更多
关键词 PHYTOREMEDIATION HEAVY-METAL CONTAMINATION SUSTAINABILITY restoration
下载PDF
Restoration of the JPEG Maximum Lossy Compressed Face Images with Hourglass Block-GAN
9
作者 Jongwook Si Sungyoung Kim 《Computers, Materials & Continua》 SCIE EI 2024年第3期2893-2908,共16页
In the context of high compression rates applied to Joint Photographic Experts Group(JPEG)images through lossy compression techniques,image-blocking artifacts may manifest.This necessitates the restoration of the imag... In the context of high compression rates applied to Joint Photographic Experts Group(JPEG)images through lossy compression techniques,image-blocking artifacts may manifest.This necessitates the restoration of the image to its original quality.The challenge lies in regenerating significantly compressed images into a state in which these become identifiable.Therefore,this study focuses on the restoration of JPEG images subjected to substantial degradation caused by maximum lossy compression using Generative Adversarial Networks(GAN).The generator in this network is based on theU-Net architecture.It features a newhourglass structure that preserves the characteristics of the deep layers.In addition,the network incorporates two loss functions to generate natural and high-quality images:Low Frequency(LF)loss and High Frequency(HF)loss.HF loss uses a pretrained VGG-16 network and is configured using a specific layer that best represents features.This can enhance the performance in the high-frequency region.In contrast,LF loss is used to handle the low-frequency region.The two loss functions facilitate the generation of images by the generator,which can mislead the discriminator while accurately generating high-and low-frequency regions.Consequently,by removing the blocking effects frommaximum lossy compressed images,images inwhich identities could be recognized are generated.This study represents a significant improvement over previous research in terms of the image resolution performance. 展开更多
关键词 JPEG lossy compression restoration image generation GAN
下载PDF
Characterisation of the Bacteria and Archaea Community Associated with Wild Oysters, At Three Possible Restoration Sites in the North Sea
10
作者 Natacha M. S. Juste-Poinapen Yang Lu +2 位作者 Blanca Bolaños De Hoyos George C. Birch Camille Saurel 《Open Journal of Marine Science》 2024年第2期19-40,共22页
With 85% of the global oyster reefs destroyed, there is an urgent need for large scale restoration to benefit from the ecosystem services provided by biogenic oyster reefs and their associated biodiversity, including ... With 85% of the global oyster reefs destroyed, there is an urgent need for large scale restoration to benefit from the ecosystem services provided by biogenic oyster reefs and their associated biodiversity, including microorganisms that drive marine biogeochemical cycles. This experiment established a baseline for the monitoring of the bacterial and archaeal community associated with wild oysters, using samples from their immediate environment of the Voordelta, with cohabiting Crassostrea gigas and Ostrea edulis, Duikplaats with only C. gigas attached to rocks, and the Dansk Skaldyrcentre, with no onsite oysters. The microbial profiling was carried out through DNA analysis of samples collected from the surfaces of oyster shells and their substrate, the sediment and seawater. Following 16S rRNA amplicon sequencing and bioinformatics, alpha indices implied high species abundance and diversity in sediment but low abundance in seawater. As expected, Proteobacteria, Bacteroidetes, Firmicutes and Thaumarchaeota dominated the top 20 OTUs. In the Voordelta, OTUs related to Colwellia, Shewanella and Psychrobium differentiated the oysters collected from a reef with those attached to rocks. Duikplaats were distinct for sulfur-oxidizers Sulfurimonas and sulfate-reducers from the Sva 0081 sediment group. Archaea were found mainly in sediments and the oyster associated microbiome, with greater abundance at the reef site, consisting mostly of Thaumarchaeota from the family Nitrosopumilaceae. The oyster free site displayed archaea in sediments only, and algal bloom indicator microorganisms from the Rhodobacteraceae, Flavobacteriaceae family and genus [Polaribacter] huanghezhanensis, in addition to the ascidian symbiotic partner, Synechococcus. This study suggests site specific microbiome shifts, influenced by the presence of oysters and the type of substrate. 展开更多
关键词 Oyster Reefs MICROBIOME Marine Bacteria Marine Archaea restoration
下载PDF
Problematic Use of Video Games in Schools in Northern Benin (2023)
11
作者 Ireti Nethania Elie Ataigba David Sinet Koivogui +6 位作者 Damega Wenkourama Marcos Tohou Eurydice Elvire Djossou Anselme Djidonou Francis Tognon Tchegnonsi Prosper Gandaho Josiane Ezin Houngbe 《Open Journal of Psychiatry》 2024年第2期120-141,共22页
Objective: To study the problematic use of video games among secondary school students in the city of Parakou in 2023. Methods: Descriptive cross-sectional study conducted in the commune of Parakou from December 2022 ... Objective: To study the problematic use of video games among secondary school students in the city of Parakou in 2023. Methods: Descriptive cross-sectional study conducted in the commune of Parakou from December 2022 to July 2023. The study population consisted of students regularly enrolled in public and private secondary schools in the city of Parakou for the 2022-2023 academic year. A two-stage non-proportional stratified sampling technique combined with simple random sampling was adopted. The Problem Video Game Playing (PVP) scale was used to assess problem gambling in the study population, while anxiety and depression were assessed using the Hospital Anxiety and Depression Scale (HADS). Results: A total of 1030 students were included. The mean age of the pupils surveyed was 15.06 ± 2.68 years, with extremes of 10 and 28 years. The [13 - 18] age group was the most represented, with a proportion of 59.6% (614) in the general population. Females predominated, at 52.8% (544), with a sex ratio of 0.89. The prevalence of problematic video game use was 24.9%, measured using the Video Game Playing scale. Associated factors were male gender (p = 0.005), pocket money under 10,000 cfa (p = 0.001) and between 20,000 - 90,000 cfa (p = 0.030), addictive family behavior (p < 0.001), monogamous family (p = 0.023), good relationship with father (p = 0.020), organization of video game competitions (p = 0.001) and definite anxiety (p Conclusion: Substance-free addiction is struggling to attract the attention it deserves, as it did in its infancy everywhere else. This study complements existing data and serves as a reminder of the need to focus on this group of addictions, whose problematic use of video games remains the most frequent due to its accessibility and social tolerance. Preventive action combined with curative measures remains the most effective means of combating the problem at national level. 展开更多
关键词 Gaming Problem video Games BENIN 2023
下载PDF
Video Summarization Approach Based on Binary Robust Invariant Scalable Keypoints and Bisecting K-Means
12
作者 Sameh Zarif Eman Morad +3 位作者 Khalid Amin Abdullah Alharbi Wail S.Elkilani Shouze Tang 《Computers, Materials & Continua》 SCIE EI 2024年第3期3565-3583,共19页
Due to the exponential growth of video data,aided by rapid advancements in multimedia technologies.It became difficult for the user to obtain information from a large video series.The process of providing an abstract ... Due to the exponential growth of video data,aided by rapid advancements in multimedia technologies.It became difficult for the user to obtain information from a large video series.The process of providing an abstract of the entire video that includes the most representative frames is known as static video summarization.This method resulted in rapid exploration,indexing,and retrieval of massive video libraries.We propose a framework for static video summary based on a Binary Robust Invariant Scalable Keypoint(BRISK)and bisecting K-means clustering algorithm.The current method effectively recognizes relevant frames using BRISK by extracting keypoints and the descriptors from video sequences.The video frames’BRISK features are clustered using a bisecting K-means,and the keyframe is determined by selecting the frame that is most near the cluster center.Without applying any clustering parameters,the appropriate clusters number is determined using the silhouette coefficient.Experiments were carried out on a publicly available open video project(OVP)dataset that contained videos of different genres.The proposed method’s effectiveness is compared to existing methods using a variety of evaluation metrics,and the proposed method achieves a trade-off between computational cost and quality. 展开更多
关键词 BRISK bisecting K-mean video summarization keyframe extraction shot detection
下载PDF
A Video Captioning Method by Semantic Topic-Guided Generation
13
作者 Ou Ye Xinli Wei +2 位作者 Zhenhua Yu Yan Fu Ying Yang 《Computers, Materials & Continua》 SCIE EI 2024年第1期1071-1093,共23页
In the video captioning methods based on an encoder-decoder,limited visual features are extracted by an encoder,and a natural sentence of the video content is generated using a decoder.However,this kind ofmethod is de... In the video captioning methods based on an encoder-decoder,limited visual features are extracted by an encoder,and a natural sentence of the video content is generated using a decoder.However,this kind ofmethod is dependent on a single video input source and few visual labels,and there is a problem with semantic alignment between video contents and generated natural sentences,which are not suitable for accurately comprehending and describing the video contents.To address this issue,this paper proposes a video captioning method by semantic topic-guided generation.First,a 3D convolutional neural network is utilized to extract the spatiotemporal features of videos during the encoding.Then,the semantic topics of video data are extracted using the visual labels retrieved from similar video data.In the decoding,a decoder is constructed by combining a novel Enhance-TopK sampling algorithm with a Generative Pre-trained Transformer-2 deep neural network,which decreases the influence of“deviation”in the semantic mapping process between videos and texts by jointly decoding a baseline and semantic topics of video contents.During this process,the designed Enhance-TopK sampling algorithm can alleviate a long-tail problem by dynamically adjusting the probability distribution of the predicted words.Finally,the experiments are conducted on two publicly used Microsoft Research Video Description andMicrosoft Research-Video to Text datasets.The experimental results demonstrate that the proposed method outperforms several state-of-art approaches.Specifically,the performance indicators Bilingual Evaluation Understudy,Metric for Evaluation of Translation with Explicit Ordering,Recall Oriented Understudy for Gisting Evaluation-longest common subsequence,and Consensus-based Image Description Evaluation of the proposed method are improved by 1.2%,0.1%,0.3%,and 2.4% on the Microsoft Research Video Description dataset,and 0.1%,1.0%,0.1%,and 2.8% on the Microsoft Research-Video to Text dataset,respectively,compared with the existing video captioning methods.As a result,the proposed method can generate video captioning that is more closely aligned with human natural language expression habits. 展开更多
关键词 video captioning encoder-decoder semantic topic jointly decoding Enhance-TopK sampling
下载PDF
Workout Action Recognition in Video Streams Using an Attention Driven Residual DC-GRU Network
14
作者 Arnab Dey Samit Biswas Dac-Nhuong Le 《Computers, Materials & Continua》 SCIE EI 2024年第5期3067-3087,共21页
Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions i... Regular exercise is a crucial aspect of daily life, as it enables individuals to stay physically active, lowers thelikelihood of developing illnesses, and enhances life expectancy. The recognition of workout actions in videostreams holds significant importance in computer vision research, as it aims to enhance exercise adherence, enableinstant recognition, advance fitness tracking technologies, and optimize fitness routines. However, existing actiondatasets often lack diversity and specificity for workout actions, hindering the development of accurate recognitionmodels. To address this gap, the Workout Action Video dataset (WAVd) has been introduced as a significantcontribution. WAVd comprises a diverse collection of labeled workout action videos, meticulously curated toencompass various exercises performed by numerous individuals in different settings. This research proposes aninnovative framework based on the Attention driven Residual Deep Convolutional-Gated Recurrent Unit (ResDCGRU)network for workout action recognition in video streams. Unlike image-based action recognition, videoscontain spatio-temporal information, making the task more complex and challenging. While substantial progresshas been made in this area, challenges persist in detecting subtle and complex actions, handling occlusions,and managing the computational demands of deep learning approaches. The proposed ResDC-GRU Attentionmodel demonstrated exceptional classification performance with 95.81% accuracy in classifying workout actionvideos and also outperformed various state-of-the-art models. The method also yielded 81.6%, 97.2%, 95.6%, and93.2% accuracy on established benchmark datasets, namely HMDB51, Youtube Actions, UCF50, and UCF101,respectively, showcasing its superiority and robustness in action recognition. The findings suggest practicalimplications in real-world scenarios where precise video action recognition is paramount, addressing the persistingchallenges in the field. TheWAVd dataset serves as a catalyst for the development ofmore robust and effective fitnesstracking systems and ultimately promotes healthier lifestyles through improved exercise monitoring and analysis. 展开更多
关键词 Workout action recognition video stream action recognition residual network GRU ATTENTION
下载PDF
A Unified Model Fusing Region of Interest Detection and Super Resolution for Video Compression
15
作者 Xinkun Tang Feng Ouyang +2 位作者 Ying Xu Ligu Zhu Bo Peng 《Computers, Materials & Continua》 SCIE EI 2024年第6期3955-3975,共21页
High-resolution video transmission requires a substantial amount of bandwidth.In this paper,we present a novel video processing methodology that innovatively integrates region of interest(ROI)identification and super-... High-resolution video transmission requires a substantial amount of bandwidth.In this paper,we present a novel video processing methodology that innovatively integrates region of interest(ROI)identification and super-resolution enhancement.Our method commences with the accurate detection of ROIs within video sequences,followed by the application of advanced super-resolution techniques to these areas,thereby preserving visual quality while economizing on data transmission.To validate and benchmark our approach,we have curated a new gaming dataset tailored to evaluate the effectiveness of ROI-based super-resolution in practical applications.The proposed model architecture leverages the transformer network framework,guided by a carefully designed multi-task loss function,which facilitates concurrent learning and execution of both ROI identification and resolution enhancement tasks.This unified deep learning model exhibits remarkable performance in achieving super-resolution on our custom dataset.The implications of this research extend to optimizing low-bitrate video streaming scenarios.By selectively enhancing the resolution of critical regions in videos,our solution enables high-quality video delivery under constrained bandwidth conditions.Empirical results demonstrate a 15%reduction in transmission bandwidth compared to traditional super-resolution based compression methods,without any perceivable decline in visual quality.This work thus contributes to the advancement of video compression and enhancement technologies,offering an effective strategy for improving digital media delivery efficiency and user experience,especially in bandwidth-limited environments.The innovative integration of ROI identification and super-resolution presents promising avenues for future research and development in adaptive and intelligent video communication systems. 展开更多
关键词 Super resolution region of interest detection video compression
下载PDF
Adaptive Graph Convolutional Adjacency Matrix Network for Video Summarization
16
作者 Jing Zhang Guangli Wu Shanshan Song 《Computers, Materials & Continua》 SCIE EI 2024年第8期1947-1965,共19页
Video summarization aims to select key frames or key shots to create summaries for fast retrieval,compression,and efficient browsing of videos.Graph neural networks efficiently capture information about graph nodes an... Video summarization aims to select key frames or key shots to create summaries for fast retrieval,compression,and efficient browsing of videos.Graph neural networks efficiently capture information about graph nodes and their neighbors,but ignore the dynamic dependencies between nodes.To address this challenge,we propose an innovative Adaptive Graph Convolutional Adjacency Matrix Network(TAMGCN),leveraging the attention mechanism to dynamically adjust dependencies between graph nodes.Specifically,we first segment shots and extract features of each frame,then compute the representative features of each shot.Subsequently,we utilize the attention mechanism to dynamically adjust the adjacency matrix of the graph convolutional network to better capture the dynamic dependencies between graph nodes.Finally,we fuse temporal features extracted by Bi-directional Long Short-Term Memory network with structural features extracted by the graph convolutional network to generate high-quality summaries.Extensive experiments are conducted on two benchmark datasets,TVSum and SumMe,yielding F1-scores of 60.8%and 53.2%,respectively.Experimental results demonstrate that our method outperforms most state-of-the-art video summarization techniques. 展开更多
关键词 Attention mechanism deep learning graph neural network key-shot video summarization
下载PDF
Multi-Stream Temporally Enhanced Network for Video Salient Object Detection
17
作者 Dan Xu Jiale Ru Jinlong Shi 《Computers, Materials & Continua》 SCIE EI 2024年第1期85-104,共20页
Video salient object detection(VSOD)aims at locating the most attractive objects in a video by exploring the spatial and temporal features.VSOD poses a challenging task in computer vision,as it involves processing com... Video salient object detection(VSOD)aims at locating the most attractive objects in a video by exploring the spatial and temporal features.VSOD poses a challenging task in computer vision,as it involves processing complex spatial data that is also influenced by temporal dynamics.Despite the progress made in existing VSOD models,they still struggle in scenes of great background diversity within and between frames.Additionally,they encounter difficulties related to accumulated noise and high time consumption during the extraction of temporal features over a long-term duration.We propose a multi-stream temporal enhanced network(MSTENet)to address these problems.It investigates saliency cues collaboration in the spatial domain with a multi-stream structure to deal with the great background diversity challenge.A straightforward,yet efficient approach for temporal feature extraction is developed to avoid the accumulative noises and reduce time consumption.The distinction between MSTENet and other VSOD methods stems from its incorporation of both foreground supervision and background supervision,facilitating enhanced extraction of collaborative saliency cues.Another notable differentiation is the innovative integration of spatial and temporal features,wherein the temporal module is integrated into the multi-stream structure,enabling comprehensive spatial-temporal interactions within an end-to-end framework.Extensive experimental results demonstrate that the proposed method achieves state-of-the-art performance on five benchmark datasets while maintaining a real-time speed of 27 fps(Titan XP).Our code and models are available at https://github.com/RuJiaLe/MSTENet. 展开更多
关键词 video salient object detection deep learning temporally enhanced foreground-background collaboration
下载PDF
A Hybrid Machine Learning Approach for Improvised QoE in Video Services over 5G Wireless Networks
18
作者 K.B.Ajeyprasaath P.Vetrivelan 《Computers, Materials & Continua》 SCIE EI 2024年第3期3195-3213,共19页
Video streaming applications have grown considerably in recent years.As a result,this becomes one of the most significant contributors to global internet traffic.According to recent studies,the telecommunications indu... Video streaming applications have grown considerably in recent years.As a result,this becomes one of the most significant contributors to global internet traffic.According to recent studies,the telecommunications industry loses millions of dollars due to poor video Quality of Experience(QoE)for users.Among the standard proposals for standardizing the quality of video streaming over internet service providers(ISPs)is the Mean Opinion Score(MOS).However,the accurate finding of QoE by MOS is subjective and laborious,and it varies depending on the user.A fully automated data analytics framework is required to reduce the inter-operator variability characteristic in QoE assessment.This work addresses this concern by suggesting a novel hybrid XGBStackQoE analytical model using a two-level layering technique.Level one combines multiple Machine Learning(ML)models via a layer one Hybrid XGBStackQoE-model.Individual ML models at level one are trained using the entire training data set.The level two Hybrid XGBStackQoE-Model is fitted using the outputs(meta-features)of the layer one ML models.The proposed model outperformed the conventional models,with an accuracy improvement of 4 to 5 percent,which is still higher than the current traditional models.The proposed framework could significantly improve video QoE accuracy. 展开更多
关键词 Hybrid XGBStackQoE-model machine learning MOS performance metrics QOE 5G video services
下载PDF
Generative Multi-Modal Mutual Enhancement Video Semantic Communications
19
作者 Yuanle Chen Haobo Wang +3 位作者 Chunyu Liu Linyi Wang Jiaxin Liu Wei Wu 《Computer Modeling in Engineering & Sciences》 SCIE EI 2024年第6期2985-3009,共25页
Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the... Recently,there have been significant advancements in the study of semantic communication in single-modal scenarios.However,the ability to process information in multi-modal environments remains limited.Inspired by the research and applications of natural language processing across different modalities,our goal is to accurately extract frame-level semantic information from videos and ultimately transmit high-quality videos.Specifically,we propose a deep learning-basedMulti-ModalMutual Enhancement Video Semantic Communication system,called M3E-VSC.Built upon a VectorQuantized Generative AdversarialNetwork(VQGAN),our systemaims to leverage mutual enhancement among different modalities by using text as the main carrier of transmission.With it,the semantic information can be extracted fromkey-frame images and audio of the video and performdifferential value to ensure that the extracted text conveys accurate semantic information with fewer bits,thus improving the capacity of the system.Furthermore,a multi-frame semantic detection module is designed to facilitate semantic transitions during video generation.Simulation results demonstrate that our proposed model maintains high robustness in complex noise environments,particularly in low signal-to-noise ratio conditions,significantly improving the accuracy and speed of semantic transmission in video communication by approximately 50 percent. 展开更多
关键词 Generative adversarial networks multi-modal mutual enhancement video semantic transmission deep learning
下载PDF
Ecological problems and ecological restoration zoning of the Aral Sea
20
作者 BAO Anming YU Tao +7 位作者 XU Wenqiang LEI Jiaqiang JIAPAER Guli CHEN Xi Tojibaev KOMILJON Shomurodov KHABIBULLO Xabibullaev B SAGIDULLAEVICH Idirisov KAMALATDIN 《Journal of Arid Land》 SCIE CSCD 2024年第3期315-330,共16页
The Aral Sea was the fourth largest lake in the world but it has shrunk dramatically as a result of irrational human activities, triggering the "Aral Sea ecological crisis". The ecological problems of the Ar... The Aral Sea was the fourth largest lake in the world but it has shrunk dramatically as a result of irrational human activities, triggering the "Aral Sea ecological crisis". The ecological problems of the Aral Sea have attracted widespread attention, and the alleviation of the Aral Sea ecological crisis has reached a consensus among the five Central Asian countries(Kazakhstan, Uzbekistan, Tajikistan, Kyrgyzstan, and Turkmenistan). In the past decades, many ecological management measures have been implemented for the ecological restoration of the Aral Sea. However, due to the lack of regional planning and zoning, the results are not ideal. In this study, we mapped the ecological zoning of the Aral Sea from the perspective of ecological restoration based on soil type, soil salinity, surface water, groundwater table, Normalized Difference Vegetation Index(NDVI), land cover, and aerosol optical depth(AOD) data. Soil salinization and salt dust are the most prominent ecological problems in the Aral Sea. We divided the Aral Sea into 7 first-level ecological restoration subregions(North Aral Sea catchment area in the downstream of the Syr Darya River(Subregion Ⅰ);artificial flood overflow area in the downstream of the Aral Sea(Subregion Ⅱ);physical/chemical remediation area of the salt dust source area in the eastern part of the South Aral Sea(Subregion Ⅲ);physical/chemical remediation area of severe salinization in the central part of the South Aral Sea(Subregion Ⅳ);existing water surface and potential restoration area of the South Aral Sea(Subregion Ⅴ);Aral Sea vegetation natural recovery area(Subregion Ⅵ);and vegetation planting area with slight salinization in the South Aral Sea(Subregion Ⅶ)) and 14 second-level ecological restoration subregions according to the ecological zoning principles. Implementable measures are proposed for each ecological restoration subregion. For Subregion Ⅰ and Subregion Ⅱ with lower elevations, artificial flooding should be carried out to restore the surface of the Aral Sea. Subregion Ⅲ and Subregion Ⅳ have severe salinization, making it difficult for vegetation to grow. In these subregions, it is recommended to cover and pave the areas with green biomatrix coverings and environmentally sustainable bonding materials. In Subregion Ⅴ located in the central and western parts of the South Aral Sea, surface water recharge should be increased to ensure that this subregion can maintain normal water levels. In Subregion Ⅵ and Subregion Ⅶ where natural conditions are suitable for vegetation growth, measures such as afforestation and buffer zones should be implemented to protect vegetation. This study could provide a reference basis for future comprehensive ecological management and restoration of the Aral Sea. 展开更多
关键词 ecological restoration zoning salt and dust storms soil salinization ecological crisis Aral Sea Central Asia
下载PDF
上一页 1 2 250 下一页 到第
使用帮助 返回顶部