期刊文献+
共找到11篇文章
< 1 >
每页显示 20 50 100
Identical-video retrieval using the low-peak feature of a video's audio information 被引量:2
1
作者 Myoung-beom CHUNG Il-ju KO 《Journal of Zhejiang University-Science C(Computers and Electronics)》 SCIE EI 2010年第3期151-159,共9页
The recognition and retrieval of identical videos by combing through entire video files requires a great deal of time and memory space. Therefore, most current video-matching methods analyze only a part of each video&... The recognition and retrieval of identical videos by combing through entire video files requires a great deal of time and memory space. Therefore, most current video-matching methods analyze only a part of each video's image frame information. All these methods, however, share the critical problem of erroneously categorizing identical videos as different if they have merely been altered in resolution or converted with a different codec. This paper deals instead with an identical-video-retrieval method using the low-peak feature of audio data. The low-peak feature remains relatively stable even with changes in bit-rate or codec. The proposed method showed a search success rate of 93.7% in a video matching experiment. This approach could provide a technique for recognizing identical content on video file share sites. 展开更多
关键词 video retrieval video DNA Audio signal processing Audio feature extraction
原文传递
Advance on large scale near-duplicate video retrieval 被引量:1
2
作者 Ling Shen Richang Hong Yanbin Hao 《Frontiers of Computer Science》 SCIE EI CSCD 2020年第5期1-24,共24页
Emerging Internet services and applications attract increasing users to involve in diverse video-related activities,such as video searching,video downloading,video sharing and so on.As normal operations,they lead to a... Emerging Internet services and applications attract increasing users to involve in diverse video-related activities,such as video searching,video downloading,video sharing and so on.As normal operations,they lead to an explosive growth of online video volume,and inevitably give rise to the massive near-duplicate contents.Near-duplicate video retrieval(NDVR)has always been a hot topic.The primary purpose of this paper is to present a comprehensive survey and an updated review of the advance on large-scale NDVR to supply guidance for researchers.Specifically,we summarize and compare the definitions of near-duplicate videos(NDVs)in the literature,analyze the relationship between NDVR and its related research topics theoretically,describe its generic framework in detail,investigate the existing state-of-the-art NDVR systems.Finally,we present the development trends and research directions of this topic. 展开更多
关键词 near-duplicate videos video retrieval feature representation video signature INDEXING similarity measurement
原文传递
Semantics in Image and Video Retrieval Systems 被引量:1
3
作者 CAIJun LIXiao-fei 《The Journal of China Universities of Posts and Telecommunications》 EI CSCD 2002年第4期57-63,共7页
Multimedia document annotation is used in traditional multimedia databasesystems. However, without the help of human beings, it is very difficult to extract the semanticcontent of multimedia automatically. On the othe... Multimedia document annotation is used in traditional multimedia databasesystems. However, without the help of human beings, it is very difficult to extract the semanticcontent of multimedia automatically. On the other hand, it is a tedious job to annotate multimediadocuments in large databases one by one manually. This paper first introduces a method to constructa semantic net-work on top of a multimedia database. Second, a useful and efficient annotationstrategy is presented based on the framework to obtain an accurate and rapid annotation of anymultimedia databases. Third, two methods of joint similarity measures for semantic and low-levelfeatures are evaluated . 展开更多
关键词 image retrieval video retrieval semantic-based information retrieval MPEG-7 CONTENT-BASED FEATURE SEMANTICS
原文传递
Dynamic Hyperlinker: Innovative Solution for 3D Video Content Search and Retrieval
4
作者 Mohammad Rafiq Swash Amar Aggoun +1 位作者 Obaidullah Abdul Fatah Bei Li 《Journal of Computer and Communications》 2016年第6期10-23,共14页
Recently, 3D display technology, and content creation tools have been undergone rigorous development and as a result they have been widely adopted by home and professional users. 3D digital repositories are increasing... Recently, 3D display technology, and content creation tools have been undergone rigorous development and as a result they have been widely adopted by home and professional users. 3D digital repositories are increasing and becoming available ubiquitously. However, searching and visualizing 3D content remains a great challenge. In this paper, we propose and present the development of a novel approach for creating hypervideos, which ease the 3D content search and retrieval. It is called the dynamic hyperlinker for 3D content search and retrieval process. It advances 3D multimedia navigability and searchability by creating dynamic links for selectable and clickable objects in the video scene whilst the user consumes the 3D video clip. The proposed system involves 3D video processing, such as detecting/tracking clickable objects, annotating objects, and metadata engineering including 3D content descriptive protocol. Such system attracts the attention from both home and professional users and more specifically broadcasters and digital content providers. The experiment is conducted on full parallax holoscopic 3D videos “also known as integral images”. 展开更多
关键词 Holoscopic 3D Image Integral Image 3D video 3D Display video Search and retrieval Hyperlinker Hypervideo
下载PDF
Fish Feeding Behavior Recognition Using Adaptive DMCA-UMT Algorithm
5
作者 Caiwei Yang Xinting Yang +1 位作者 Kaijie Zhu Chao Zhou 《Journal of Beijing Institute of Technology》 EI CAS 2023年第3期285-297,共13页
Realtime analyzing the feeding behavior of fish is the premise and key to accurate guidance on feeding.The identification of fish behavior using a single information is susceptible to various factors.To overcome the p... Realtime analyzing the feeding behavior of fish is the premise and key to accurate guidance on feeding.The identification of fish behavior using a single information is susceptible to various factors.To overcome the problems,this paper proposes an adaptive deep modular co-attention unified multi-modal transformers(DMCA-UMT).By fusing the video,audio and water quality parameters,the whole process of fish feeding behavior could be identified.Firstly,for the input video,audio and water quality parameter information,features are extracted to obtain feature vectors of different modalities.Secondly,deep modular co-attention(DMCA)is introduced on the basis of the original cross-modal encoder,and the adaptive learnable weights are added.The feature vector of video and audio joint representation is obtained by automatic learning based on fusion contribution.Finally,the information of visual-audio modality fusion and text features are used to generate clip-level moment queries.The query decoder decodes the input features and uses the prediction head to obtain the final joint moment retrieval,which is the start-end time of feeding the fish.The results show that the mAP Avg of the proposed algorithm reaches 75.3%,which is37.8%higher than that of unified multi-modal transformers(UMT)algorithm. 展开更多
关键词 AQUACULTURE multi-modal fusion deep modular co-attention(DMCA) unified multimodal transformers(UMT) video moment retrieval
下载PDF
Retrieval of flower videos based on a query with multiple species of flowers
6
作者 V.K.Jyothi V.N.Manjunath Aradhya +1 位作者 Y.H.Sharath Kumar D.S.Guru 《Artificial Intelligence in Agriculture》 2021年第1期262-277,共16页
Searching,recognizing and retrieving a video of interest froma large collection of a video data is an instantaneous requirement.This requirement has been recognized as an active area of research in computer vision,mac... Searching,recognizing and retrieving a video of interest froma large collection of a video data is an instantaneous requirement.This requirement has been recognized as an active area of research in computer vision,machine learning and pattern recognition.Flower video recognition and retrieval is vital in the field of floriculture and horticulture.In this paper we propose a model for the retrieval of videos of flowers.Initially,videos are represented with keyframes and flowers in keyframes are segmented from their background.Then,the model is analysed by features extracted from flower regions of the keyframe.A Linear Discriminant Analysis(LDA)is adapted for the extraction of discriminating features.Multiclass Support VectorMachine(MSVM)classifier is applied to identify the class of the query video.Experiments have been conducted on relatively large dataset of our own,consisting of 7788 videos of 30 different species of flowers captured from three different devices.Generally,retrieval of flower videos is addressed by the use of a query video consisting of a flower of a single species.In this work we made an attempt to develop a system consisting of retrieval of similar videos for a query video consisting of flowers of different species. 展开更多
关键词 Flower region of interest(FRoI) Linear discriminant analysis(LDA) retrieval of flower videos Multiclass support vector machine
原文传递
Visual Ontology Construction for Digitized Art Image Retrieval 被引量:7
7
作者 蒋树强 杜军 +2 位作者 黄庆明 黄铁军 高文 《Journal of Computer Science & Technology》 SCIE EI CSCD 2005年第6期855-860,共6页
Current investigations on visual information retrieval are generally content-based methods. The significant difference between similarity in low-level features and similarity in high-level semantic meanings is still a... Current investigations on visual information retrieval are generally content-based methods. The significant difference between similarity in low-level features and similarity in high-level semantic meanings is still a major challenge in the area of image retrieval. In this work, a scheme for constructing visual ontology to retrieve art images is proposed. The proposed ontology describes images in various aspects, including type & style, objects and global perceptual effects. Concepts in the ontology could be automatically derived. Various art image classification methods are employed based on low-level image features. Non-objective semantics are introduced, and how to express these semantics is given. The proposed ontology scheme could make users more naturally find visual information and thus narrows the “semantic gap”. Experimental implementation demonstrates its good potential for retrieving art images in a human-centered manner. 展开更多
关键词 ontology design image/video retrieval image database
原文传递
A comprehensive review of significant researches on content based indexing and retrieval of visual information 被引量:3
8
作者 R. PRIYA T. N. SHANMUGAM 《Frontiers of Computer Science》 SCIE EI CSCD 2013年第5期782-799,共18页
Developments in multimedia technologies have paved way for the storage of huge collections of video doc- uments on computer systems. It is essential to design tools for content-based access to the documents, so as to ... Developments in multimedia technologies have paved way for the storage of huge collections of video doc- uments on computer systems. It is essential to design tools for content-based access to the documents, so as to allow an efficient exploitation of these collections. Content based anal- ysis provides a flexible and powerful way to access video data when compared with the other traditional video analysis tech- niques. The area of content based video indexing and retrieval (CBVIR), focusing on automating the indexing, retrieval and management of video, has attracted extensive research in the last decade. CBVIR is a lively area of research with endur- ing acknowledgments from several domains. Herein a vital assessment of contemporary researches associated with the content-based indexing and retrieval of visual information. In this paper, we present an extensive review of significant researches on CBV1R. Concise description of content based video analysis along with the techniques associated with the content based video indexing and retrieval is presented. 展开更多
关键词 nultimedia information content based video retrieval (CBVR) content based video indexing and retrieval (CBVIR) shot segmentation object segmentation feature extraction INDEXING motion estimation QUERYING key frame retrieval and indexing.
原文传递
Video Key Frame Extraction by Unsupervised Clustering and Feedback Adjustment 被引量:2
9
作者 庄越挺 芮勇 《Journal of Computer Science & Technology》 SCIE EI CSCD 1999年第3期283-288,F003,共7页
In video information retrieval, key frame extraction has been rec ognized as one of the important research issues. Although much progress has been made, the existing approaches are either computationally expensive or ... In video information retrieval, key frame extraction has been rec ognized as one of the important research issues. Although much progress has been made, the existing approaches are either computationally expensive or ineffective in capturing salient visual content. In this paper, we first discuss the importance of key frame extraction and then briefly review and evaluate the existing approaches. To overcome the shortcomings of the existing approaches, we introduce a new algorithm for key frame extraction based on unsupervised clustering. Meanwhile, we provide a feedback chain to adjust the granularity of the extraction result. The proposed algorithm is both computationally simple and able to capture the visual content.The efficiency and effectiveness are validated by large amount of real-world videos. 展开更多
关键词 key frame extraction CLUSTERING FEEDBACK video retrieval
原文传递
Key Frame Extraction Using Unsupervised Clustering Based on a Statistical Model 被引量:5
10
作者 阳书平 林行刚 《Tsinghua Science and Technology》 SCIE EI CAS 2005年第2期169-173,共5页
This paper proposes a novel algorithm for extracting key frames to represent video shots. Re- garding whether, or how well, a key frame represents a shot, different interpretations have been suggested. We develop ou... This paper proposes a novel algorithm for extracting key frames to represent video shots. Re- garding whether, or how well, a key frame represents a shot, different interpretations have been suggested. We develop our algorithm on the assumption that more important content may demand more attention and may last relatively more frames. Unsupervised clustering is used to divide the frames into clusters within a shot, and then a key frame is selected from each candidate cluster. To make the algorithm independent of video sequences, we employ a statistical model to calculate the clustering threshold. The proposed algo- rithm can capture the important yet salient content as the key frame. Its robustness and adaptability are validated by experiments with various kinds of video sequences. 展开更多
关键词 key frame video retrieval motion compensation
原文传递
Semantic and Structural Analysis of TV Diving Programs
11
作者 FeiWang Jin-TaoLi Yong-DongZhang Shou-XunLin 《Journal of Computer Science & Technology》 SCIE EI CSCD 2004年第6期928-935,共8页
Automatic content analysis of sports videos is a valuable and challenging task. Motivated by analogies between a class of sports videos and languages, the authors propose a novel approach for sports video analysis bas... Automatic content analysis of sports videos is a valuable and challenging task. Motivated by analogies between a class of sports videos and languages, the authors propose a novel approach for sports video analysis based on compiler principles. It integrates both semantic analysis and syntactic analysis to automatically create an index and a table of contents for a sports video. Each shot of the video sequence is first annotated and indexed with semantic labels through detection of events using domain knowledge. A grammar-based parser is then constructed to identify the tree structure of the video content based on the labels. Meanwhile, the grammar can be used to detect and recover errors during the analysis. As a case study, a sports video parsing system is presented in the particular domain of diving. Experimental results indicate the proposed approach is effective. Keywords sports video - event detection - grammar - video retrieval - content analysis and index This work was supported in part by the State Physical Culture Administration of China under Grant No.02005.Fei Wang was born in 1977. He is a Ph.D. candidate at Institute of Computing Technology (ICT), the Chinese Academy of Sciences (CAS). He received the B.S. degree in electrical engineering from Zhejiang University in 1999 and the M.S degree in computer science from Graduate School of the Chinese Academy of Sciences in 2001. His current research interests include content-based video analysis and retrieval.Jin-Tao Li was born in 1962. He is a professor and Ph.D. supervisor at ICT, CAS. His main research areas include multimedia data compression, virtual reality, and home network.Yong-Dong Zhang was born in 1973. He is an associate professor at ICT, CAS. His main research areas include multimedia data compression and multimedia information retrieval.Shou-Xun Lin was born in 1948. He is a professor and Ph.D. supervisor at ICT, CAS. His main research areas include multimedia technology and systems. 展开更多
关键词 sports video event detection GRAMMAR video retrieval content analysis and index
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部