期刊文献+
共找到3篇文章
< 1 >
每页显示 20 50 100
Digit Recognition in Natural Scene Texts
1
作者 Shih-Wei Sun 《Journal of Electronic Science and Technology》 CAS CSCD 2017年第2期199-206,共8页
Digit recognition from a natural scene text in video surveillance/broadcasting applications is a challenging research task due to blurred, font variations, twisted, and non-uniform color distribution issues with a dig... Digit recognition from a natural scene text in video surveillance/broadcasting applications is a challenging research task due to blurred, font variations, twisted, and non-uniform color distribution issues with a digit in a natural scene to be recognized. In this paper, to solve the digit number recognition problem, a principal-axis based topology contour descriptor with support vector machine (SVM) classification is proposed. The contributions of this paper include: a) a local descriptor with SVM classification for digit recognition, b) higher accuracy than the state-of-the art methods, and c) low computational power (0.03 second/digit recognition), which make this method adoptable to real-time applications. 展开更多
关键词 Index Terms--Digit recognition scene text sports video video surveillance.
下载PDF
Real-time distance field acceleration based free-viewpoint video synthesis for large sports fields
2
作者 Yanran Dai Jing Li +5 位作者 Yuqi Jiang Haidong Qin Bang Liang Shikuan Hong Haozhe Pan Tao Yang 《Computational Visual Media》 SCIE EI 2024年第2期331-353,共23页
Free-viewpoint video allows the user to view objects from any virtual perspective,creating an immersive visual experience.This technology enhances the interactivity and freedom of multimedia performances.However,many ... Free-viewpoint video allows the user to view objects from any virtual perspective,creating an immersive visual experience.This technology enhances the interactivity and freedom of multimedia performances.However,many free-viewpoint video synthesis methods hardly satisfy the requirement to work in real time with high precision,particularly for sports fields having large areas and numerous moving objects.To address these issues,we propose a freeviewpoint video synthesis method based on distance field acceleration.The central idea is to fuse multiview distance field information and use it to adjust the search step size adaptively.Adaptive step size search is used in two ways:for fast estimation of multiobject three-dimensional surfaces,and synthetic view rendering based on global occlusion judgement.We have implemented our ideas using parallel computing for interactive display,using CUDA and OpenGL frameworks,and have used real-world and simulated experimental datasets for evaluation.The results show that the proposed method can render free-viewpoint videos with multiple objects on large sports fields at 25 fps.Furthermore,the visual quality of our synthetic novel viewpoint images exceeds that of state-of-the-art neural-rendering-based methods. 展开更多
关键词 free-viewpoint video view synthesis camera array distance field sports video
原文传递
Semantic and Structural Analysis of TV Diving Programs
3
作者 FeiWang Jin-TaoLi Yong-DongZhang Shou-XunLin 《Journal of Computer Science & Technology》 SCIE EI CSCD 2004年第6期928-935,共8页
Automatic content analysis of sports videos is a valuable and challenging task. Motivated by analogies between a class of sports videos and languages, the authors propose a novel approach for sports video analysis bas... Automatic content analysis of sports videos is a valuable and challenging task. Motivated by analogies between a class of sports videos and languages, the authors propose a novel approach for sports video analysis based on compiler principles. It integrates both semantic analysis and syntactic analysis to automatically create an index and a table of contents for a sports video. Each shot of the video sequence is first annotated and indexed with semantic labels through detection of events using domain knowledge. A grammar-based parser is then constructed to identify the tree structure of the video content based on the labels. Meanwhile, the grammar can be used to detect and recover errors during the analysis. As a case study, a sports video parsing system is presented in the particular domain of diving. Experimental results indicate the proposed approach is effective. Keywords sports video - event detection - grammar - video retrieval - content analysis and index This work was supported in part by the State Physical Culture Administration of China under Grant No.02005.Fei Wang was born in 1977. He is a Ph.D. candidate at Institute of Computing Technology (ICT), the Chinese Academy of Sciences (CAS). He received the B.S. degree in electrical engineering from Zhejiang University in 1999 and the M.S degree in computer science from Graduate School of the Chinese Academy of Sciences in 2001. His current research interests include content-based video analysis and retrieval.Jin-Tao Li was born in 1962. He is a professor and Ph.D. supervisor at ICT, CAS. His main research areas include multimedia data compression, virtual reality, and home network.Yong-Dong Zhang was born in 1973. He is an associate professor at ICT, CAS. His main research areas include multimedia data compression and multimedia information retrieval.Shou-Xun Lin was born in 1948. He is a professor and Ph.D. supervisor at ICT, CAS. His main research areas include multimedia technology and systems. 展开更多
关键词 sports video event detection GRAMMAR video retrieval content analysis and index
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部