期刊文献+
共找到5篇文章
< 1 >
每页显示 20 50 100
Few-Shot Object Detection Based on the Transformer and High-Resolution Network 被引量:1
1
作者 Dengyong Zhang Huaijian Pu +2 位作者 Feng Li Xiangling Ding Victor S.Sheng 《Computers, Materials & Continua》 SCIE EI 2023年第2期3439-3454,共16页
Now object detection based on deep learning tries different strategies.It uses fewer data training networks to achieve the effect of large dataset training.However,the existing methods usually do not achieve the balan... Now object detection based on deep learning tries different strategies.It uses fewer data training networks to achieve the effect of large dataset training.However,the existing methods usually do not achieve the balance between network parameters and training data.It makes the information provided by a small amount of picture data insufficient to optimize model parameters,resulting in unsatisfactory detection results.To improve the accuracy of few shot object detection,this paper proposes a network based on the transformer and high-resolution feature extraction(THR).High-resolution feature extractionmaintains the resolution representation of the image.Channels and spatial attention are used to make the network focus on features that are more useful to the object.In addition,the recently popular transformer is used to fuse the features of the existing object.This compensates for the previous network failure by making full use of existing object features.Experiments on the Pascal VOC and MS-COCO datasets prove that the THR network has achieved better results than previous mainstream few shot object detection. 展开更多
关键词 Object detection few shot object detection TRANSFORMER HIGH-RESOLUTION
下载PDF
RANDOM FOREST FOR INTERMEDIATE DESCRIPTOR FUSION IN SHOT BOUNDARY DETECTION 被引量:1
2
作者 Zhang Lei Chang Anqi Xiang Xuezhi 《Journal of Electronics(China)》 2014年第5期465-472,共8页
Shot boundary detection is the fundamental part in many real applications as video retrieval and so on. This paper tackles the problem of video segment obtaining in complex movie videos. Firstly, intermediate descript... Shot boundary detection is the fundamental part in many real applications as video retrieval and so on. This paper tackles the problem of video segment obtaining in complex movie videos. Firstly, intermediate descriptor is proposed to depict the variation of both abrupt and gradual change in shot boundaries, which is formed by distance vector on Local Binary Pattern(LBP), GIST(GIST) or their fusion. Instead of just using the adjacent frames distance, intermediate descriptor keeps the distances between current frame and consecutive frames. It comprehensively characterizes local temporal structure, which is especially important for gradual change. For the excellent ability for feature fusion in random forests, it is adopted here to verify the fusion effect of intermediate descriptor on LBP and GIST. The whole experiments are designed on the subset of TRECVid 2013 INS(INstance Search) task to verify the effectiveness of proposed intermediate descriptor and the fusion ability for random forest. Compared with static and adaptive thresholds approaches, the best performance can be achieved by post-fusion of intermediate descriptor on LBP and GIST. 展开更多
关键词 shot boundary detection Intermediate descriptor Random forest ~sion Gist (GIST) Local Binary Pattern (LBP)
下载PDF
Video Shot Boundary Detection in MPEG Compressed Sequences Using SVM Learning 被引量:1
3
作者 GUO Lihua YANG Shutang LIJianhua TONGZhipeng(School of Electronic and Information Technology,Shanghai JiaoTong University Shanghai 200030 China) 《Journal of Electronic Science and Technology of China》 2003年第1期15-17,28,共4页
A number of automated video shot boundary detection methods for indexing a videosequence to facilitate browsing and retrieval have been proposed in recent years.Among these methods,the dissolve shot boundary isn't... A number of automated video shot boundary detection methods for indexing a videosequence to facilitate browsing and retrieval have been proposed in recent years.Among these methods,the dissolve shot boundary isn't accurately detected because it involves the camera operation and objectmovement.In this paper,a method based on support vector machine (SVM) is proposed to detect thedissolve shot boundary in MPEG compressed sequence.The problem of detection between the dissolveshot boundary and other boundaries is considered as two-class classification in our method.Featuresfrom the compressed sequences are directly extracted without decoding them,and the optimal classboundary between two classes are learned from training data by using SVM.Experiments,whichcompare various classification methods,show that using proposed method encourages performance ofvideo shot boundary detection. 展开更多
关键词 video shot boundary detection dissolve detection MPEG compressed sequences support vector machine(SVM)
下载PDF
Video Summarization Approach Based on Binary Robust Invariant Scalable Keypoints and Bisecting K-Means
4
作者 Sameh Zarif Eman Morad +3 位作者 Khalid Amin Abdullah Alharbi Wail S.Elkilani Shouze Tang 《Computers, Materials & Continua》 SCIE EI 2024年第3期3565-3583,共19页
Due to the exponential growth of video data,aided by rapid advancements in multimedia technologies.It became difficult for the user to obtain information from a large video series.The process of providing an abstract ... Due to the exponential growth of video data,aided by rapid advancements in multimedia technologies.It became difficult for the user to obtain information from a large video series.The process of providing an abstract of the entire video that includes the most representative frames is known as static video summarization.This method resulted in rapid exploration,indexing,and retrieval of massive video libraries.We propose a framework for static video summary based on a Binary Robust Invariant Scalable Keypoint(BRISK)and bisecting K-means clustering algorithm.The current method effectively recognizes relevant frames using BRISK by extracting keypoints and the descriptors from video sequences.The video frames’BRISK features are clustered using a bisecting K-means,and the keyframe is determined by selecting the frame that is most near the cluster center.Without applying any clustering parameters,the appropriate clusters number is determined using the silhouette coefficient.Experiments were carried out on a publicly available open video project(OVP)dataset that contained videos of different genres.The proposed method’s effectiveness is compared to existing methods using a variety of evaluation metrics,and the proposed method achieves a trade-off between computational cost and quality. 展开更多
关键词 BRISK bisecting K-mean video summarization keyframe extraction shot detection
下载PDF
Improved Dissolve Detection for Video Segmentation
5
作者 曾昭平 MA +2 位作者 Zhonghua Zhang Wenjun 《High Technology Letters》 EI CAS 2003年第2期44-46,共3页
It is difficult to detect dissolve accurately in video segmentation. Two new parameters AEI and IDM are computed to describe dissolve. An improved method based on the change curves of AEI and IDM is proposed to detect... It is difficult to detect dissolve accurately in video segmentation. Two new parameters AEI and IDM are computed to describe dissolve. An improved method based on the change curves of AEI and IDM is proposed to detect dissolve accurately. The experiments show that this method can detect dissolve accurately. 展开更多
关键词 shot detection dissolve video indexing
下载PDF
上一页 1 下一页 到第
使用帮助 返回顶部