期刊文献+
共找到8篇文章
< 1 >
每页显示 20 50 100
ER-Net:Efficient Recalibration Network for Multi-ViewMulti-Person 3D Pose Estimation
1
作者 Mi Zhou Rui Liu +1 位作者 Pengfei Yi Dongsheng Zhou 《Computer Modeling in Engineering & Sciences》 SCIE EI 2023年第8期2093-2109,共17页
Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application scenarios.With the introduction of end-to-end direct regression methods,the fi... Multi-view multi-person 3D human pose estimation is a hot topic in the field of human pose estimation due to its wide range of application scenarios.With the introduction of end-to-end direct regression methods,the field has entered a new stage of development.However,the regression results of joints that are more heavily influenced by external factors are not accurate enough even for the optimal method.In this paper,we propose an effective feature recalibration module based on the channel attention mechanism and a relative optimal calibration strategy,which is applied to themulti-viewmulti-person 3D human pose estimation task to achieve improved detection accuracy for joints that are more severely affected by external factors.Specifically,it achieves relative optimal weight adjustment of joint feature information through the recalibration module and strategy,which enables the model to learn the dependencies between joints and the dependencies between people and their corresponding joints.We call this method as the Efficient Recalibration Network(ER-Net).Finally,experiments were conducted on two benchmark datasets for this task,Campus and Shelf,in which the PCP reached 97.3% and 98.3%,respectively. 展开更多
关键词 Multi-view multi-person pose estimation attention mechanism computer vision
下载PDF
Multi-Person Device-Free Gesture Recognition Using mmWave Signals 被引量:1
2
作者 Jie Wang Zhouhua Ran +3 位作者 Qinghua Gao Xiaorui Ma Miao Pan Kaiping Xue 《China Communications》 SCIE CSCD 2021年第2期186-199,共14页
Device-free gesture recognition is an emerging wireless sensing technique which could recognize gestures by analyzing its influence on surrounding wireless signals,it may empower wireless networks with the augmented s... Device-free gesture recognition is an emerging wireless sensing technique which could recognize gestures by analyzing its influence on surrounding wireless signals,it may empower wireless networks with the augmented sensing ability.Researchers have made great achievements for singleperson device-free gesture recognition.However,when multiple persons conduct gestures simultaneously,the received signals will be mixed together,and thus traditional methods would not work well anymore.Moreover,the anonymity of persons and the change in the surrounding environment would cause feature shift and mismatch,and thus the recognition accuracy would degrade remarkably.To address these problems,we explore and exploit the diversity of spatial information and propose a multidimensional analysis method to separate the gesture feature of each person using a focusing sensing strategy.Meanwhile,we also present a deep-learning based robust device free gesture recognition framework,which leverages an adversarial approach to extract robust gesture feature that is insensitive to the change of persons and environment.Furthermore,we also develop a 77GHz mmWave prototype system and evaluate the proposed methods extensively.Experimental results reveal that the proposed system can achieve average accuracies of 93%and 84%when 10 gestures are conducted in Received:Jun.18,2020 Revised:Aug.06,2020 Editor:Ning Ge different environments by two and four persons simultaneously,respectively. 展开更多
关键词 device-free gesture recognition wireless sensing multi-person deep-learning
下载PDF
A unified multi-view multi-person tracking framework
3
作者 Fan Yang Shigeyuki Odashima +3 位作者 Sosuke Yamao Hiroaki Fujimoto Shoichi Masui Shan Jiang 《Computational Visual Media》 SCIE EI CSCD 2024年第1期137-160,共24页
Despite significant developments in 3D multi-view multi-person (3D MM) tracking, current frameworks separately target footprint tracking, or pose tracking. Frameworks designed for the former cannot be used for the lat... Despite significant developments in 3D multi-view multi-person (3D MM) tracking, current frameworks separately target footprint tracking, or pose tracking. Frameworks designed for the former cannot be used for the latter, because they directly obtain 3D positions on the ground plane via a homography projection, which is inapplicable to 3D poses above the ground. In contrast, frameworks designed for pose tracking generally isolate multi-view and multi-frame associations and may not be sufficiently robust for footprint tracking, which utilizes fewer key points than pose tracking, weakening multi-view association cues in a single frame. This study presents a unified multi-view multi-person tracking framework to bridge the gap between footprint tracking and pose tracking. Without additional modifications, the framework can adopt monocular 2D bounding boxes and 2D poses as its input to produce robust 3D trajectories for multiple persons. Importantly, multi-frame and multi-view information are jointly employed to improve association and triangulation. Our framework is shown to provide state-of-the-art performance on the Campus and Shelf datasets for 3D pose tracking, with comparable results on the WILDTRACK and MMPTRACK datasets for 3D footprint tracking. 展开更多
关键词 multi-camera multi-person tracking pose tracking footprint tracking TRIANGULATION spatiotemporal clustering
原文传递
MILI:Multi-person inference from a low-resolution image
4
作者 Kun Li Yunke Liu +1 位作者 Yu-Kun Lai Jingyu Yang 《Fundamental Research》 CAS CSCD 2023年第3期434-441,共8页
Existing multi-person reconstruction methods require the human bodies in the input image to occupy a considerable portion of the picture.However,low-resolution human objects are ubiquitous due to trade-offbetween the ... Existing multi-person reconstruction methods require the human bodies in the input image to occupy a considerable portion of the picture.However,low-resolution human objects are ubiquitous due to trade-offbetween the field of view and target distance given a limited camera resolution.In this paper,we propose an end-to-end multi-task framework for multi-person inference from a low-resolution image(MILI).To perceive more information from a low-resolution image,we use pair-wise images at high resolution and low resolution for training,and design a restoration network with a simple loss for better feature extraction from the low-resolution image.To address the occlusion problem in multi-person scenes,we propose an occlusion-aware mask prediction network to estimate the mask of each person during 3D mesh regression.Experimental results on both small-scale scenes and large-scale scenes demonstrate that our method outperforms the state-of-the-art methods both quantitatively and qualitatively.The code is available at http://cic.tju.edu.cn/faculty/likun/projects/MILI. 展开更多
关键词 multi-person reconstruction Low-resolution human objects End-to-end Multi-task learning Occlusion-aware prediction
原文传递
FaSRnet:a feature and semantics refinement network for human pose estimation
5
作者 Yuanhong ZHONG Qianfeng XU +2 位作者 Daidi ZHONG Xun YANG Shanshan WANG 《Frontiers of Information Technology & Electronic Engineering》 SCIE EI CSCD 2024年第4期513-526,共14页
Due to factors such as motion blur,video out-of-focus,and occlusion,multi-frame human pose estimation is a challenging task.Exploiting temporal consistency between consecutive frames is an efficient approach for addre... Due to factors such as motion blur,video out-of-focus,and occlusion,multi-frame human pose estimation is a challenging task.Exploiting temporal consistency between consecutive frames is an efficient approach for addressing this issue.Currently,most methods explore temporal consistency through refinements of the final heatmaps.The heatmaps contain the semantics information of key points,and can improve the detection quality to a certain extent.However,they are generated by features,and feature-level refinements are rarely considered.In this paper,we propose a human pose estimation framework with refinements at the feature and semantics levels.We align auxiliary features with the features of the current frame to reduce the loss caused by different feature distributions.An attention mechanism is then used to fuse auxiliary features with current features.In terms of semantics,we use the difference information between adjacent heatmaps as auxiliary features to refine the current heatmaps.The method is validated on the large-scale benchmark datasets PoseTrack2017 and PoseTrack2018,and the results demonstrate the effectiveness of our method. 展开更多
关键词 Human pose estimation Multi-frame refinement Heatmap and offset estimation Feature alignment multi-person
原文传递
Bidirectional Optimization Coupled Ligh tweight Net works for Efficient and Robust Multi-Person 2D Pose Estimation 被引量:3
6
作者 Shnai Li Zheng Fang +2 位作者 Wen-Feng Song Ai-Min Hao Hong Qin 《Journal of Computer Science & Technology》 SCIE EI CSCD 2019年第3期522-536,共15页
For multi-person 2D pose estimation,current deep learning baised methods have exhibited impressive performance,but the trade-offs among efficiency,robustness,and accuracy in the existing approaches remain unavoidable.... For multi-person 2D pose estimation,current deep learning baised methods have exhibited impressive performance,but the trade-offs among efficiency,robustness,and accuracy in the existing approaches remain unavoidable.In principle,bottom-up methods are superior to top-down methods in efficiency,but they perform worse in accuracy.To make full use of their respective advantages,in this paper we design a novel bidirectional optimization coupled lightweight network(BOCLN)architecture for efficient,robust,and general-purpose multi-person 2D(2-dimensional)pose estimation from natural images.With the BOCLN framework,the bottom-up network focuses oil global features,while the top-down net work places emphasis on det ailed features.The entire framework shares global features along the bottom-up data stream,while the top-down data stream aims to accelerate the accurate pose estimation.In particular,to exploit the priors of human joints'relationship,we propose a probability limb heat map to represent the spatial context of the joints and guide the overall pose skeleton prediction,so that each person's pose estimation in cluttered scenes(involving crowd)could be as accurate and robust as possible.Therefore,benefiting from the novel BOCLN architecture,the tinie-consuming refinement procedure could be much simplified to an efficient lightweight network.Extensive experiments and evaluations on public benchmarks have confirmed that our new method is more efficient and robust,yet still attain competitive accuracy performance compared with the state-of-the-art methods.Our BOCLN shows even greater promise in online applications. 展开更多
关键词 BIDIRECTIONAL OPTIMIZATION computer vision deep learning probability LIMB heat map 2D multi-person POSE estimation
原文传递
Multi-person vision tracking approach based on human body localization features
7
作者 Ao-Lei Yang Hai-Yan Ren +1 位作者 Min-Rui Fei Wasif Naeem 《Advances in Manufacturing》 SCIE EI CAS CSCD 2021年第4期496-508,共13页
This paper presents a multi-person vision tracking approach based on human body localization features to address the problem of interactive object localization and tracking in a home monitoring scenario.Firstly,the hu... This paper presents a multi-person vision tracking approach based on human body localization features to address the problem of interactive object localization and tracking in a home monitoring scenario.Firstly,the human body localization model is used to obtain the 3D position of the human body,which is then used to construct the human body motion model based on the Kalman filter method.At the same time,the human appearance model is constructed by fusing human color features and features of the histogram of oriented gradient to better characterize the human body.Secondly,the human body observation model is constructed based on the human body motion model and appearance model to measure the similarities between the human body state sequence in the historical frame and the human body observation result in the current frame,and the cost matrix is then obtained.Thirdly,the Hungarian maximum matching algorithm is employed to match each human body in the current and historical frames,and the exception detection mechanism is simultaneously constructed to further reduce the probability of human tracking and matching failure.Finally,a multi-person vision tracking verification platform was constructed,and the achieved average accuracy was 96.6%in the case of human body overlapping,occlusion,disappearance,and appearance;this verifies the feasibility and effectiveness of the proposed method. 展开更多
关键词 multi-person vision tracking Human body positioning Motion model Body observation model
原文传递
3D hypothesis clustering for cross-view matching in multiperson motion capture 被引量:1
8
作者 Miaopeng Li Zimeng Zhou Xinguo Liu 《Computational Visual Media》 CSCD 2020年第2期147-156,共10页
We present a multiview method for markerless motion capture of multiple people. The main challenge in this problem is to determine crossview correspondences for the 2 D joints in the presence of noise. We propose a 3 ... We present a multiview method for markerless motion capture of multiple people. The main challenge in this problem is to determine crossview correspondences for the 2 D joints in the presence of noise. We propose a 3 D hypothesis clustering technique to solve this problem. The core idea is to transform joint matching in 2 D space into a clustering problem in a 3 D hypothesis space. In this way, evidence from photometric appearance, multiview geometry, and bone length can be integrated to solve the clustering problem efficiently and robustly. Each cluster encodes a set of matched 2 D joints for the same person across different views, from which the 3 D joints can be effectively inferred. We then assemble the inferred 3 D joints to form full-body skeletons for all persons in a bottom–up way. Our experiments demonstrate the robustness of our approach even in challenging cases with heavy occlusion,closely interacting people, and few cameras. We have evaluated our method on many datasets, and our results show that it has significantly lower estimation errors than many state-of-the-art methods. 展开更多
关键词 multi-person motion capture cross-view matching CLUSTERING human pose estimation
原文传递
上一页 1 下一页 到第
使用帮助 返回顶部