Funding: Funded by the Science and Technology Project of Hebei Education Department (No. ZD2022100).
Abstract: As technology advances, more and more people spend long hours at computer desks and on mobile phones, whether for entertainment, study, or work. Prolonged sitting with little attention to posture can, over time, lead to health problems such as hunchback, lumbar muscle strain, and shoulder and neck pain. To address this issue, we designed a computer vision-based human body posture detection system. The system uses YOLOv8 to locate key points of the human skeleton, then analyzes the coordinates and depth information of these key points to establish criteria for distinguishing different postures. With the assistance of an SVM classifier, the system achieves an average recognition rate of 95%. Finally, we deployed the posture detection system on Raspberry Pi hardware and conducted extensive testing. The test results show that the system can effectively detect various postures and remind users in real time to correct poor posture, demonstrating good practicality and stability.
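As a rough illustration of how key-point coordinates can be turned into a posture criterion, the sketch below computes the lean of the torso from two 2D key points. It is a minimal sketch under stated assumptions: the function names, the choice of shoulder and hip key points, and the 15-degree threshold are all hypothetical, and the simple threshold rule stands in for the trained SVM described in the abstract. Depth information is not modeled.

```python
import math


def torso_lean_degrees(shoulder, hip):
    """Angle between the hip-to-shoulder line and the image vertical, in degrees.

    Points are (x, y) pixel coordinates; image y grows downward.
    """
    dx = shoulder[0] - hip[0]
    dy = hip[1] - shoulder[1]  # flip sign so "up" is positive
    return math.degrees(math.atan2(abs(dx), dy))


def classify_posture(shoulder, hip, max_lean_deg=15.0):
    """Hypothetical threshold rule standing in for a trained classifier."""
    lean = torso_lean_degrees(shoulder, hip)
    return "upright" if lean <= max_lean_deg else "leaning"


if __name__ == "__main__":
    # Made-up key points: shoulder directly above hip, then shifted sideways.
    print(classify_posture((100, 50), (100, 150)))
    print(classify_posture((200, 50), (100, 150)))
```

In a fuller pipeline, features like this angle would be collected for several key-point pairs and passed to the classifier rather than compared against a fixed threshold.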
Abstract: Vision-simulated imagery, the process of generating images that mimic the human visual system, is a valuable tool with a wide spectrum of possible applications, including visual acuity measurements, personalized planning of corrective lenses and surgeries, vision-correcting displays, vision-related hardware development, and extended reality discomfort reduction. A critical property of human vision is that it is imperfect because of the highly influential wavefront aberrations that vary from person to person. This study provides an overview of the existing computational image generation techniques that properly simulate human vision in the presence of wavefront aberrations. These algorithms typically apply ray tracing with a detailed description of the simulated eye or utilize the point-spread function of the eye to perform convolution on the input image. Based on the description of the vision simulation techniques, several of their characteristic features have been evaluated and some potential application areas and research directions have been outlined.
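The convolution-based approach mentioned above can be sketched in a few lines. This is a minimal sketch, assuming a grayscale image stored as a list of rows and a normalized Gaussian kernel as a stand-in for the point-spread function. A real aberrated eye has a PSF derived from its measured wavefront, which this example does not attempt to model. The function names are illustrative only.

```python
import math


def gaussian_psf(radius, sigma):
    """Build a normalized (2*radius+1) x (2*radius+1) Gaussian kernel.

    Assumption: a Gaussian is only a placeholder for an eye's true PSF.
    """
    size = 2 * radius + 1
    kernel = [
        [math.exp(-((x - radius) ** 2 + (y - radius) ** 2) / (2 * sigma ** 2))
         for x in range(size)]
        for y in range(size)
    ]
    total = sum(map(sum, kernel))
    return [[value / total for value in row] for row in kernel]


def convolve(image, psf):
    """Convolve a grayscale image with the PSF, clamping indices at the borders."""
    height, width = len(image), len(image[0])
    r = len(psf) // 2
    out = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            acc = 0.0
            for ky in range(-r, r + 1):
                for kx in range(-r, r + 1):
                    sy = min(max(y - ky, 0), height - 1)
                    sx = min(max(x - kx, 0), width - 1)
                    acc += image[sy][sx] * psf[ky + r][kx + r]
            out[y][x] = acc
    return out


if __name__ == "__main__":
    # A single bright pixel spreads into a blur spot shaped like the PSF.
    point = [[0.0] * 5 for _ in range(5)]
    point[2][2] = 1.0
    for row in convolve(point, gaussian_psf(1, 1.0)):
        print(["%.3f" % v for v in row])
```

Because the kernel is normalized, the blur redistributes intensity without changing overall brightness, which is why a uniform image passes through unchanged.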
Abstract: The inter-class face classification problem is more reasonable than the intra-class classification problem. To address this issue, we have carried out empirical research on classifying Indian people by geographical region. This work aimed to construct a computational classification model for Indian regional face images acquired from the south and east regions of India, with reference to human vision. We created an Automated Human Intelligence System (AHIS) to evaluate human visual capabilities. Analysis of the AHIS responses showed that face shape is a discriminative feature among the facial features. We developed a modified convolutional neural network that characterizes the human vision response in order to improve face classification accuracy. The proposed model achieved a mean F1 and Matthews Correlation Coefficient (MCC) of 0.92 and 0.84, respectively, on the validation set, outperforming the traditional Convolutional Neural Network (CNN). The CNN-Contoured Face (CNN-FC) model was developed and trained on contoured face images to investigate the influence of face shape. Finally, to cross-validate the accuracy of these models, the traditional CNN model was trained on the same dataset. With an accuracy of 92.98%, the Modified-CNN (M-CNN) model has demonstrated that the proposed method could have a tangible impact on intra-classification problems. A novel Indian regional face dataset was created to support this supervised classification work, and it will be made available to the research community.
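For readers unfamiliar with the two reported metrics, the sketch below computes F1 and MCC from the counts of a binary confusion matrix. The counts in the usage example are made up for illustration and are not the paper's data. The abstract reports a mean F1, which presumably averages over classes; this sketch shows the single-class case only.

```python
import math


def f1_score(tp, fp, fn):
    """F1 = 2*TP / (2*TP + FP + FN), the harmonic mean of precision and recall."""
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient, ranging from -1 to 1."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return (tp * tn - fp * fn) / denom if denom else 0.0


if __name__ == "__main__":
    # Hypothetical counts for a two-region classifier.
    print(f1_score(tp=45, fp=5, fn=5))
    print(mcc(tp=45, tn=45, fp=5, fn=5))
```

MCC uses all four cells of the confusion matrix, so it stays informative when the two classes are unbalanced, which F1 alone does not guarantee.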
Abstract: Prompt radiation emitted during accelerator operation poses a significant health risk, necessitating a thorough search and securing of hazardous areas prior to initiation. Currently, manual sweep methods are employed. However, the limitations of manual sweeps have become increasingly evident with the implementation of large-scale accelerators. By leveraging advancements in machine vision technology, the automatic identification of stranded personnel in controlled areas through camera imagery presents a viable solution for efficient search and security. Given the criticality of personal safety for stranded individuals, search and security processes must be sufficiently reliable. To ensure comprehensive coverage, 180° camera groups were strategically positioned on both sides of the accelerator tunnel to eliminate blind spots within the monitoring range. The YOLOv8 network model was modified to enable the detection of small targets, such as hands and feet, as well as larger targets formed by individuals near the cameras. Furthermore, the system incorporates a pedestrian recognition model that detects human body parts, and an information fusion strategy is used to integrate the detected head, hands, and feet with the identified pedestrians as a cohesive unit. This strategy enhanced the capability of the model to identify pedestrians obstructed by equipment, resulting in a notable improvement in the recall rate. Specifically, recall rates of 0.915 and 0.82 were obtained for Datasets 1 and 2, respectively. Although there was a slight decrease in accuracy, it aligned with the intended purpose of the search-and-secure software design. Experimental tests conducted within an accelerator tunnel demonstrated the effectiveness of this approach in achieving reliable recognition outcomes.
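One way to picture the information fusion strategy is as a grouping step over bounding boxes. The sketch below is a hypothetical illustration, not the paper's algorithm: it attaches each detected body part to the pedestrian box that contains the part's center, and keeps unmatched parts as evidence that a person may be hidden behind equipment. The box format (x1, y1, x2, y2), the function names, and the containment rule are all assumptions.

```python
def center(box):
    """Center point of an (x1, y1, x2, y2) box."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2, (y1 + y2) / 2)


def contains(box, point):
    """True if the point lies inside or on the edge of the box."""
    x1, y1, x2, y2 = box
    return x1 <= point[0] <= x2 and y1 <= point[1] <= y2


def fuse(pedestrians, parts):
    """Group part detections with pedestrians; return (people, orphan_parts).

    parts is a list of (label, box) pairs, e.g. ("hand", (x1, y1, x2, y2)).
    """
    people = [{"box": box, "parts": []} for box in pedestrians]
    orphans = []
    for label, box in parts:
        c = center(box)
        owner = next((p for p in people if contains(p["box"], c)), None)
        if owner is not None:
            owner["parts"].append(label)
        else:
            orphans.append((label, box))
    return people, orphans


def area_occupied(pedestrians, parts):
    """Treat any pedestrian or any unmatched body part as a possible person."""
    people, orphans = fuse(pedestrians, parts)
    return bool(people) or bool(orphans)


if __name__ == "__main__":
    peds = [(0, 0, 100, 200)]
    parts = [("head", (40, 10, 60, 30)), ("hand", (300, 100, 320, 120))]
    print(fuse(peds, parts))
    print(area_occupied([], [("foot", (5, 5, 10, 10))]))
```

Counting an unmatched hand or foot as a possible person favors recall over precision, which matches the trade-off the abstract describes for search-and-secure use.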
Funding: Supported by the Medical Special Cultivation Project of Anhui University of Science and Technology (Grant No. YZ2023H2B013), the Anhui Provincial Key Research and Development Project (Grant No. 2022i01020015), and the Open Project of Key Laboratory of Conveyance Equipment (East China Jiaotong University), Ministry of Education (KLCE2022-01).
Abstract: Human pose is estimated using a transformer-based, multi-branch, multidimensional directed three-dimensional (3D) method that accounts for self-occlusion, ill-posedness, and the lack of depth data in per-frame 3D pose estimation when lifting two-dimensional (2D) mappings to 3D. First, by examining the relationship between the movements of different bones in the human body, four virtual skeletons are proposed to enhance the cyclic constraints of limb joints. Then, multiple parameters describing the skeleton are fused and projected into a high-dimensional space. Using a multi-branch network, motion features between bones and overall motion features are extracted to mitigate the drift error in the estimation results. Furthermore, the estimated relative depth is projected into 3D space, and the error is calculated against real 3D data, forming a loss function along with the relative depth error. This article adopts the average joint pixel error as the primary performance metric. Compared to the benchmark approach, the results indicate an improvement in average precision of 1.8 mm on the Human3.6M dataset.
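The loss described above combines a 3D joint error with a relative depth error. The sketch below shows one plausible form of such a combination, assuming joints are given as (x, y, z) tuples and relative depths as plain numbers. The averaging, the absolute-difference depth term, and the `depth_weight` parameter are assumptions made for illustration; the abstract does not specify them.

```python
import math


def mean_joint_error(pred, target):
    """Average Euclidean distance between matching 3D joints."""
    return sum(math.dist(p, t) for p, t in zip(pred, target)) / len(target)


def combined_loss(pred_3d, true_3d, pred_depth, true_depth, depth_weight=1.0):
    """Joint error plus a weighted mean absolute relative-depth error.

    depth_weight is a hypothetical balancing factor, not taken from the source.
    """
    depth_err = sum(abs(p - t) for p, t in zip(pred_depth, true_depth)) / len(true_depth)
    return mean_joint_error(pred_3d, true_3d) + depth_weight * depth_err


if __name__ == "__main__":
    # Two made-up joints: one exact, one off by a 3-4-5 triangle.
    pred = [(0.0, 0.0, 0.0), (3.0, 4.0, 0.0)]
    true = [(0.0, 0.0, 0.0), (0.0, 0.0, 0.0)]
    print(mean_joint_error(pred, true))
    print(combined_loss(pred, true, [1.0, 2.0], [1.5, 2.0]))
```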
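Abstract: Research on objective image quality assessment methods plays a decisive role in turning image quality assessment into an instrumented process. Building on an analysis of feature similarity (FSIM), a recent full-reference image quality assessment algorithm, this work proposes an improved FSIM method that uses a contrast sensitivity function (CSF) operator and the contrast masking effect in the discrete cosine transform (DCT) domain. The method retains the simplicity and efficiency of FSIM while fully reflecting the characteristics of human vision, and therefore better matches subjective human perception. Experimental results on the LIVE (Laboratory for Image and Video Engineering) test dataset show that the method outperforms other traditional image quality assessment algorithms on the correlation coefficient after nonlinear regression, the Spearman correlation coefficient, and the outlier ratio.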
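For context, the pooling step of the baseline FSIM index can be sketched as follows. This is a minimal sketch, assuming the phase congruency and gradient magnitude maps have already been computed and flattened into lists. It does not include the CSF operator or the DCT-domain masking that the improved method adds, since the abstract gives no details of them. The default constants `t1` and `t2` are values commonly quoted for FSIM and should be treated as assumptions.

```python
def similarity(a, b, t):
    """Similarity of two non-negative values; equals 1 when a == b."""
    return (2 * a * b + t) / (a * a + b * b + t)


def fsim_pool(pc1, pc2, g1, g2, t1=0.85, t2=160.0):
    """Pool per-pixel similarities, weighted by the larger phase congruency.

    pc1, pc2: phase congruency maps of the two images (flattened lists).
    g1, g2:   gradient magnitude maps of the two images (flattened lists).
    """
    num = 0.0
    den = 0.0
    for p1, p2, m1, m2 in zip(pc1, pc2, g1, g2):
        weight = max(p1, p2)
        num += similarity(p1, p2, t1) * similarity(m1, m2, t2) * weight
        den += weight
    return num / den if den else 1.0


if __name__ == "__main__":
    # Made-up feature maps: identical inputs score 1, differing inputs score less.
    print(fsim_pool([0.5, 0.2], [0.5, 0.2], [10.0, 30.0], [10.0, 30.0]))
    print(fsim_pool([0.5, 0.2], [0.1, 0.2], [10.0, 30.0], [50.0, 30.0]))
```

Weighting by phase congruency means pixels with strong structural features contribute most to the final score.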