Multimodal sentiment analysis is an essential area of research in artificial intelligence that combines multiple modes,such as text and image,to accurately assess sentiment.However,conventional approaches that rely on...Multimodal sentiment analysis is an essential area of research in artificial intelligence that combines multiple modes,such as text and image,to accurately assess sentiment.However,conventional approaches that rely on unimodal pre-trained models for feature extraction from each modality often overlook the intrinsic connections of semantic information between modalities.This limitation is attributed to their training on unimodal data,and necessitates the use of complex fusion mechanisms for sentiment analysis.In this study,we present a novel approach that combines a vision-language pre-trained model with a proposed multimodal contrastive learning method.Our approach harnesses the power of transfer learning by utilizing a vision-language pre-trained model to extract both visual and textual representations in a unified framework.We employ a Transformer architecture to integrate these representations,thereby enabling the capture of rich semantic infor-mation in image-text pairs.To further enhance the representation learning of these pairs,we introduce our proposed multimodal contrastive learning method,which leads to improved performance in sentiment analysis tasks.Our approach is evaluated through extensive experiments on two publicly accessible datasets,where we demonstrate its effectiveness.We achieve a significant improvement in sentiment analysis accuracy,indicating the supe-riority of our approach over existing techniques.These results highlight the potential of multimodal sentiment analysis and underscore the importance of considering the intrinsic semantic connections between modalities for accurate sentiment assessment.展开更多
A document layout can be more informative than merely a document’s visual and structural appearance.Thus,document layout analysis(DLA)is considered a necessary prerequisite for advanced processing and detailed docume...A document layout can be more informative than merely a document’s visual and structural appearance.Thus,document layout analysis(DLA)is considered a necessary prerequisite for advanced processing and detailed document image analysis to be further used in several applications and different objectives.This research extends the traditional approaches of DLA and introduces the concept of semantic document layout analysis(SDLA)by proposing a novel framework for semantic layout analysis and characterization of handwritten manuscripts.The proposed SDLA approach enables the derivation of implicit information and semantic characteristics,which can be effectively utilized in dozens of practical applications for various purposes,in a way bridging the semantic gap and providingmore understandable high-level document image analysis and more invariant characterization via absolute and relative labeling.This approach is validated and evaluated on a large dataset ofArabic handwrittenmanuscripts comprising complex layouts.The experimental work shows promising results in terms of accurate and effective semantic characteristic-based clustering and retrieval of handwritten manuscripts.It also indicates the expected efficacy of using the capabilities of the proposed approach in automating and facilitating many functional,reallife tasks such as effort estimation and pricing of transcription or typing of such complex manuscripts.展开更多
This paper proposes the analysis model of sports human body based on computer vision tracking technology. Visual target tracking is an important research field of the computer vision, motion trajectory and it can prov...This paper proposes the analysis model of sports human body based on computer vision tracking technology. Visual target tracking is an important research field of the computer vision, motion trajectory and it can provide not only the goal, to provide the initial data movement analysis, scene understanding, behavior or the event detection in intelligent surveillance, human-computer interaction, robot visual navigation and motion recognition based on field has a broad application prospect. For this reason, it is possible to consider the use of a large number of unlabeled samples to assist the training classifier to improve its performance. This type of machine learning method using both labeled and that unlabeled samples is called the semi-supervised learning. This paper proposes the novel idea of the related research topics to propose the new perspective of the model that will be later give us the novel idea of making it efficient for further development of sport science.展开更多
Cardiovascular disease is the leading cause of death globally.This disease causes loss of heart muscles and is also responsible for the death of heart cells,sometimes damaging their functionality.A person’s life may ...Cardiovascular disease is the leading cause of death globally.This disease causes loss of heart muscles and is also responsible for the death of heart cells,sometimes damaging their functionality.A person’s life may depend on receiving timely assistance as soon as possible.Thus,minimizing the death ratio can be achieved by early detection of heart attack(HA)symptoms.In the United States alone,an estimated 610,000 people die fromheart attacks each year,accounting for one in every four fatalities.However,by identifying and reporting heart attack symptoms early on,it is possible to reduce damage and save many lives significantly.Our objective is to devise an algorithm aimed at helping individuals,particularly elderly individuals living independently,to safeguard their lives.To address these challenges,we employ deep learning techniques.We have utilized a vision transformer(ViT)to address this problem.However,it has a significant overhead cost due to its memory consumption and computational complexity because of scaling dot-product attention.Also,since transformer performance typically relies on large-scale or adequate data,adapting ViT for smaller datasets is more challenging.In response,we propose a three-in-one steam model,theMulti-Head Attention Vision Hybrid(MHAVH).Thismodel integrates a real-time posture recognition framework to identify chest pain postures indicative of heart attacks using transfer learning techniques,such as ResNet-50 and VGG-16,renowned for their robust feature extraction capabilities.By incorporatingmultiple heads into the vision transformer to generate additional metrics and enhance heart-detection capabilities,we leverage a 2019 posture-based dataset comprising RGB images,a novel creation by the author that marks the first dataset tailored for posture-based heart attack detection.Given the limited online data availability,we segmented this dataset into gender categories(male and female)and conducted testing on both segmented and original datasets.The training accuracy of our model reached an impressive 99.77%.Upon testing,the accuracy for male and female datasets was recorded at 92.87%and 75.47%,respectively.The combined dataset accuracy is 93.96%,showcasing a commendable performance overall.Our proposed approach demonstrates versatility in accommodating small and large datasets,offering promising prospects for real-world applications.展开更多
A new approach based on stereo vision technology is introduced to analyzesheet metal deformation. By measuring the deformed circle grids that are printed on the sheetsurface before forming, the strain distribution of ...A new approach based on stereo vision technology is introduced to analyzesheet metal deformation. By measuring the deformed circle grids that are printed on the sheetsurface before forming, the strain distribution of the workpiece is obtained. The measurement andanalysis results can be used to verify numerical simulation results and guide production. To getgood accuracy, some new techniques are employed: camera calibration based on genetic algorithm,feature abstraction based on self-adaptive technology, image matching based on structure feature andcamera modeling pre-constrains, and parameter calculation based on curve and surface optimization.The experimental values show that the approach proposed is rational and practical, which can providebetter measurement accuracy with less time than the conventional method.展开更多
Cold Workability limits of Brass were studied as a function of friction, aspect ratio and specimen geometry. Five standard shapes of the axis symmetric specimens of cylindrical with aspect ratios 1.0 and 1.5, ring, ta...Cold Workability limits of Brass were studied as a function of friction, aspect ratio and specimen geometry. Five standard shapes of the axis symmetric specimens of cylindrical with aspect ratios 1.0 and 1.5, ring, tapered and flanged were selected for the present investigation. Specimens were deformed in compression between two flat platens to predict the metal flow at room temperature. The longitudinal and oblique cracks were obtained as the two major modes of surface fractures. Cylindrical and ring specimen shows the oblique surface crack while the tapered and flanged shows the longitudinal crack. Machine Vision system using PC based video recording with a CCD camera was used to analyze the deformation of 4 X 4 mm square grid marked at mid plane of the specimen. The strain paths obtained from different specimens exhibited nonlinearity from the beginning to the end of the strain path. The circumferential stress component Os increasingly becomes tensile with continued deformation. On the other hand the axial stress Oz , increased in the very initial stages of deformation but started becoming less compressive immediately as barreling develops. The nature of hydrostatic stress on the rim of the flanged specimen was found to be tensile. Finite element software ANSYS has been applied for the analysis of the upset forming process. When the stress values obtained from finite element analysis were compared to the measurements of grids using Machine Vision system it was found that they were in close proximity.展开更多
AIM: To evaluate the visual outcomes of Contoura Vision(CV) with automatic eye tracking system in eyes with myopia and myopic astigmatism.METHODS: This prospective study included 160 eyes(80 patients) with moderate my...AIM: To evaluate the visual outcomes of Contoura Vision(CV) with automatic eye tracking system in eyes with myopia and myopic astigmatism.METHODS: This prospective study included 160 eyes(80 patients) with moderate myopia and irregular astigmatism between January and August 2018. Subjects were randomly divided into CV group(80 eyes) that under went CV femtosecond laser-assisted in situ keratomileusis(FS-LASIK) and a control group(80 eyes) that underwent wavefrontoptimized FS-LASIK. Visual outcomes and astigmatic vector analysis were evaluated and compared between preoperatively and 3 mo postoperatively. RESULTS: Basic details were similar in both groups(P>0.05). At 3 mo postoperatively, uncorrected distance visual acuity was 20/16, 20/20, and 20/25 in 24, 76, and 80 eyes of patients in CV group, respectively. The CV group was better in predictability of astigmatism correction at 3 mo postoperatively. In CV group, 64 eyes had deviation of astigmatic axis within 15° and 28 eyes had deviation of astigmatic axis within 5°, both were better than those in the control group. The number of eyes with residual astigmatism within 0.5 D were less in CV group(48 eyes) than the control group(40 eyes). Compared with the preoperative, C7 significantly reduced to 0.056±0.030 in CV group at 3 mo after the procedure(P<0.05), and were significantly lower than those in the control group(P<0.05).CONCLUSION: CV with automatic eye tracking system is safe and effective for the correction of myopia and myopic astigmatism.展开更多
Detecting various parameters of woven fabrics is one of the important methods to evaluate the quality of fabrics.In the early stage of industrial development,fabrics were mainly relied on manual to determine the quali...Detecting various parameters of woven fabrics is one of the important methods to evaluate the quality of fabrics.In the early stage of industrial development,fabrics were mainly relied on manual to determine the quality,which was inefficient and unstable,so intelligent inspection is a popular development trend today.In recent years,computer vision technology has been widely used in the fields of fabric density measurement,color analysis,and weave pattern recognition.Based on the above three aspects,the advanced research progress of global researchers is reviewed in this paper and the shortcomings of current research and possible research directions in the future are analyzed.Computer vision technology is not only objective evaluation,but also has the advantages of accuracy and efficiency,and has a good development prospect in the field of textiles.展开更多
A slight uneven settlement of the foundation may cause the wind turbine to shake,tilt,or even collapse,so it is increasingly necessary to realize remote condition monitoring of the foundations.At present,the wind turb...A slight uneven settlement of the foundation may cause the wind turbine to shake,tilt,or even collapse,so it is increasingly necessary to realize remote condition monitoring of the foundations.At present,the wind turbine foundation monitoring system is incomplete.The current monitoring research of the tower foundation is mainly of contact measurements,using acceleration sensors and static-level sensors for monitoring multiple reference points.Such monitoring methods will face some disadvantages,such as the complexity of monitoring deployment,the cost of manpower,and the load effect on the tower structure.To solve above issues,this paper aims to investigate wind turbine tower foundation variation dynamic monitoring based on machine vision.Machine vision monitoring is a kind of noncontact measurement,which helps to realize comprehensive diagnosis of early foundation uneven settlement and loose faults.The FEA model is firstly investigated as the theoretical foundation to investigate the dynamics of the tower foundation.Second,the Gaussian-based vibration detection is adopted by tracking the tower edge points.Finally,a tower structure with distributed foundation support is tested.The modal parameters obtained from the visual measurement are compared with those from the accelerometer,proving the vision method can effectively monitor the issues with tower foundation changes.展开更多
Fish behavior refers to various movements of fish. Fish behavior is closely related to the ecology of fish, physiological changes of fish, aquaculture and so on. Related applications will be expanded if fish behavior ...Fish behavior refers to various movements of fish. Fish behavior is closely related to the ecology of fish, physiological changes of fish, aquaculture and so on. Related applications will be expanded if fish behavior is analyzed properly. Traditional analysis of fish behavior mainly relies on the observation of human eyes. With the deepening and extension of application and the rapid development of computer technology, computer vision technology is increasingly used to analyze fish behaviors. This paper summarized the research status, research progress and main problems of fish behavior analysis by using computer vision and made forecast about future research.展开更多
It is important to segment image correctly to extract guidance information for automatic agriculture vehicle. If we can make the computer know where the crops are, we can extract the guidance line easily. Images were ...It is important to segment image correctly to extract guidance information for automatic agriculture vehicle. If we can make the computer know where the crops are, we can extract the guidance line easily. Images were divided into some rec-tangle small windows, then a pair of 1-D arrays was constructed in each small windows. The correlation coefficients of every small window constructed the features to segment images. The results showed that correlation analysis is a potential approach for processing complex farmland for guidance system, and more correlation analysis methods must be researched.展开更多
AIM:To investigate the prevalence of color vision deficiency(CVD)among college students and their quality of life(QoL)in a Chinese college.METHODS:This cross-sectional study was performed in Sichuan University in Chen...AIM:To investigate the prevalence of color vision deficiency(CVD)among college students and their quality of life(QoL)in a Chinese college.METHODS:This cross-sectional study was performed in Sichuan University in Chengdu,China.The questionnaire containing participants’demographic data,as well as CVD related QoL was distributed to students who were screened as CVD[by Color Vision Examination Plates(Version 6)]in 2022 freshman entrance examination.Color blindness QoL(CBQoL)and utility analysis were used to evaluate the QoL of CVD students.RESULTS:There were 381 of 17303(2.20%)students screened as CVD,including 368(4.11%)males and 13(0.16%)females.A total of 317 students completed the questionnaire,the response rate was 83.20%.Only 166 participants(52.3%)knew they have CVD before the physical examination for college entrance examination,while a total of 145 participants(45.74%)hoped to be diagnosed earlier.The medians of CBQoL score and utility were 5.85(range 2.2-6)and 1(range 0-1),respectively.The proportions of students whose QoL is affected by CVD were 67.63%(211/312)and 42.27%(134/317)measured by CBQoL and utility analysis respectively.CONCLUSION:The prevalence of CVD in males is much higher than that in females.The time when CVD students first became aware of their CVD is relatively late.The QoL of the study population is quite high,while a large proportion of students’QoL are affected by CVD.It is suggested that as a congenital defect,CVD screening in China should be earlier,and appropriate guidance and support are needed for CVD patients to help them better adapt to life,study,and work.展开更多
The advent of the COVID-19 pandemic has adversely affected the entire world and has put forth high demand for techniques that remotely manage crowd-related tasks.Video surveillance and crowd management using video ana...The advent of the COVID-19 pandemic has adversely affected the entire world and has put forth high demand for techniques that remotely manage crowd-related tasks.Video surveillance and crowd management using video analysis techniques have significantly impacted today’s research,and numerous applications have been developed in this domain.This research proposed an anomaly detection technique applied to Umrah videos in Kaaba during the COVID-19 pandemic through sparse crowd analysis.Managing theKaaba rituals is crucial since the crowd gathers from around the world and requires proper analysis during these days of the pandemic.The Umrah videos are analyzed,and a system is devised that can track and monitor the crowd flow in Kaaba.The crowd in these videos is sparse due to the pandemic,and we have developed a technique to track the maximum crowd flow and detect any object(person)moving in the direction unlikely of the major flow.We have detected abnormal movement by creating the histograms for the vertical and horizontal flows and applying thresholds to identify the non-majority flow.Our algorithm aims to analyze the crowd through video surveillance and timely detect any abnormal activity tomaintain a smooth crowd flowinKaaba during the pandemic.展开更多
Tomato crops are considered the most important agricultural products worldwide.However,the quality of tomatoes depends mainly on the nutrient levels.Visual inspection is made by farmers to anticipate the nutrient defi...Tomato crops are considered the most important agricultural products worldwide.However,the quality of tomatoes depends mainly on the nutrient levels.Visual inspection is made by farmers to anticipate the nutrient deficiency of the plants.Recently,precision agriculture has explored opportunities to automate nutrient level monitoring.Previous work has demonstrated that a convolutional neural network is able to estimate low nutrients in tomato plants using images of their leaves.However,the performance of the convolutional neural network was not adequate.Thus,this work proposes a novel convolutional neural network-based classifier,namely,CNN+AHN,for estimating low nutrients in tomato crops using an image of the tomato leaves.The CNN+AHN incorporates a set of convolutional layers as the feature extraction part,and a supervised learning method called artificial hydrocarbon network as the dense layer.Different combinations of the architecture of CNN+AHN were examined.Experimental results showed that our best CNN+AHN classifier is able to estimate low nutrients in tomato plants with an accuracy of 95.57%and F1-score of 95.75%,outperforming the literature.展开更多
The inter-class face classification problem is more reasonable than the intra-class classification problem.To address this issue,we have carried out empirical research on classifying Indian people to their geographica...The inter-class face classification problem is more reasonable than the intra-class classification problem.To address this issue,we have carried out empirical research on classifying Indian people to their geographical regions.This work aimed to construct a computational classification model for classifying Indian regional face images acquired from south and east regions of India,referring to human vision.We have created an Automated Human Intelligence System(AHIS)to evaluate human visual capabilities.Analysis of AHIS response showed that face shape is a discriminative feature among the other facial features.We have developed a modified convolutional neural network to characterize the human vision response to improve face classification accuracy.The proposed model achieved mean F1 and Matthew Correlation Coefficient(MCC)of 0.92 and 0.84,respectively,on the validation set,outperforming the traditional Convolutional Neural Network(CNN).The CNN-Contoured Face(CNN-FC)model is developed to train contoured face images to investigate the influence of face shape.Finally,to cross-validate the accuracy of these models,the traditional CNN model is trained on the same dataset.With an accuracy of 92.98%,the Modified-CNN(M-CNN)model has demonstrated that the proposed method could facilitate the tangible impact in intra-classification problems.A novel Indian regional face dataset is created for supporting this supervised classification work,and it will be available to the research community.展开更多
Improved picture quality is critical to the effectiveness of object recog-nition and tracking.The consistency of those photos is impacted by night-video systems because the contrast between high-profile items and diffe...Improved picture quality is critical to the effectiveness of object recog-nition and tracking.The consistency of those photos is impacted by night-video systems because the contrast between high-profile items and different atmospheric conditions,such as mist,fog,dust etc.The pictures then shift in intensity,colour,polarity and consistency.A general challenge for computer vision analyses lies in the horrid appearance of night images in arbitrary illumination and ambient envir-onments.In recent years,target recognition techniques focused on deep learning and machine learning have become standard algorithms for object detection with the exponential growth of computer performance capabilities.However,the iden-tification of objects in the night world also poses further problems because of the distorted backdrop and dim light.The Correlation aware LSTM based YOLO(You Look Only Once)classifier method for exact object recognition and deter-mining its properties under night vision was a major inspiration for this work.In order to create virtual target sets similar to daily environments,we employ night images as inputs;and to obtain high enhanced image using histogram based enhancement and iterative wienerfilter for removing the noise in the image.The process of the feature extraction and feature selection was done for electing the potential features using the Adaptive internal linear embedding(AILE)and uplift linear discriminant analysis(ULDA).The region of interest mask can be segmen-ted using the Recurrent-Phase Level set Segmentation.Finally,we use deep con-volution feature fusion and region of interest pooling to integrate the presently extremely sophisticated quicker Long short term memory based(LSTM)with YOLO method for object tracking system.A range of experimentalfindings demonstrate that our technique achieves high average accuracy with a precision of 99.7%for object detection of SSAN datasets that is considerably more than that of the other standard object detection mechanism.Our approach may therefore satisfy the true demands of night scene target detection applications.We very much believe that our method will help future research.展开更多
基金supported by Science and Technology Research Project of Jiangxi Education Department.Project Grant No.GJJ2203306.
文摘Multimodal sentiment analysis is an essential area of research in artificial intelligence that combines multiple modes,such as text and image,to accurately assess sentiment.However,conventional approaches that rely on unimodal pre-trained models for feature extraction from each modality often overlook the intrinsic connections of semantic information between modalities.This limitation is attributed to their training on unimodal data,and necessitates the use of complex fusion mechanisms for sentiment analysis.In this study,we present a novel approach that combines a vision-language pre-trained model with a proposed multimodal contrastive learning method.Our approach harnesses the power of transfer learning by utilizing a vision-language pre-trained model to extract both visual and textual representations in a unified framework.We employ a Transformer architecture to integrate these representations,thereby enabling the capture of rich semantic infor-mation in image-text pairs.To further enhance the representation learning of these pairs,we introduce our proposed multimodal contrastive learning method,which leads to improved performance in sentiment analysis tasks.Our approach is evaluated through extensive experiments on two publicly accessible datasets,where we demonstrate its effectiveness.We achieve a significant improvement in sentiment analysis accuracy,indicating the supe-riority of our approach over existing techniques.These results highlight the potential of multimodal sentiment analysis and underscore the importance of considering the intrinsic semantic connections between modalities for accurate sentiment assessment.
基金This research was supported and funded by KAU Scientific Endowment,King Abdulaziz University,Jeddah,Saudi Arabia.
文摘A document layout can be more informative than merely a document’s visual and structural appearance.Thus,document layout analysis(DLA)is considered a necessary prerequisite for advanced processing and detailed document image analysis to be further used in several applications and different objectives.This research extends the traditional approaches of DLA and introduces the concept of semantic document layout analysis(SDLA)by proposing a novel framework for semantic layout analysis and characterization of handwritten manuscripts.The proposed SDLA approach enables the derivation of implicit information and semantic characteristics,which can be effectively utilized in dozens of practical applications for various purposes,in a way bridging the semantic gap and providingmore understandable high-level document image analysis and more invariant characterization via absolute and relative labeling.This approach is validated and evaluated on a large dataset ofArabic handwrittenmanuscripts comprising complex layouts.The experimental work shows promising results in terms of accurate and effective semantic characteristic-based clustering and retrieval of handwritten manuscripts.It also indicates the expected efficacy of using the capabilities of the proposed approach in automating and facilitating many functional,reallife tasks such as effort estimation and pricing of transcription or typing of such complex manuscripts.
文摘This paper proposes the analysis model of sports human body based on computer vision tracking technology. Visual target tracking is an important research field of the computer vision, motion trajectory and it can provide not only the goal, to provide the initial data movement analysis, scene understanding, behavior or the event detection in intelligent surveillance, human-computer interaction, robot visual navigation and motion recognition based on field has a broad application prospect. For this reason, it is possible to consider the use of a large number of unlabeled samples to assist the training classifier to improve its performance. This type of machine learning method using both labeled and that unlabeled samples is called the semi-supervised learning. This paper proposes the novel idea of the related research topics to propose the new perspective of the model that will be later give us the novel idea of making it efficient for further development of sport science.
基金Researchers Supporting Project Number(RSPD2024R576),King Saud University,Riyadh,Saudi Arabia。
文摘Cardiovascular disease is the leading cause of death globally.This disease causes loss of heart muscles and is also responsible for the death of heart cells,sometimes damaging their functionality.A person’s life may depend on receiving timely assistance as soon as possible.Thus,minimizing the death ratio can be achieved by early detection of heart attack(HA)symptoms.In the United States alone,an estimated 610,000 people die fromheart attacks each year,accounting for one in every four fatalities.However,by identifying and reporting heart attack symptoms early on,it is possible to reduce damage and save many lives significantly.Our objective is to devise an algorithm aimed at helping individuals,particularly elderly individuals living independently,to safeguard their lives.To address these challenges,we employ deep learning techniques.We have utilized a vision transformer(ViT)to address this problem.However,it has a significant overhead cost due to its memory consumption and computational complexity because of scaling dot-product attention.Also,since transformer performance typically relies on large-scale or adequate data,adapting ViT for smaller datasets is more challenging.In response,we propose a three-in-one steam model,theMulti-Head Attention Vision Hybrid(MHAVH).Thismodel integrates a real-time posture recognition framework to identify chest pain postures indicative of heart attacks using transfer learning techniques,such as ResNet-50 and VGG-16,renowned for their robust feature extraction capabilities.By incorporatingmultiple heads into the vision transformer to generate additional metrics and enhance heart-detection capabilities,we leverage a 2019 posture-based dataset comprising RGB images,a novel creation by the author that marks the first dataset tailored for posture-based heart attack detection.Given the limited online data availability,we segmented this dataset into gender categories(male and female)and conducted testing on both segmented and original datasets.The training accuracy of our model reached an impressive 99.77%.Upon testing,the accuracy for male and female datasets was recorded at 92.87%and 75.47%,respectively.The combined dataset accuracy is 93.96%,showcasing a commendable performance overall.Our proposed approach demonstrates versatility in accommodating small and large datasets,offering promising prospects for real-world applications.
文摘A new approach based on stereo vision technology is introduced to analyzesheet metal deformation. By measuring the deformed circle grids that are printed on the sheetsurface before forming, the strain distribution of the workpiece is obtained. The measurement andanalysis results can be used to verify numerical simulation results and guide production. To getgood accuracy, some new techniques are employed: camera calibration based on genetic algorithm,feature abstraction based on self-adaptive technology, image matching based on structure feature andcamera modeling pre-constrains, and parameter calculation based on curve and surface optimization.The experimental values show that the approach proposed is rational and practical, which can providebetter measurement accuracy with less time than the conventional method.
文摘Cold Workability limits of Brass were studied as a function of friction, aspect ratio and specimen geometry. Five standard shapes of the axis symmetric specimens of cylindrical with aspect ratios 1.0 and 1.5, ring, tapered and flanged were selected for the present investigation. Specimens were deformed in compression between two flat platens to predict the metal flow at room temperature. The longitudinal and oblique cracks were obtained as the two major modes of surface fractures. Cylindrical and ring specimen shows the oblique surface crack while the tapered and flanged shows the longitudinal crack. Machine Vision system using PC based video recording with a CCD camera was used to analyze the deformation of 4 X 4 mm square grid marked at mid plane of the specimen. The strain paths obtained from different specimens exhibited nonlinearity from the beginning to the end of the strain path. The circumferential stress component Os increasingly becomes tensile with continued deformation. On the other hand the axial stress Oz , increased in the very initial stages of deformation but started becoming less compressive immediately as barreling develops. The nature of hydrostatic stress on the rim of the flanged specimen was found to be tensile. Finite element software ANSYS has been applied for the analysis of the upset forming process. When the stress values obtained from finite element analysis were compared to the measurements of grids using Machine Vision system it was found that they were in close proximity.
文摘AIM: To evaluate the visual outcomes of Contoura Vision(CV) with automatic eye tracking system in eyes with myopia and myopic astigmatism.METHODS: This prospective study included 160 eyes(80 patients) with moderate myopia and irregular astigmatism between January and August 2018. Subjects were randomly divided into CV group(80 eyes) that under went CV femtosecond laser-assisted in situ keratomileusis(FS-LASIK) and a control group(80 eyes) that underwent wavefrontoptimized FS-LASIK. Visual outcomes and astigmatic vector analysis were evaluated and compared between preoperatively and 3 mo postoperatively. RESULTS: Basic details were similar in both groups(P>0.05). At 3 mo postoperatively, uncorrected distance visual acuity was 20/16, 20/20, and 20/25 in 24, 76, and 80 eyes of patients in CV group, respectively. The CV group was better in predictability of astigmatism correction at 3 mo postoperatively. In CV group, 64 eyes had deviation of astigmatic axis within 15° and 28 eyes had deviation of astigmatic axis within 5°, both were better than those in the control group. The number of eyes with residual astigmatism within 0.5 D were less in CV group(48 eyes) than the control group(40 eyes). Compared with the preoperative, C7 significantly reduced to 0.056±0.030 in CV group at 3 mo after the procedure(P<0.05), and were significantly lower than those in the control group(P<0.05).CONCLUSION: CV with automatic eye tracking system is safe and effective for the correction of myopia and myopic astigmatism.
基金National Natural Science Foundation of China(No.61876106)Shanghai Natural Science Foundation of China(No.18ZR1416600)+1 种基金Shanghai Local Capacity-Building Project,China(No.19030501200)Zhihong Scholars Plan of Shanghai University of Engineering Science,China(No.2018RC032017)。
文摘Detecting various parameters of woven fabrics is one of the important methods to evaluate the quality of fabrics.In the early stage of industrial development,fabrics were mainly relied on manual to determine the quality,which was inefficient and unstable,so intelligent inspection is a popular development trend today.In recent years,computer vision technology has been widely used in the fields of fabric density measurement,color analysis,and weave pattern recognition.Based on the above three aspects,the advanced research progress of global researchers is reviewed in this paper and the shortcomings of current research and possible research directions in the future are analyzed.Computer vision technology is not only objective evaluation,but also has the advantages of accuracy and efficiency,and has a good development prospect in the field of textiles.
基金the support of the National Natural Science Foundation of China(NSFC)(62076029)Guangdong provincial base platforms and major scientific research project:Research on Remote Large Facility Condition Monitoring Method Based on Motion Amplification(ZX-2021-040)+1 种基金Major Scientific and Technological Project in the Inner Mongolia Autonomous Region(2023YFSW0003)the Guangdong Basic and Applied Basic Research Fund Offshore Wind Power Scheme-General Project under Grant 2022A1515240042.
文摘A slight uneven settlement of the foundation may cause the wind turbine to shake,tilt,or even collapse,so it is increasingly necessary to realize remote condition monitoring of the foundations.At present,the wind turbine foundation monitoring system is incomplete.The current monitoring research of the tower foundation is mainly of contact measurements,using acceleration sensors and static-level sensors for monitoring multiple reference points.Such monitoring methods will face some disadvantages,such as the complexity of monitoring deployment,the cost of manpower,and the load effect on the tower structure.To solve above issues,this paper aims to investigate wind turbine tower foundation variation dynamic monitoring based on machine vision.Machine vision monitoring is a kind of noncontact measurement,which helps to realize comprehensive diagnosis of early foundation uneven settlement and loose faults.The FEA model is firstly investigated as the theoretical foundation to investigate the dynamics of the tower foundation.Second,the Gaussian-based vibration detection is adopted by tracking the tower edge points.Finally,a tower structure with distributed foundation support is tested.The modal parameters obtained from the visual measurement are compared with those from the accelerometer,proving the vision method can effectively monitor the issues with tower foundation changes.
基金Guangdong Province Key Laboratory of Popular High Performance Computers (SZU-GDPHPCL201805)Institute of Marine Industry Technology of Universities in Liaoning Province (2018-CY-34)+2 种基金National Natural Science Foundation of China (61701070)Liaoning Doctoral Start-up Fund (20180540090)China Postdoctoral Science Foundation (2018M640239).
文摘Fish behavior refers to various movements of fish. Fish behavior is closely related to the ecology of fish, physiological changes of fish, aquaculture and so on. Related applications will be expanded if fish behavior is analyzed properly. Traditional analysis of fish behavior mainly relies on the observation of human eyes. With the deepening and extension of application and the rapid development of computer technology, computer vision technology is increasingly used to analyze fish behaviors. This paper summarized the research status, research progress and main problems of fish behavior analysis by using computer vision and made forecast about future research.
文摘It is important to segment image correctly to extract guidance information for automatic agriculture vehicle. If we can make the computer know where the crops are, we can extract the guidance line easily. Images were divided into some rec-tangle small windows, then a pair of 1-D arrays was constructed in each small windows. The correlation coefficients of every small window constructed the features to segment images. The results showed that correlation analysis is a potential approach for processing complex farmland for guidance system, and more correlation analysis methods must be researched.
文摘AIM:To investigate the prevalence of color vision deficiency(CVD)among college students and their quality of life(QoL)in a Chinese college.METHODS:This cross-sectional study was performed in Sichuan University in Chengdu,China.The questionnaire containing participants’demographic data,as well as CVD related QoL was distributed to students who were screened as CVD[by Color Vision Examination Plates(Version 6)]in 2022 freshman entrance examination.Color blindness QoL(CBQoL)and utility analysis were used to evaluate the QoL of CVD students.RESULTS:There were 381 of 17303(2.20%)students screened as CVD,including 368(4.11%)males and 13(0.16%)females.A total of 317 students completed the questionnaire,the response rate was 83.20%.Only 166 participants(52.3%)knew they have CVD before the physical examination for college entrance examination,while a total of 145 participants(45.74%)hoped to be diagnosed earlier.The medians of CBQoL score and utility were 5.85(range 2.2-6)and 1(range 0-1),respectively.The proportions of students whose QoL is affected by CVD were 67.63%(211/312)and 42.27%(134/317)measured by CBQoL and utility analysis respectively.CONCLUSION:The prevalence of CVD in males is much higher than that in females.The time when CVD students first became aware of their CVD is relatively late.The QoL of the study population is quite high,while a large proportion of students’QoL are affected by CVD.It is suggested that as a congenital defect,CVD screening in China should be earlier,and appropriate guidance and support are needed for CVD patients to help them better adapt to life,study,and work.
基金The authors extend their appreciation to the Deputyship for Research and Innovation,Ministry of Education in Saudi Arabia for funding this research work through the Project Number QURDO001Project title:Intelligent Real-Time Crowd Monitoring System Using Unmanned Aerial Vehicle(UAV)Video and Global Positioning Systems(GPS)Data。
文摘The advent of the COVID-19 pandemic has adversely affected the entire world and has put forth high demand for techniques that remotely manage crowd-related tasks.Video surveillance and crowd management using video analysis techniques have significantly impacted today’s research,and numerous applications have been developed in this domain.This research proposed an anomaly detection technique applied to Umrah videos in Kaaba during the COVID-19 pandemic through sparse crowd analysis.Managing theKaaba rituals is crucial since the crowd gathers from around the world and requires proper analysis during these days of the pandemic.The Umrah videos are analyzed,and a system is devised that can track and monitor the crowd flow in Kaaba.The crowd in these videos is sparse due to the pandemic,and we have developed a technique to track the maximum crowd flow and detect any object(person)moving in the direction unlikely of the major flow.We have detected abnormal movement by creating the histograms for the vertical and horizontal flows and applying thresholds to identify the non-majority flow.Our algorithm aims to analyze the crowd through video surveillance and timely detect any abnormal activity tomaintain a smooth crowd flowinKaaba during the pandemic.
文摘Tomato crops are considered the most important agricultural products worldwide.However,the quality of tomatoes depends mainly on the nutrient levels.Visual inspection is made by farmers to anticipate the nutrient deficiency of the plants.Recently,precision agriculture has explored opportunities to automate nutrient level monitoring.Previous work has demonstrated that a convolutional neural network is able to estimate low nutrients in tomato plants using images of their leaves.However,the performance of the convolutional neural network was not adequate.Thus,this work proposes a novel convolutional neural network-based classifier,namely,CNN+AHN,for estimating low nutrients in tomato crops using an image of the tomato leaves.The CNN+AHN incorporates a set of convolutional layers as the feature extraction part,and a supervised learning method called artificial hydrocarbon network as the dense layer.Different combinations of the architecture of CNN+AHN were examined.Experimental results showed that our best CNN+AHN classifier is able to estimate low nutrients in tomato plants with an accuracy of 95.57%and F1-score of 95.75%,outperforming the literature.
文摘The inter-class face classification problem is more reasonable than the intra-class classification problem.To address this issue,we have carried out empirical research on classifying Indian people to their geographical regions.This work aimed to construct a computational classification model for classifying Indian regional face images acquired from south and east regions of India,referring to human vision.We have created an Automated Human Intelligence System(AHIS)to evaluate human visual capabilities.Analysis of AHIS response showed that face shape is a discriminative feature among the other facial features.We have developed a modified convolutional neural network to characterize the human vision response to improve face classification accuracy.The proposed model achieved mean F1 and Matthew Correlation Coefficient(MCC)of 0.92 and 0.84,respectively,on the validation set,outperforming the traditional Convolutional Neural Network(CNN).The CNN-Contoured Face(CNN-FC)model is developed to train contoured face images to investigate the influence of face shape.Finally,to cross-validate the accuracy of these models,the traditional CNN model is trained on the same dataset.With an accuracy of 92.98%,the Modified-CNN(M-CNN)model has demonstrated that the proposed method could facilitate the tangible impact in intra-classification problems.A novel Indian regional face dataset is created for supporting this supervised classification work,and it will be available to the research community.
文摘Improved picture quality is critical to the effectiveness of object recog-nition and tracking.The consistency of those photos is impacted by night-video systems because the contrast between high-profile items and different atmospheric conditions,such as mist,fog,dust etc.The pictures then shift in intensity,colour,polarity and consistency.A general challenge for computer vision analyses lies in the horrid appearance of night images in arbitrary illumination and ambient envir-onments.In recent years,target recognition techniques focused on deep learning and machine learning have become standard algorithms for object detection with the exponential growth of computer performance capabilities.However,the iden-tification of objects in the night world also poses further problems because of the distorted backdrop and dim light.The Correlation aware LSTM based YOLO(You Look Only Once)classifier method for exact object recognition and deter-mining its properties under night vision was a major inspiration for this work.In order to create virtual target sets similar to daily environments,we employ night images as inputs;and to obtain high enhanced image using histogram based enhancement and iterative wienerfilter for removing the noise in the image.The process of the feature extraction and feature selection was done for electing the potential features using the Adaptive internal linear embedding(AILE)and uplift linear discriminant analysis(ULDA).The region of interest mask can be segmen-ted using the Recurrent-Phase Level set Segmentation.Finally,we use deep con-volution feature fusion and region of interest pooling to integrate the presently extremely sophisticated quicker Long short term memory based(LSTM)with YOLO method for object tracking system.A range of experimentalfindings demonstrate that our technique achieves high average accuracy with a precision of 99.7%for object detection of SSAN datasets that is considerably more than that of the other standard object detection mechanism.Our approach may therefore satisfy the true demands of night scene target detection applications.We very much believe that our method will help future research.