●AIM:To investigate a pioneering framework for the segmentation of meibomian glands(MGs),using limited annotations to reduce the workload on ophthalmologists and enhance the efficiency of clinical diagnosis.●METHODS...●AIM:To investigate a pioneering framework for the segmentation of meibomian glands(MGs),using limited annotations to reduce the workload on ophthalmologists and enhance the efficiency of clinical diagnosis.●METHODS:Totally 203 infrared meibomian images from 138 patients with dry eye disease,accompanied by corresponding annotations,were gathered for the study.A rectified scribble-supervised gland segmentation(RSSGS)model,incorporating temporal ensemble prediction,uncertainty estimation,and a transformation equivariance constraint,was introduced to address constraints imposed by limited supervision information inherent in scribble annotations.The viability and efficacy of the proposed model were assessed based on accuracy,intersection over union(IoU),and dice coefficient.●RESULTS:Using manual labels as the gold standard,RSSGS demonstrated outcomes with an accuracy of 93.54%,a dice coefficient of 78.02%,and an IoU of 64.18%.Notably,these performance metrics exceed the current weakly supervised state-of-the-art methods by 0.76%,2.06%,and 2.69%,respectively.Furthermore,despite achieving a substantial 80%reduction in annotation costs,it only lags behind fully annotated methods by 0.72%,1.51%,and 2.04%.●CONCLUSION:An innovative automatic segmentation model is developed for MGs in infrared eyelid images,using scribble annotation for training.This model maintains an exceptionally high level of segmentation accuracy while substantially reducing training costs.It holds substantial utility for calculating clinical parameters,thereby greatly enhancing the diagnostic efficiency of ophthalmologists in evaluating meibomian gland dysfunction.展开更多
Road traffic safety can decrease when drivers drive in a low-visibility environment.The application of visual perception technology to detect vehicles and pedestrians in infrared images proves to be an effective means...Road traffic safety can decrease when drivers drive in a low-visibility environment.The application of visual perception technology to detect vehicles and pedestrians in infrared images proves to be an effective means of reducing the risk of accidents.To tackle the challenges posed by the low recognition accuracy and the substan-tial computational burden associated with current infrared pedestrian-vehicle detection methods,an infrared pedestrian-vehicle detection method A proposal is presented,based on an enhanced version of You Only Look Once version 5(YOLOv5).First,A head specifically designed for detecting small targets has been integrated into the model to make full use of shallow feature information to enhance the accuracy in detecting small targets.Second,the Focal Generalized Intersection over Union(GIoU)is employed as an alternative to the original loss function to address issues related to target overlap and category imbalance.Third,the distribution shift convolution optimization feature extraction operator is used to alleviate the computational burden of the model without significantly compromising detection accuracy.The test results of the improved algorithm show that its average accuracy(mAP)reaches 90.1%.Specifically,the Giga Floating Point Operations Per second(GFLOPs)of the improved algorithm is only 9.1.In contrast,the improved algorithms outperformed the other algorithms on similar GFLOPs,such as YOLOv6n(11.9),YOLOv8n(8.7),YOLOv7t(13.2)and YOLOv5s(16.0).The mAPs that are 4.4%,3%,3.5%,and 1.7%greater than those of these algorithms show that the improved algorithm achieves higher accuracy in target detection tasks under similar computational resource overhead.On the other hand,compared with other algorithms such as YOLOv8l(91.1%),YOLOv6l(89.5%),YOLOv7(90.8%),and YOLOv3(90.1%),the improved algorithm needs only 5.5%,2.3%,8.6%,and 2.3%,respectively,of the GFLOPs.The improved algorithm has shown significant advancements in balancing accuracy and computational efficiency,making it promising for practical use in resource-limited scenarios.展开更多
Heat transfer and temperature evolution in overburden fracture and ground fissures are one of the essential topics for the identification of ground fissures via unmanned aerial vehicle(UAV) infrared imager. In this st...Heat transfer and temperature evolution in overburden fracture and ground fissures are one of the essential topics for the identification of ground fissures via unmanned aerial vehicle(UAV) infrared imager. In this study, discrete element software UDEC was employed to investigate the overburden fracture field under different mining conditions. Multiphysics software COMSOL were employed to investigate heat transfer and temperature evolution of overburden fracture and ground fissures under the influence of mining condition, fissure depth, fissure width, and month alternation. The UAV infrared field measurements also provided a calibration for numerical simulation. The results showed that for ground fissures connected to underground goaf(Fissure Ⅰ), the temperature difference increased with larger mining height and shallow buried depth. In addition, Fissure Ⅰ located in the boundary of the goaf have a greater temperature difference and is easier to be identified than fissures located above the mining goaf. For ground fissures having no connection to underground goaf(Fissure Ⅱ), the heat transfer is affected by the internal resistance of the overlying strata fracture when the depth of Fissure Ⅱ is greater than10 m, the temperature of Fissure Ⅱ gradually equals to the ground temperature as the fissures’ depth increases, and the fissures are difficult to be identified. The identification effect is most obvious for fissures larger than 16 cm under the same depth. In spring and summer, UAV infrared identification of mining fissures should be carried out during nighttime. This study provides the basis for the optimal time and season for the UAV infrared identification of different types of mining ground fissures.展开更多
For better night-vision applications using the low-light-level visible and infrared imaging, a fusion framework for night-vision context enhancement(FNCE) method is proposed. An adaptive brightness stretching method...For better night-vision applications using the low-light-level visible and infrared imaging, a fusion framework for night-vision context enhancement(FNCE) method is proposed. An adaptive brightness stretching method is first proposed for enhancing the visible image. Then, a hybrid multi-scale decomposition with edge-preserving filtering is proposed to decompose the source images. Finally, the fused result is obtained via a combination of the decomposed images in three different rules. Experimental results demonstrate that the FNCE method has better performance on the details(edges), the contrast, the sharpness, and the human visual perception. Therefore,better results for the night-vision context enhancement can be achieved.展开更多
Facial expression and emotion recognition from thermal infrared images has attracted more and more attentions in recent years. However, the features adopted in current work are either temperature statistical parameter...Facial expression and emotion recognition from thermal infrared images has attracted more and more attentions in recent years. However, the features adopted in current work are either temperature statistical parameters extracted from the facial regions of interest or several hand-crafted features that are commonly used in visible spectrum. Till now there are no image features specially designed for thermal infrared images. In this paper, we propose using the deep Boltzmann machine to learn thermal features for emotion recognition from thermal infrared facial images. First, the face is located and normalized from the thermal infrared im- ages. Then, a deep Boltzmann machine model composed of two layers is trained. The parameters of the deep Boltzmann machine model are further fine-tuned for emotion recognition after pre-tralning of feature learning. Comparative experimental results on the NVIE database demonstrate that our approach outperforms other approaches using temperature statistic features or hand-crafted features borrowed from visible domain. The learned features from the forehead, eye, and mouth are more effective for discriminating valence dimension of emotion than other facial areas. In addition, our study shows that adding unlabeled data from other database during training can also improve feature learning performance.展开更多
As the representative of flexibility in optical imaging media,in recent years,fiber bundles have emerged as a promising architecture in the development of compact visual systems.Dedicated to tackling the problems of u...As the representative of flexibility in optical imaging media,in recent years,fiber bundles have emerged as a promising architecture in the development of compact visual systems.Dedicated to tackling the problems of universal honeycomb artifacts and low signal-to-noise ratio(SNR)imaging in fiber bundles,the iterative super-resolution reconstruction network based on a physical model is proposed.Under the constraint of solving the two subproblems of data fidelity and prior regularization term alternately,the network can efficiently“regenerate”the lost spatial resolution with deep learning.By building and calibrating a dual-path imaging system,the real-world dataset where paired low-resolution(LR)-high-resolution(HR)images on the same scene can be generated simultaneously.Numerical results on both the United States Air Force(USAF)resolution target and complex target objects demonstrate that the algorithm can restore high-contrast images without pixilated noise.On the basis of super-resolution reconstruction,compound eye image composition based on fiber bundle is also embedded in this paper for the actual imaging requirements.The proposed work is the first to apply a physical model-based deep learning network to fiber bundle imaging in the infrared band,effectively promoting the engineering application of thermal radiation detection.展开更多
Real-time polarization medium-wave infrared(MIR)optical imaging systems enable the acquisition of infrared and polarization information for a target.At present,real-time polarization MIR devices face the following pro...Real-time polarization medium-wave infrared(MIR)optical imaging systems enable the acquisition of infrared and polarization information for a target.At present,real-time polarization MIR devices face the following problems:poor real-time performance,low transmission and high requirements for fabrication and integration.Herein,we aim to improve the performance of real-time polarization imaging systems in the MIR waveband and solve the above-mentioned defects.Therefore,we propose a MIR polarization imaging system to achieve real-time polarization-modulated imaging with high transmission as well as improved performance based on a pixel-wise metasurface micro-polarization array(PMMPA).The PMMPA element comprises several linear polarization(LP)filters with different polarization angles.The optimization results demonstrate that the transmittance of the center field of view for the LP filters is up to 77%at a wavelength of4.0μm and an extinction ratio of 88 d B.In addition,a near-diffraction-limited real-time MIR imaging optical system is designed with a field of view of 5°and an F-number of 2.The simulation results show that an MIR polarization imaging system with excellent real-time performance and high transmission is achieved by using the optimized PMMPA element.Therefore,the method is compatible with the available optical system design technologies and provides a way to realize real-time polarization imaging in MIR wavebands.展开更多
Objective:To investigate the differences between meditation and resting states using infrared thermal imaging(IRTI)to determine facial temperature distribution features during meditation and annotate the patterns of f...Objective:To investigate the differences between meditation and resting states using infrared thermal imaging(IRTI)to determine facial temperature distribution features during meditation and annotate the patterns of facial temperature changes during meditation from the perspective of traditional Chinese medicine facial diagnosis.Methods:Each participant performed 10 min meditation and 10 min resting but in different sequences.A concentration test was set as the task load,followed by a meditation/resting or resting/meditation session,during which the participants'facial temperatures were observed using IRTI.Participants were scored on the Big Five Inventory(BFI)and Mindful Attention Awareness Scale(MAAS).Results:Forehead temperatures decreased more during meditation than during the resting state.The chin temperature increased only during meditation(P<.0001).For the subjects with meditation experience,there were significant differences in the temperatures of the left forehead(P<.01),right forehead(P<.01)and chin(P<.05)between the meditation and resting state at the 10~(th)min.In the nontask state,the BFI-Extraversion showed a negative correlation with the temperature of the left forehead(R=-0.41,P=.03).In the post-task state,the temperature of the left forehead was negatively correlated with scores on the MAAS(R=-0.42,P=.02).Conclusion:Using IRTI to study meditation offers a practical solution to the challenges in meditation research.The results indicate that an increase in chin temperature may be a representative feature of a meditation state,and forehead temperature is also a potential indicator.展开更多
Background Fiber maturity is a key cotton quality property,and its variability in a sample impacts fiber processing and dyeing performance.Currently,the maturity is determined by using established protocols in laborat...Background Fiber maturity is a key cotton quality property,and its variability in a sample impacts fiber processing and dyeing performance.Currently,the maturity is determined by using established protocols in laboratories under a controlled environment.There is an increasing need to measure fiber maturity using low-cost(in general less than $20000)and small portable systems.In this study,a laboratory feasibility was performed to assess the ability of the shortwave infrared hyperspectral imaging(SWIR HSI)technique for determining the conditioned fiber maturity,and as a comparison,a bench-top commercial and expensive(in general greater than $60000)near infrared(NIR)instrument was used.Results Although SWIR HSI and NIR represent different measurement technologies,consistent spectral characteristics were observed between the two instruments when they were used to measure the maturity of the locule fiber samples in seed cotton and of the well-defined fiber samples,respectively.Partial least squares(PLS)models were established using different spectral preprocessing parameters to predict fiber maturity.The high prediction precision was observed by a lower root mean square error of prediction(RMSEP)(<0.046),higher R_(p)^(2)(>0.518),and greater percentage(97.0%)of samples within the 95% agreement range in the entire NIR region(1000-2500 nm)without the moisture band at 1940 nm.Conclusion SWIR HSI has a good potential for assessing cotton fiber maturity in a laboratory environment.展开更多
This study aimed to propose road crack detection method based on infrared image fusion technology.By analyzing the characteristics of road crack images,this method uses a variety of infrared image fusion methods to pr...This study aimed to propose road crack detection method based on infrared image fusion technology.By analyzing the characteristics of road crack images,this method uses a variety of infrared image fusion methods to process different types of images.The use of this method allows the detection of road cracks,which not only reduces the professional requirements for inspectors,but also improves the accuracy of road crack detection.Based on infrared image processing technology,on the basis of in-depth analysis of infrared image features,a road crack detection method is proposed,which can accurately identify the road crack location,direction,length,and other characteristic information.Experiments showed that this method has a good effect,and can meet the requirement of road crack detection.展开更多
In the methods of image thresholding segmentation, such methods based on two-dimensional (2D) histogram and optimal objective functions are important. However, when they are used for infrared image segmentation, the...In the methods of image thresholding segmentation, such methods based on two-dimensional (2D) histogram and optimal objective functions are important. However, when they are used for infrared image segmentation, they are weak in suppressing background noises and worse in segmenting targets with non-uniform gray level. The concept of 2D histogram shape modification is proposed, which is realized by target information prior restraint after enhancing target information using plateau histogram equalization. The formula of 2D minimum Renyi entropy is deduced for image segmentation, then the shape-modified 2D histogram is combined wfth four optimal objective functions (i.e., maximum between-class variance, maximum entropy, maximum correlation and minimum Renyi entropy) respectively for the appli- cation of infrared image segmentation. Simultaneously, F-measure is introduced to evaluate the segmentation effects objectively. The experimental results show that F-measure is an effective evaluation index for image segmentation since its value is fully consistent with the subjective evaluation, and after 2D histogram shape modification, the methods of optimal objective functions can overcome their original forms' deficiency and their segmentation effects are more or less improvements, where the best one is the maximum entropy method based on 2D histogram shape modification.展开更多
It is crucial to maintain the safe and stable operation of distribution transformers,which constitute a key part of power systems.In the event of transformer failure,the fault type must be diagnosed in a timely and ac...It is crucial to maintain the safe and stable operation of distribution transformers,which constitute a key part of power systems.In the event of transformer failure,the fault type must be diagnosed in a timely and accurate manner.To this end,a transformer fault diagnosis method based on infrared image processing and semi-supervised learning is proposed herein.First,we perform feature extraction on the collected infrared-image data to extract temperature,texture,and shape features as the model reference vectors.Then,a generative adversarial network(GAN)is constructed to generate synthetic samples for the minority subset of labelled samples.The proposed method can learn information from unlabeled sample data,unlike conventional supervised learning methods.Subsequently,a semi-supervised graph model is trained on the entire dataset,i.e.,both labeled and unlabeled data.Finally,we test the proposed model on an actual dataset collected from a Chinese electricity provider.The experimental results show that the use of feature extraction,sample generation,and semi-supervised learning model can improve the accuracy of transformer fault classification.This verifies the effectiveness of the proposed method.展开更多
The stress and gas pressure in deep coal seams are very high,and instability and failure rapidly and intensely occur.It is important to study the infrared precursor characteristics of gas-bearing coal instability and ...The stress and gas pressure in deep coal seams are very high,and instability and failure rapidly and intensely occur.It is important to study the infrared precursor characteristics of gas-bearing coal instability and failure.In this paper,a self-developed stress-gas coupling failure infrared experimental system was used to analyse the infrared radiation temperature(IRT)and infrared thermal image precursor characteristics of gas-free coal and gas-bearing coal.The changes in the areas of the infrared temperature anomalous precursor regions and the effect of the gas on the infrared precursors were examined.The results show that high-temperature anomalous precursors arise mainly when the gas-free coal fails under loading,whereas the gas-bearing coal has high-temperature and low-temperature anomalous precursors.The area of the high-temperature anomalous precursor is approximately 30%–40%under gasbearing coal unstable failure,which is lower than the 60%–70%of the gas-free coal.The area of the low-temperature abnormal precursor is approximately 3%–6%,which is higher than the 1%–2%of the gas-free coal.With increasing gas pressure,the area of the high-temperature anomalous precursor gradually decreases,and the area of the low-temperature anomalous precursor gradually increases.The highand low-temperature anomalous precursors of gas-bearing coal are mainly caused by gas desorption,volume expansion,and thermal friction.The presence of gas inhibits the increase in IRT on the coal surface and increases the difficulty of infrared radiation(IR)monitoring and early warning for gas-bearing coal.展开更多
The present paper utilizes thermal infrared image for inversion of winter wheat yield and biomass with different technology of irrigation(drip irrigation,sprinkler irrigation,flood irrigation).It is the first time tha...The present paper utilizes thermal infrared image for inversion of winter wheat yield and biomass with different technology of irrigation(drip irrigation,sprinkler irrigation,flood irrigation).It is the first time that thermal infrared image is used for predicting the winter wheat yield and biomass.The temperature of crop and background was measured by thermal infrared image.It is necessary to get the crop background separation index(CBSIL,CBSIH),which can be used for distinguishing the crop value from the image.CBSIL and CBSIH(the temperature when the leaves are wet adequately;the temperature when the stomata of leaf is closed completely) are the threshold values.The temperature of crop ranged from CBSIL to CBSIH.Then the ICWSI was calculated based on relevant theoretical method.The value of stomata leaf has strong negative correlation with ICWSI proving the reliable value of ICWSI.In order to construct the high accuracy simulation model,the samples were divided into two parts.One was used for constructing the simulation model,the other for checking the accuracy of the model.Such result of the model was concluded as:(1) As for the simulation model of soil moisture,the correlation coefficient(R2) is larger than 0.887 6,the average of relative error(Er) ranges from 13.33% to 16.88%;(2) As for the simulation model of winter wheat yield,drip irrigation(0.887 6,16.89%,-0.12),sprinkler irrigation(0.970 0,14.85%,-0.12),flood irrigation(0.969 0,18.87%,0.18),with the values of R2,Er and CRM listed in the parentheses followed by the individual term.(3) As for winter wheat biomass,drip irrigation(0.980 0,13.70%,0.13),sprinkler irrigation(0.95,13.15%,-0.14),flood irrigation(0.970 0,14.48%,-0.13),and the values in the parentheses are demonstrated the same as above.Both the CRM and Er are shown to be very low values,which points to the accuracy and reliability of the model investigated.The accuracy of model is high and reliable.The results indicated that thermal infrared image can be used potentially for inversion of winter wheat yield and biomass.展开更多
Multi-source information can be obtained through the fusion of infrared images and visible light images,which have the characteristics of complementary information.However,the existing acquisition methods of fusion im...Multi-source information can be obtained through the fusion of infrared images and visible light images,which have the characteristics of complementary information.However,the existing acquisition methods of fusion images have disadvantages such as blurred edges,low contrast,and loss of details.Based on convolution sparse representation and improved pulse-coupled neural network this paper proposes an image fusion algorithm that decompose the source images into high-frequency and low-frequency subbands by non-subsampled Shearlet Transform(NSST).Furthermore,the low-frequency subbands were fused by convolutional sparse representation(CSR),and the high-frequency subbands were fused by an improved pulse coupled neural network(IPCNN)algorithm,which can effectively solve the problem of difficulty in setting parameters of the traditional PCNN algorithm,improving the performance of sparse representation with details injection.The result reveals that the proposed method in this paper has more advantages than the existing mainstream fusion algorithms in terms of visual effects and objective indicators.展开更多
Background: Despite its variety of potential applications, the wide implementation of infrared technology in cattle production faces technical, environmental and biological challenges similar to other indicators of m...Background: Despite its variety of potential applications, the wide implementation of infrared technology in cattle production faces technical, environmental and biological challenges similar to other indicators of metabolic state. Nine trials, divided into three classes (technological, environmental and biological factors) were conducted to illustrate the influence of these factors on body surface temperature assessed through infrared imaging. Results: Evaluation of technological factors indicated the following: measurements of body temperatures were strongly repeatable when taken within ]0 s; appropriateness of differing infrared camera technologies was influenced by distance to the target; and results were consistent when analysis of thermographs was compared between judges. Evaluation of environmental factors illustrated that wind and debris caused decreases in body surface temperatures without affecting metabolic rate; additionally, body surface temperature increased due to sunlight but returned to baseline values within minutes of shade exposure. Examination/investigation/exploration of animal factors demonstrated that exercise caused an increase in body surface temperature and metabolic rate. Administration of sedative and anti-sedative caused changes on body surface temperature and metabolic rate, and during late pregnancy a foetal thermal imprint was visible through abdominal infrared imaging. Conclusion: The above factors should be considered in order to standardize operational procedures for taking thermographs, thereby optimizing the use of such technology in cattle operations.展开更多
A novel portable infrared imaging system based on uncooled focal plane array and programmable system-on-chip(SoC)was proposed.The latest Xilinx Zynq-7000 was used to integrate the main part of the system into a sing...A novel portable infrared imaging system based on uncooled focal plane array and programmable system-on-chip(SoC)was proposed.The latest Xilinx Zynq-7000 was used to integrate the main part of the system into a single SoC.Parallel arithmetic units and digital modules were implemented on the programmable logic(PL)of Zynq-7000 to decrease system size and ensure the real-time p nonuniformity correction,while programs running on the processing system(PS)of Zynq-7000 controlled the system work flow and provided human-machine interfaces using open-source software such as Linux and OpenCV.Meanwhile,industry standard advanced extendable interface(AXI)buses were adopted to encapsulating standardized IP cores and build high speed data exchange bridges between units within Zynq-7000.Test results indicate that the image quality and real-time performance of the system can meet application requirements.And it provided a more flexible and extendable solution for evaluating and deploying infrared image enhancement and nonuniformity correction algorithms.展开更多
Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhanc...Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhancement and visual improvement.To deal with these problems,a sub-regional infrared-visible image fusion method(SRF)is proposed.First,morphology and threshold segmentation is applied to extract targets interested in infrared images.Second,the infrared back-ground is reconstructed based on extracted targets and the visible image.Finally,target and back-ground regions are fused using a multi-scale transform.Experimental results are obtained using public data for comparison and evaluation,which demonstrate that the proposed SRF has poten-tial benefits over other methods.展开更多
Keyhole is one of the important phenomena in high-power laser welding process. By studying the keyhole characteristic and detecting the seam offset during high-power fiber laser welding, an infrared sensitive high-spe...Keyhole is one of the important phenomena in high-power laser welding process. By studying the keyhole characteristic and detecting the seam offset during high-power fiber laser welding, an infrared sensitive high-speed camera arranged off-axis orientation of laser beam was applied to capture the dynamic thermal images of a molten pool. The 304 austenitic stainless steel plate butt joint welding experiment with laser power 10 kW was carried out. Through analyzing the keyhole infrared image, the weld position was calculated. Least squares method was used to determine the actual weld position. Image filtering technique was used to process the keyhole image, and the keyhole centroid coordinate were calculated. Also, least squares method was used to fit the keyhole centroid curve equation and establish a nonlinear continuous model which described the deviation between keyhole centroid and weld seam. The heat accumulation effect affected by the infrared radiation was analyzed to determine whether the laser beam focus spot deviated from the desired welding seam. Experimental results showed that the keyhole centroid has related to the seam offset, and can reflect the welding quality.展开更多
The high-frequency components in the traditional multi-scale transform method are approximately sparse, which can represent different information of the details. But in the low-frequency component, the coefficients ar...The high-frequency components in the traditional multi-scale transform method are approximately sparse, which can represent different information of the details. But in the low-frequency component, the coefficients around the zero value are very few, so we cannot sparsely represent low-frequency image information. The low-frequency component contains the main energy of the image and depicts the profile of the image. Direct fusion of the low-frequency component will not be conducive to obtain highly accurate fusion result. Therefore, this paper presents an infrared and visible image fusion method combining the multi-scale and top-hat transforms. On one hand, the new top-hat-transform can effectively extract the salient features of the low-frequency component. On the other hand, the multi-scale transform can extract highfrequency detailed information in multiple scales and from diverse directions. The combination of the two methods is conducive to the acquisition of more characteristics and more accurate fusion results. Among them, for the low-frequency component, a new type of top-hat transform is used to extract low-frequency features, and then different fusion rules are applied to fuse the low-frequency features and low-frequency background; for high-frequency components, the product of characteristics method is used to integrate the detailed information in high-frequency. Experimental results show that the proposed algorithm can obtain more detailed information and clearer infrared target fusion results than the traditional multiscale transform methods. Compared with the state-of-the-art fusion methods based on sparse representation, the proposed algorithm is simple and efficacious, and the time consumption is significantly reduced.展开更多
基金Supported by Natural Science Foundation of Fujian Province(No.2020J011084)Fujian Province Technology and Economy Integration Service Platform(No.2023XRH001)Fuzhou-Xiamen-Quanzhou National Independent Innovation Demonstration Zone Collaborative Innovation Platform(No.2022FX5)。
文摘●AIM:To investigate a pioneering framework for the segmentation of meibomian glands(MGs),using limited annotations to reduce the workload on ophthalmologists and enhance the efficiency of clinical diagnosis.●METHODS:Totally 203 infrared meibomian images from 138 patients with dry eye disease,accompanied by corresponding annotations,were gathered for the study.A rectified scribble-supervised gland segmentation(RSSGS)model,incorporating temporal ensemble prediction,uncertainty estimation,and a transformation equivariance constraint,was introduced to address constraints imposed by limited supervision information inherent in scribble annotations.The viability and efficacy of the proposed model were assessed based on accuracy,intersection over union(IoU),and dice coefficient.●RESULTS:Using manual labels as the gold standard,RSSGS demonstrated outcomes with an accuracy of 93.54%,a dice coefficient of 78.02%,and an IoU of 64.18%.Notably,these performance metrics exceed the current weakly supervised state-of-the-art methods by 0.76%,2.06%,and 2.69%,respectively.Furthermore,despite achieving a substantial 80%reduction in annotation costs,it only lags behind fully annotated methods by 0.72%,1.51%,and 2.04%.●CONCLUSION:An innovative automatic segmentation model is developed for MGs in infrared eyelid images,using scribble annotation for training.This model maintains an exceptionally high level of segmentation accuracy while substantially reducing training costs.It holds substantial utility for calculating clinical parameters,thereby greatly enhancing the diagnostic efficiency of ophthalmologists in evaluating meibomian gland dysfunction.
文摘Road traffic safety can decrease when drivers drive in a low-visibility environment.The application of visual perception technology to detect vehicles and pedestrians in infrared images proves to be an effective means of reducing the risk of accidents.To tackle the challenges posed by the low recognition accuracy and the substan-tial computational burden associated with current infrared pedestrian-vehicle detection methods,an infrared pedestrian-vehicle detection method A proposal is presented,based on an enhanced version of You Only Look Once version 5(YOLOv5).First,A head specifically designed for detecting small targets has been integrated into the model to make full use of shallow feature information to enhance the accuracy in detecting small targets.Second,the Focal Generalized Intersection over Union(GIoU)is employed as an alternative to the original loss function to address issues related to target overlap and category imbalance.Third,the distribution shift convolution optimization feature extraction operator is used to alleviate the computational burden of the model without significantly compromising detection accuracy.The test results of the improved algorithm show that its average accuracy(mAP)reaches 90.1%.Specifically,the Giga Floating Point Operations Per second(GFLOPs)of the improved algorithm is only 9.1.In contrast,the improved algorithms outperformed the other algorithms on similar GFLOPs,such as YOLOv6n(11.9),YOLOv8n(8.7),YOLOv7t(13.2)and YOLOv5s(16.0).The mAPs that are 4.4%,3%,3.5%,and 1.7%greater than those of these algorithms show that the improved algorithm achieves higher accuracy in target detection tasks under similar computational resource overhead.On the other hand,compared with other algorithms such as YOLOv8l(91.1%),YOLOv6l(89.5%),YOLOv7(90.8%),and YOLOv3(90.1%),the improved algorithm needs only 5.5%,2.3%,8.6%,and 2.3%,respectively,of the GFLOPs.The improved algorithm has shown significant advancements in balancing accuracy and computational efficiency,making it promising for practical use in resource-limited scenarios.
基金supported by the National Natural Science Foundation of China(Nos.52225402 and U1910206).
文摘Heat transfer and temperature evolution in overburden fracture and ground fissures are one of the essential topics for the identification of ground fissures via unmanned aerial vehicle(UAV) infrared imager. In this study, discrete element software UDEC was employed to investigate the overburden fracture field under different mining conditions. Multiphysics software COMSOL were employed to investigate heat transfer and temperature evolution of overburden fracture and ground fissures under the influence of mining condition, fissure depth, fissure width, and month alternation. The UAV infrared field measurements also provided a calibration for numerical simulation. The results showed that for ground fissures connected to underground goaf(Fissure Ⅰ), the temperature difference increased with larger mining height and shallow buried depth. In addition, Fissure Ⅰ located in the boundary of the goaf have a greater temperature difference and is easier to be identified than fissures located above the mining goaf. For ground fissures having no connection to underground goaf(Fissure Ⅱ), the heat transfer is affected by the internal resistance of the overlying strata fracture when the depth of Fissure Ⅱ is greater than10 m, the temperature of Fissure Ⅱ gradually equals to the ground temperature as the fissures’ depth increases, and the fissures are difficult to be identified. The identification effect is most obvious for fissures larger than 16 cm under the same depth. In spring and summer, UAV infrared identification of mining fissures should be carried out during nighttime. This study provides the basis for the optimal time and season for the UAV infrared identification of different types of mining ground fissures.
基金supported by the National Natural Science Foundation of China(No.61231014)the Foundation of Army Armaments Department of China(No.6140414050327)the Foundation of Science and Technology on Low-Light-Level Night Vision Laboratory(No.BJ2017001)
文摘For better night-vision applications using the low-light-level visible and infrared imaging, a fusion framework for night-vision context enhancement(FNCE) method is proposed. An adaptive brightness stretching method is first proposed for enhancing the visible image. Then, a hybrid multi-scale decomposition with edge-preserving filtering is proposed to decompose the source images. Finally, the fused result is obtained via a combination of the decomposed images in three different rules. Experimental results demonstrate that the FNCE method has better performance on the details(edges), the contrast, the sharpness, and the human visual perception. Therefore,better results for the night-vision context enhancement can be achieved.
基金This paper was supported by the National Natural Science Foundation of China (Grant Nos. 61175037, 61228304), Special Innovation Project on Speech of Anhui Province (11010202192), Project from Anhui Science and Technology Agency (1106c0805008) and the Fundamental Research Funds for the Central Universities. We also acknowledge partial support from the US National Science Foundation (1205664).
文摘Facial expression and emotion recognition from thermal infrared images has attracted more and more attentions in recent years. However, the features adopted in current work are either temperature statistical parameters extracted from the facial regions of interest or several hand-crafted features that are commonly used in visible spectrum. Till now there are no image features specially designed for thermal infrared images. In this paper, we propose using the deep Boltzmann machine to learn thermal features for emotion recognition from thermal infrared facial images. First, the face is located and normalized from the thermal infrared im- ages. Then, a deep Boltzmann machine model composed of two layers is trained. The parameters of the deep Boltzmann machine model are further fine-tuned for emotion recognition after pre-tralning of feature learning. Comparative experimental results on the NVIE database demonstrate that our approach outperforms other approaches using temperature statistic features or hand-crafted features borrowed from visible domain. The learned features from the forehead, eye, and mouth are more effective for discriminating valence dimension of emotion than other facial areas. In addition, our study shows that adding unlabeled data from other database during training can also improve feature learning performance.
基金the National Natural Science Foundation of China(Grant Nos.61905115,62105151,62175109,U21B2033)Leading Technology of Jiangsu Basic Research Plan(Grant No.BK20192003)+2 种基金Youth Foundation of Jiangsu Province(Grant Nos.BK20190445,BK20210338)Fundamental Research Funds for the Central Universities(Grant No.30920032101)Open Research Fund of Jiangsu Key Laboratory of Spectral Imaging&Intelligent Sense(Grant No.JSGP202105)to provide fund for conducting experiments。
文摘As the representative of flexibility in optical imaging media,in recent years,fiber bundles have emerged as a promising architecture in the development of compact visual systems.Dedicated to tackling the problems of universal honeycomb artifacts and low signal-to-noise ratio(SNR)imaging in fiber bundles,the iterative super-resolution reconstruction network based on a physical model is proposed.Under the constraint of solving the two subproblems of data fidelity and prior regularization term alternately,the network can efficiently“regenerate”the lost spatial resolution with deep learning.By building and calibrating a dual-path imaging system,the real-world dataset where paired low-resolution(LR)-high-resolution(HR)images on the same scene can be generated simultaneously.Numerical results on both the United States Air Force(USAF)resolution target and complex target objects demonstrate that the algorithm can restore high-contrast images without pixilated noise.On the basis of super-resolution reconstruction,compound eye image composition based on fiber bundle is also embedded in this paper for the actual imaging requirements.The proposed work is the first to apply a physical model-based deep learning network to fiber bundle imaging in the infrared band,effectively promoting the engineering application of thermal radiation detection.
基金Project supported by the National Key R&D Program of China(Grant No.SKLA02020001A05)。
文摘Real-time polarization medium-wave infrared(MIR)optical imaging systems enable the acquisition of infrared and polarization information for a target.At present,real-time polarization MIR devices face the following problems:poor real-time performance,low transmission and high requirements for fabrication and integration.Herein,we aim to improve the performance of real-time polarization imaging systems in the MIR waveband and solve the above-mentioned defects.Therefore,we propose a MIR polarization imaging system to achieve real-time polarization-modulated imaging with high transmission as well as improved performance based on a pixel-wise metasurface micro-polarization array(PMMPA).The PMMPA element comprises several linear polarization(LP)filters with different polarization angles.The optimization results demonstrate that the transmittance of the center field of view for the LP filters is up to 77%at a wavelength of4.0μm and an extinction ratio of 88 d B.In addition,a near-diffraction-limited real-time MIR imaging optical system is designed with a field of view of 5°and an F-number of 2.The simulation results show that an MIR polarization imaging system with excellent real-time performance and high transmission is achieved by using the optimized PMMPA element.Therefore,the method is compatible with the available optical system design technologies and provides a way to realize real-time polarization imaging in MIR wavebands.
基金supported by the Fundamental Research Funds for the Central Universities(x2021-JYB-XJSJJ-032)Beijing Municipal Commission of Education,Double First-class,High-caliber Talents Grant(1000041510156)。
文摘Objective:To investigate the differences between meditation and resting states using infrared thermal imaging(IRTI)to determine facial temperature distribution features during meditation and annotate the patterns of facial temperature changes during meditation from the perspective of traditional Chinese medicine facial diagnosis.Methods:Each participant performed 10 min meditation and 10 min resting but in different sequences.A concentration test was set as the task load,followed by a meditation/resting or resting/meditation session,during which the participants'facial temperatures were observed using IRTI.Participants were scored on the Big Five Inventory(BFI)and Mindful Attention Awareness Scale(MAAS).Results:Forehead temperatures decreased more during meditation than during the resting state.The chin temperature increased only during meditation(P<.0001).For the subjects with meditation experience,there were significant differences in the temperatures of the left forehead(P<.01),right forehead(P<.01)and chin(P<.05)between the meditation and resting state at the 10~(th)min.In the nontask state,the BFI-Extraversion showed a negative correlation with the temperature of the left forehead(R=-0.41,P=.03).In the post-task state,the temperature of the left forehead was negatively correlated with scores on the MAAS(R=-0.42,P=.02).Conclusion:Using IRTI to study meditation offers a practical solution to the challenges in meditation research.The results indicate that an increase in chin temperature may be a representative feature of a meditation state,and forehead temperature is also a potential indicator.
基金supported partially by the USDA-ARS Research Project#6054-44000-080-00D.
文摘Background Fiber maturity is a key cotton quality property,and its variability in a sample impacts fiber processing and dyeing performance.Currently,the maturity is determined by using established protocols in laboratories under a controlled environment.There is an increasing need to measure fiber maturity using low-cost(in general less than $20000)and small portable systems.In this study,a laboratory feasibility was performed to assess the ability of the shortwave infrared hyperspectral imaging(SWIR HSI)technique for determining the conditioned fiber maturity,and as a comparison,a bench-top commercial and expensive(in general greater than $60000)near infrared(NIR)instrument was used.Results Although SWIR HSI and NIR represent different measurement technologies,consistent spectral characteristics were observed between the two instruments when they were used to measure the maturity of the locule fiber samples in seed cotton and of the well-defined fiber samples,respectively.Partial least squares(PLS)models were established using different spectral preprocessing parameters to predict fiber maturity.The high prediction precision was observed by a lower root mean square error of prediction(RMSEP)(<0.046),higher R_(p)^(2)(>0.518),and greater percentage(97.0%)of samples within the 95% agreement range in the entire NIR region(1000-2500 nm)without the moisture band at 1940 nm.Conclusion SWIR HSI has a good potential for assessing cotton fiber maturity in a laboratory environment.
文摘This study aimed to propose road crack detection method based on infrared image fusion technology.By analyzing the characteristics of road crack images,this method uses a variety of infrared image fusion methods to process different types of images.The use of this method allows the detection of road cracks,which not only reduces the professional requirements for inspectors,but also improves the accuracy of road crack detection.Based on infrared image processing technology,on the basis of in-depth analysis of infrared image features,a road crack detection method is proposed,which can accurately identify the road crack location,direction,length,and other characteristic information.Experiments showed that this method has a good effect,and can meet the requirement of road crack detection.
基金supported by the China Postdoctoral Science Foundation(20100471451)the Science and Technology Foundation of State Key Laboratory of Underwater Measurement&Control Technology(9140C2603051003)
文摘In the methods of image thresholding segmentation, such methods based on two-dimensional (2D) histogram and optimal objective functions are important. However, when they are used for infrared image segmentation, they are weak in suppressing background noises and worse in segmenting targets with non-uniform gray level. The concept of 2D histogram shape modification is proposed, which is realized by target information prior restraint after enhancing target information using plateau histogram equalization. The formula of 2D minimum Renyi entropy is deduced for image segmentation, then the shape-modified 2D histogram is combined wfth four optimal objective functions (i.e., maximum between-class variance, maximum entropy, maximum correlation and minimum Renyi entropy) respectively for the appli- cation of infrared image segmentation. Simultaneously, F-measure is introduced to evaluate the segmentation effects objectively. The experimental results show that F-measure is an effective evaluation index for image segmentation since its value is fully consistent with the subjective evaluation, and after 2D histogram shape modification, the methods of optimal objective functions can overcome their original forms' deficiency and their segmentation effects are more or less improvements, where the best one is the maximum entropy method based on 2D histogram shape modification.
基金supported by China Southern Power Grid Co.Ltd.science and technology project(Research on the theory,technology and application of stereoscopic disaster defense for power distribution network in large city,GZHKJXM20180060)National Natural Science Foundation of China(No.51477100).
文摘It is crucial to maintain the safe and stable operation of distribution transformers,which constitute a key part of power systems.In the event of transformer failure,the fault type must be diagnosed in a timely and accurate manner.To this end,a transformer fault diagnosis method based on infrared image processing and semi-supervised learning is proposed herein.First,we perform feature extraction on the collected infrared-image data to extract temperature,texture,and shape features as the model reference vectors.Then,a generative adversarial network(GAN)is constructed to generate synthetic samples for the minority subset of labelled samples.The proposed method can learn information from unlabeled sample data,unlike conventional supervised learning methods.Subsequently,a semi-supervised graph model is trained on the entire dataset,i.e.,both labeled and unlabeled data.Finally,we test the proposed model on an actual dataset collected from a Chinese electricity provider.The experimental results show that the use of feature extraction,sample generation,and semi-supervised learning model can improve the accuracy of transformer fault classification.This verifies the effectiveness of the proposed method.
基金supported by the National Natural Science Foundation of China(No.52074280)the National Natural Science Foundation of China(No.52004016)the Priority Academic Program Development(PAPD)of Jiangsu Higher Education Institutions。
文摘The stress and gas pressure in deep coal seams are very high,and instability and failure rapidly and intensely occur.It is important to study the infrared precursor characteristics of gas-bearing coal instability and failure.In this paper,a self-developed stress-gas coupling failure infrared experimental system was used to analyse the infrared radiation temperature(IRT)and infrared thermal image precursor characteristics of gas-free coal and gas-bearing coal.The changes in the areas of the infrared temperature anomalous precursor regions and the effect of the gas on the infrared precursors were examined.The results show that high-temperature anomalous precursors arise mainly when the gas-free coal fails under loading,whereas the gas-bearing coal has high-temperature and low-temperature anomalous precursors.The area of the high-temperature anomalous precursor is approximately 30%–40%under gasbearing coal unstable failure,which is lower than the 60%–70%of the gas-free coal.The area of the low-temperature abnormal precursor is approximately 3%–6%,which is higher than the 1%–2%of the gas-free coal.With increasing gas pressure,the area of the high-temperature anomalous precursor gradually decreases,and the area of the low-temperature anomalous precursor gradually increases.The highand low-temperature anomalous precursors of gas-bearing coal are mainly caused by gas desorption,volume expansion,and thermal friction.The presence of gas inhibits the increase in IRT on the coal surface and increases the difficulty of infrared radiation(IR)monitoring and early warning for gas-bearing coal.
基金China-Germany international cooperation project(IRTG1070)National Natural Science Foundation of China(Item number:0971940)
文摘The present paper utilizes thermal infrared image for inversion of winter wheat yield and biomass with different technology of irrigation(drip irrigation,sprinkler irrigation,flood irrigation).It is the first time that thermal infrared image is used for predicting the winter wheat yield and biomass.The temperature of crop and background was measured by thermal infrared image.It is necessary to get the crop background separation index(CBSIL,CBSIH),which can be used for distinguishing the crop value from the image.CBSIL and CBSIH(the temperature when the leaves are wet adequately;the temperature when the stomata of leaf is closed completely) are the threshold values.The temperature of crop ranged from CBSIL to CBSIH.Then the ICWSI was calculated based on relevant theoretical method.The value of stomata leaf has strong negative correlation with ICWSI proving the reliable value of ICWSI.In order to construct the high accuracy simulation model,the samples were divided into two parts.One was used for constructing the simulation model,the other for checking the accuracy of the model.Such result of the model was concluded as:(1) As for the simulation model of soil moisture,the correlation coefficient(R2) is larger than 0.887 6,the average of relative error(Er) ranges from 13.33% to 16.88%;(2) As for the simulation model of winter wheat yield,drip irrigation(0.887 6,16.89%,-0.12),sprinkler irrigation(0.970 0,14.85%,-0.12),flood irrigation(0.969 0,18.87%,0.18),with the values of R2,Er and CRM listed in the parentheses followed by the individual term.(3) As for winter wheat biomass,drip irrigation(0.980 0,13.70%,0.13),sprinkler irrigation(0.95,13.15%,-0.14),flood irrigation(0.970 0,14.48%,-0.13),and the values in the parentheses are demonstrated the same as above.Both the CRM and Er are shown to be very low values,which points to the accuracy and reliability of the model investigated.The accuracy of model is high and reliable.The results indicated that thermal infrared image can be used potentially for inversion of winter wheat yield and biomass.
基金supported in part by the National Natural Science Foundation of China under Grant 41505017.
文摘Multi-source information can be obtained through the fusion of infrared images and visible light images,which have the characteristics of complementary information.However,the existing acquisition methods of fusion images have disadvantages such as blurred edges,low contrast,and loss of details.Based on convolution sparse representation and improved pulse-coupled neural network this paper proposes an image fusion algorithm that decompose the source images into high-frequency and low-frequency subbands by non-subsampled Shearlet Transform(NSST).Furthermore,the low-frequency subbands were fused by convolutional sparse representation(CSR),and the high-frequency subbands were fused by an improved pulse coupled neural network(IPCNN)algorithm,which can effectively solve the problem of difficulty in setting parameters of the traditional PCNN algorithm,improving the performance of sparse representation with details injection.The result reveals that the proposed method in this paper has more advantages than the existing mainstream fusion algorithms in terms of visual effects and objective indicators.
基金the Beef Producers of Ontario,Ontario Ministry of Agriculture and Rural Affairs,Beef Cattle Research Council and Agri-Food Canada for financial support
文摘Background: Despite its variety of potential applications, the wide implementation of infrared technology in cattle production faces technical, environmental and biological challenges similar to other indicators of metabolic state. Nine trials, divided into three classes (technological, environmental and biological factors) were conducted to illustrate the influence of these factors on body surface temperature assessed through infrared imaging. Results: Evaluation of technological factors indicated the following: measurements of body temperatures were strongly repeatable when taken within ]0 s; appropriateness of differing infrared camera technologies was influenced by distance to the target; and results were consistent when analysis of thermographs was compared between judges. Evaluation of environmental factors illustrated that wind and debris caused decreases in body surface temperatures without affecting metabolic rate; additionally, body surface temperature increased due to sunlight but returned to baseline values within minutes of shade exposure. Examination/investigation/exploration of animal factors demonstrated that exercise caused an increase in body surface temperature and metabolic rate. Administration of sedative and anti-sedative caused changes on body surface temperature and metabolic rate, and during late pregnancy a foetal thermal imprint was visible through abdominal infrared imaging. Conclusion: The above factors should be considered in order to standardize operational procedures for taking thermographs, thereby optimizing the use of such technology in cattle operations.
文摘A novel portable infrared imaging system based on uncooled focal plane array and programmable system-on-chip(SoC)was proposed.The latest Xilinx Zynq-7000 was used to integrate the main part of the system into a single SoC.Parallel arithmetic units and digital modules were implemented on the programmable logic(PL)of Zynq-7000 to decrease system size and ensure the real-time p nonuniformity correction,while programs running on the processing system(PS)of Zynq-7000 controlled the system work flow and provided human-machine interfaces using open-source software such as Linux and OpenCV.Meanwhile,industry standard advanced extendable interface(AXI)buses were adopted to encapsulating standardized IP cores and build high speed data exchange bridges between units within Zynq-7000.Test results indicate that the image quality and real-time performance of the system can meet application requirements.And it provided a more flexible and extendable solution for evaluating and deploying infrared image enhancement and nonuniformity correction algorithms.
基金supported by the China Postdoctoral Science Foundation Funded Project(No.2021M690385)the National Natural Science Foundation of China(No.62101045).
文摘Infrared-visible image fusion plays an important role in multi-source data fusion,which has the advantage of integrating useful information from multi-source sensors.However,there are still challenges in target enhancement and visual improvement.To deal with these problems,a sub-regional infrared-visible image fusion method(SRF)is proposed.First,morphology and threshold segmentation is applied to extract targets interested in infrared images.Second,the infrared back-ground is reconstructed based on extracted targets and the visible image.Finally,target and back-ground regions are fused using a multi-scale transform.Experimental results are obtained using public data for comparison and evaluation,which demonstrate that the proposed SRF has poten-tial benefits over other methods.
文摘Keyhole is one of the important phenomena in high-power laser welding process. By studying the keyhole characteristic and detecting the seam offset during high-power fiber laser welding, an infrared sensitive high-speed camera arranged off-axis orientation of laser beam was applied to capture the dynamic thermal images of a molten pool. The 304 austenitic stainless steel plate butt joint welding experiment with laser power 10 kW was carried out. Through analyzing the keyhole infrared image, the weld position was calculated. Least squares method was used to determine the actual weld position. Image filtering technique was used to process the keyhole image, and the keyhole centroid coordinate were calculated. Also, least squares method was used to fit the keyhole centroid curve equation and establish a nonlinear continuous model which described the deviation between keyhole centroid and weld seam. The heat accumulation effect affected by the infrared radiation was analyzed to determine whether the laser beam focus spot deviated from the desired welding seam. Experimental results showed that the keyhole centroid has related to the seam offset, and can reflect the welding quality.
基金Project supported by the National Natural Science Foundation of China(Grant No.61402368)Aerospace Support Fund,China(Grant No.2017-HT-XGD)Aerospace Science and Technology Innovation Foundation,China(Grant No.2017 ZD 53047)
文摘The high-frequency components in the traditional multi-scale transform method are approximately sparse, which can represent different information of the details. But in the low-frequency component, the coefficients around the zero value are very few, so we cannot sparsely represent low-frequency image information. The low-frequency component contains the main energy of the image and depicts the profile of the image. Direct fusion of the low-frequency component will not be conducive to obtain highly accurate fusion result. Therefore, this paper presents an infrared and visible image fusion method combining the multi-scale and top-hat transforms. On one hand, the new top-hat-transform can effectively extract the salient features of the low-frequency component. On the other hand, the multi-scale transform can extract highfrequency detailed information in multiple scales and from diverse directions. The combination of the two methods is conducive to the acquisition of more characteristics and more accurate fusion results. Among them, for the low-frequency component, a new type of top-hat transform is used to extract low-frequency features, and then different fusion rules are applied to fuse the low-frequency features and low-frequency background; for high-frequency components, the product of characteristics method is used to integrate the detailed information in high-frequency. Experimental results show that the proposed algorithm can obtain more detailed information and clearer infrared target fusion results than the traditional multiscale transform methods. Compared with the state-of-the-art fusion methods based on sparse representation, the proposed algorithm is simple and efficacious, and the time consumption is significantly reduced.