为了提高多视图深度估计结果精度,提出一种基于自适应空间特征增强的多视图深度估计算法。设计了由改进后的特征金字塔网络(feature pyramid network,FPN)和自适应空间特征增强(adaptive space feature enhancement,ASFE)组成的多尺度...为了提高多视图深度估计结果精度,提出一种基于自适应空间特征增强的多视图深度估计算法。设计了由改进后的特征金字塔网络(feature pyramid network,FPN)和自适应空间特征增强(adaptive space feature enhancement,ASFE)组成的多尺度特征提取模块,获取到具有全局上下文信息和位置信息的多尺度特征图像。通过残差学习网络对深度图进行优化,防止多次卷积操作出现重建边缘模糊的问题。通过分类的思想构建focal loss函数增强网络模型的判断能力。由实验结果可知,该算法在DTU(technical university of denmark)数据集上和CasMVSNet(Cascade MVSNet)算法相比,在整体精度误差、运行时间、显存资源占用上分别降低了14.08%、72.15%、4.62%。在Tanks and Temples数据集整体评价指标Mean上该模型优于其他算法,证明提出的基于自适应空间特征增强的多视图深度估计算法的有效性。展开更多
为解决自然条件下人脸表情识别易受角度、光线、遮挡物的影响以及人脸表情数据集各类表情数量不均衡等问题,提出基于Res2Net的人脸表情识别方法。使用Res2Net50作为特征提取的主干网络,在预处理阶段对图像随机翻转、缩放、裁剪进行数据...为解决自然条件下人脸表情识别易受角度、光线、遮挡物的影响以及人脸表情数据集各类表情数量不均衡等问题,提出基于Res2Net的人脸表情识别方法。使用Res2Net50作为特征提取的主干网络,在预处理阶段对图像随机翻转、缩放、裁剪进行数据增强,提升模型的泛化性。引入广义平均池化(generalized mean pooling, GeM)方式,关注图像中比较显著的区域,增强模型的鲁棒性;选用Focal Loss损失函数,针对表情类别不平衡和错误分类问题,提高较难识别表情的识别率。该方法在FER2013数据集上准确率达到了70.41%,相较于原Res2Net50网络提高了1.53%。结果表明,在自然条件下对人脸表情识别具有更好的准确性。展开更多
BACKGROUND In hepatology,the clinical use of endoscopic ultrasound(EUS)has experienced a notable increase in recent times.These applications range from the diagnosis to the treatment of various liver diseases.Therefor...BACKGROUND In hepatology,the clinical use of endoscopic ultrasound(EUS)has experienced a notable increase in recent times.These applications range from the diagnosis to the treatment of various liver diseases.Therefore,this systematic review summarizes the evidence for the diagnostic and therapeutic roles of EUS in liver diseases.AIM To examine and summarize the current available evidence of the possible roles of the EUS in making a suitable diagnosis in liver diseases as well as the therapeutic accuracy and efficacy.METHODS PubMed,Medline,Cochrane Library,Web of Science,and Google Scholar databases were extensively searched until October 2023.The methodological quality of the eligible articles was assessed using the Newcastle-Ottawa scale or Cochrane Risk of Bias tool.In addition,statistical analyses were performed using the Comprehensive Meta-Analysis software.RESULTS Overall,45 articles on EUS were included(28 on diagnostic role and 17 on therapeutic role).Pooled analysis demonstrated that EUS diagnostic tests had an accuracy of 92.4%for focal liver lesions(FLL)and 96.6%for parenchymal liver diseases.EUS-guided liver biopsies with either fine needle aspiration or fine needle biopsy had low complication rates when sampling FLL and parenchymal liver diseases(3.1%and 8.7%,respectively).Analysis of data from four studies showed that EUS-guided liver abscess had high clinical(90.7%)and technical success(90.7%)without significant complications.Similarly,EUS-guided interventions for the treatment of gastric varices(GV)have high technical success(98%)and GV obliteration rate(84%)with few complications(15%)and rebleeding events(17%).CONCLUSION EUS in liver diseases is a promising technique with the potential to be considered a first-line therapeutic and diagnostic option in selected cases.展开更多
On December 18,2023,the Jishishan area in Gansu Province was jolted by a M_(S) 6.2 earthquake,which is the most powerful seismic event that occurred throughout the year in China.The earthquake occurred along the NWtre...On December 18,2023,the Jishishan area in Gansu Province was jolted by a M_(S) 6.2 earthquake,which is the most powerful seismic event that occurred throughout the year in China.The earthquake occurred along the NWtrending Lajishan fault(LJSF),a large tectonic transformation zone.After this event,China Earthquake Networks Center(CENC)has timely published several reports about seismic sources for emergency responses.The earthquake early warning system issued the first alert 4.9 s after the earthquake occurrence,providing prompt notification that effectively mitigated panics,injuries,and deaths of residents.The near real-time focal mechanism solution indicates that this earthquake is associated with a thrust fault.The distribution of aftershocks,the rupture process,and the recorded amplitudes from seismic monitoring and GNSS stations,all suggest that the mainshock rupture predominately propagates to the northwest direction.The duration of the rupture process is~12 s,and the largest slip is located at approximately 6.3 km to the NNW from the epicenter,with a peak slip of 0.12 m at~8 km depth.Seismic station N0028 recorded the highest instrumental intensity,which is 9.4 on the Mercalli scale.The estimated intensity map shows a seismic intensity reaching up to IX near the rupture area,consistent with field survey results.The aftershocks(up to December 22,2023)are mostly distributed in the northwest direction within~20 km of the epicenter.This earthquake caused serious casualties and house collapses,which requires further investigations into the impact of this earthquake.展开更多
The M6.2 earthquake in Jishishan,Gansu Province,on December 18,2023,caused extraordinary earthquake disasters.It was located in the northern part of the north−south seismic zone,which is a key area for earthquake moni...The M6.2 earthquake in Jishishan,Gansu Province,on December 18,2023,caused extraordinary earthquake disasters.It was located in the northern part of the north−south seismic zone,which is a key area for earthquake monitoring in China.The newly built dense strong motion stations in this area provide unprecedented conditions for high-precision earthquake relocation,especially the earthquake focal depth.This paper uses the newly built strong motion and traditional broadband seismic networks to relocate the source locations of the M3.0 and above aftershocks and to invert their focal mechanisms.The horizontal error of earthquake location is estimated to be 0.5−1 km,and the vertical error is 1−2 km.The focal depth range of aftershocks is 9.6−14.6 km,distributed in a 12-km-long strip with SSE direction.Aftershocks in the south are more concentrated horizontally and vertically,while aftershocks in the north are more scattered.The focal mechanisms of the main shock and aftershocks are relatively consistent,and the P-axis orientation is consistent with the regional strain direction.There is a seismic blank area of M3.0 and above,about 3−5 km between the main shock and aftershocks.It is suggested that the energy released by the main shock rupture is concentrated in this area.Based on the earthquake location and focal mechanism of the main shock,it is inferred that the Northern Lajishan fault zone is the seismogenic structure of the main shock,and the main shock did not occur on the main fault,but on a secondary fault.The initial rupture depth and centroid depth of the main shock were 12.8 and 14.0 km,respectively.The source rupture depth may not be the main reason for the severe earthquake disaster.展开更多
A compound varifocal lens based on electromagnetic drive technology is designed and fabricated, where the polydimethylsiloxane(PDMS) film acts as a driving component, while the PDMS biconvex lens and the plane-concave...A compound varifocal lens based on electromagnetic drive technology is designed and fabricated, where the polydimethylsiloxane(PDMS) film acts as a driving component, while the PDMS biconvex lens and the plane-concave lens form a coaxial compound lens system. The plane-concave lens equipped with driving coils is installed directly above the PDMS lens surrounded by the annular magnet. When different currents are applied, the annular magnet moves up and down, driving the PDMS film to undergo elastic deformation, and then resulting in longitudinal movement of the PDMS lens. The position change of the PDMS lens changes the focal length of the compound lens system. To verify the feasibility and practicability of this design, a prototype of our compound lens system is fabricated in experiment. Our proposed compound lens shows that its zoom ability reaches 9.28 mm when the current ranges from -0.20 A to 0.21 A.展开更多
Different from other normal modes of the Earth’s free oscillation that depend on all the six components(M_(rr),M_(tt),M_(pp),M_(rt),M_(rp),and M_(tp))of the centroid moment tensor,the amplitudes of the radial modes d...Different from other normal modes of the Earth’s free oscillation that depend on all the six components(M_(rr),M_(tt),M_(pp),M_(rt),M_(rp),and M_(tp))of the centroid moment tensor,the amplitudes of the radial modes depend on the M_(rr)component(e.g.,scalar moment(M_(0)),dip(δ),and slip(λ))and hypocenter depth of the focal mechanism,and hence can be easily used to constrain these parameters of the focal mechanism.In this study,we use the superconducting gravimeter(SG)records after the 2011 Tohoku earthquake to analyze the radial modes_(0)S_(0)and_(1)S_(0).Based on the solutions of the focal mechanism provided by the GCMT and USGS,we can obtain the theoretical amplitudes of these two radial modes.Comparing the theoretical amplitudes with the observation amplitudes,it is found that there are obvious differences between the former and the latter,which means that the GCMT and USGS focal mechanisms cannot well represent the real focal mechanism of the 2011 event.Taking the GCMT solution as a reference and changing the depth and the three parameters of the M_(rr)moment,the scalar moment(M_(0))and the dip(δ)have significant influences,but the effects of the slip(λ)and the depth are minor.After comparisons,we provide a new constraint(M_(0)=5.8±0.09×10^(22)N·m,δ=10.1±0.08°,λ=88°,and depth=20 km)for the focal mechanism of the 2011 event.In addition,we further determine the center frequency(1.631567±2.6e^(-6)mHz)and quality factor(2046.4±50.1)of the_(1)S_(0)mode.展开更多
This paper proposes a new network structure,namely the ProNet network.Retinal medical image segmentation can help clinical diagnosis of related eye diseases and is essential for subsequent rational treatment.The basel...This paper proposes a new network structure,namely the ProNet network.Retinal medical image segmentation can help clinical diagnosis of related eye diseases and is essential for subsequent rational treatment.The baseline model of the ProNet network is UperNet(Unified perceptual parsing Network),and the backbone network is ConvNext(Convolutional Network).A network structure based on depth-separable convolution and 1×1 convolution is used,which has good performance and robustness.We further optimise ProNet mainly in two aspects.One is data enhancement using increased noise and slight angle rotation,which can significantly increase the diversity of data and help the model better learn the patterns and features of the data and improve the model’s performance.Meanwhile,it can effectively expand the training data set,reduce the influence of noise and abnormal data in the data set on the model,and improve the accuracy and reliability of the model.Another is the loss function aspect,and we finally use the focal loss function.The focal loss function is well suited for complex tasks such as object detection.The function will penalise the loss carried by samples that the model misclassifies,thus enabling better training of the model to avoid these errors while solving the category imbalance problem as a way to improve image segmentation density and segmentation accuracy.From the experimental results,the evaluation metrics mIoU(mean Intersection over Union)enhanced by 4.47%,and mDice enhanced by 2.92% compared to the baseline network.Better generalization effects and more accurate image segmentation are achieved.展开更多
A medium wave(MW)640×512(25μm)Mercury Cadmium Telluride(HgCdTe)polarimetric focal plane array(FPA)was demonstrated.The micro-polarizer array(MPA)has been carefully designed in terms of line grating structure opt...A medium wave(MW)640×512(25μm)Mercury Cadmium Telluride(HgCdTe)polarimetric focal plane array(FPA)was demonstrated.The micro-polarizer array(MPA)has been carefully designed in terms of line grating structure optimization and crosstalk suppression.A monolithic fabrication process with low damage was explored,which was verified to be compatible well with HgCdTe devices.After monolithic integration of MPA,NETD<9.5 mK was still maintained.Furthermore,to figure out the underlying mechanism that dominat⁃ed the extinction ratio(ER),specialized MPA layouts were designed,and the crosstalk was experimentally vali⁃dated as the major source that impacted ER.By expanding opaque regions at pixel edges to 4μm,crosstalk rates from adjacent pixels could be effectively reduced to approximately 2%,and promising ERs ranging from 17.32 to 27.41 were implemented.展开更多
Road traffic safety can decrease when drivers drive in a low-visibility environment.The application of visual perception technology to detect vehicles and pedestrians in infrared images proves to be an effective means...Road traffic safety can decrease when drivers drive in a low-visibility environment.The application of visual perception technology to detect vehicles and pedestrians in infrared images proves to be an effective means of reducing the risk of accidents.To tackle the challenges posed by the low recognition accuracy and the substan-tial computational burden associated with current infrared pedestrian-vehicle detection methods,an infrared pedestrian-vehicle detection method A proposal is presented,based on an enhanced version of You Only Look Once version 5(YOLOv5).First,A head specifically designed for detecting small targets has been integrated into the model to make full use of shallow feature information to enhance the accuracy in detecting small targets.Second,the Focal Generalized Intersection over Union(GIoU)is employed as an alternative to the original loss function to address issues related to target overlap and category imbalance.Third,the distribution shift convolution optimization feature extraction operator is used to alleviate the computational burden of the model without significantly compromising detection accuracy.The test results of the improved algorithm show that its average accuracy(mAP)reaches 90.1%.Specifically,the Giga Floating Point Operations Per second(GFLOPs)of the improved algorithm is only 9.1.In contrast,the improved algorithms outperformed the other algorithms on similar GFLOPs,such as YOLOv6n(11.9),YOLOv8n(8.7),YOLOv7t(13.2)and YOLOv5s(16.0).The mAPs that are 4.4%,3%,3.5%,and 1.7%greater than those of these algorithms show that the improved algorithm achieves higher accuracy in target detection tasks under similar computational resource overhead.On the other hand,compared with other algorithms such as YOLOv8l(91.1%),YOLOv6l(89.5%),YOLOv7(90.8%),and YOLOv3(90.1%),the improved algorithm needs only 5.5%,2.3%,8.6%,and 2.3%,respectively,of the GFLOPs.The improved algorithm has shown significant advancements in balancing accuracy and computational efficiency,making it promising for practical use in resource-limited scenarios.展开更多
文摘为了提高多视图深度估计结果精度,提出一种基于自适应空间特征增强的多视图深度估计算法。设计了由改进后的特征金字塔网络(feature pyramid network,FPN)和自适应空间特征增强(adaptive space feature enhancement,ASFE)组成的多尺度特征提取模块,获取到具有全局上下文信息和位置信息的多尺度特征图像。通过残差学习网络对深度图进行优化,防止多次卷积操作出现重建边缘模糊的问题。通过分类的思想构建focal loss函数增强网络模型的判断能力。由实验结果可知,该算法在DTU(technical university of denmark)数据集上和CasMVSNet(Cascade MVSNet)算法相比,在整体精度误差、运行时间、显存资源占用上分别降低了14.08%、72.15%、4.62%。在Tanks and Temples数据集整体评价指标Mean上该模型优于其他算法,证明提出的基于自适应空间特征增强的多视图深度估计算法的有效性。
文摘为解决自然条件下人脸表情识别易受角度、光线、遮挡物的影响以及人脸表情数据集各类表情数量不均衡等问题,提出基于Res2Net的人脸表情识别方法。使用Res2Net50作为特征提取的主干网络,在预处理阶段对图像随机翻转、缩放、裁剪进行数据增强,提升模型的泛化性。引入广义平均池化(generalized mean pooling, GeM)方式,关注图像中比较显著的区域,增强模型的鲁棒性;选用Focal Loss损失函数,针对表情类别不平衡和错误分类问题,提高较难识别表情的识别率。该方法在FER2013数据集上准确率达到了70.41%,相较于原Res2Net50网络提高了1.53%。结果表明,在自然条件下对人脸表情识别具有更好的准确性。
文摘BACKGROUND In hepatology,the clinical use of endoscopic ultrasound(EUS)has experienced a notable increase in recent times.These applications range from the diagnosis to the treatment of various liver diseases.Therefore,this systematic review summarizes the evidence for the diagnostic and therapeutic roles of EUS in liver diseases.AIM To examine and summarize the current available evidence of the possible roles of the EUS in making a suitable diagnosis in liver diseases as well as the therapeutic accuracy and efficacy.METHODS PubMed,Medline,Cochrane Library,Web of Science,and Google Scholar databases were extensively searched until October 2023.The methodological quality of the eligible articles was assessed using the Newcastle-Ottawa scale or Cochrane Risk of Bias tool.In addition,statistical analyses were performed using the Comprehensive Meta-Analysis software.RESULTS Overall,45 articles on EUS were included(28 on diagnostic role and 17 on therapeutic role).Pooled analysis demonstrated that EUS diagnostic tests had an accuracy of 92.4%for focal liver lesions(FLL)and 96.6%for parenchymal liver diseases.EUS-guided liver biopsies with either fine needle aspiration or fine needle biopsy had low complication rates when sampling FLL and parenchymal liver diseases(3.1%and 8.7%,respectively).Analysis of data from four studies showed that EUS-guided liver abscess had high clinical(90.7%)and technical success(90.7%)without significant complications.Similarly,EUS-guided interventions for the treatment of gastric varices(GV)have high technical success(98%)and GV obliteration rate(84%)with few complications(15%)and rebleeding events(17%).CONCLUSION EUS in liver diseases is a promising technique with the potential to be considered a first-line therapeutic and diagnostic option in selected cases.
基金supported by China Earthquake Administration Science for Earthquake Resilience(XH23050YB)Natural Science Foundation of China(42304072).
文摘On December 18,2023,the Jishishan area in Gansu Province was jolted by a M_(S) 6.2 earthquake,which is the most powerful seismic event that occurred throughout the year in China.The earthquake occurred along the NWtrending Lajishan fault(LJSF),a large tectonic transformation zone.After this event,China Earthquake Networks Center(CENC)has timely published several reports about seismic sources for emergency responses.The earthquake early warning system issued the first alert 4.9 s after the earthquake occurrence,providing prompt notification that effectively mitigated panics,injuries,and deaths of residents.The near real-time focal mechanism solution indicates that this earthquake is associated with a thrust fault.The distribution of aftershocks,the rupture process,and the recorded amplitudes from seismic monitoring and GNSS stations,all suggest that the mainshock rupture predominately propagates to the northwest direction.The duration of the rupture process is~12 s,and the largest slip is located at approximately 6.3 km to the NNW from the epicenter,with a peak slip of 0.12 m at~8 km depth.Seismic station N0028 recorded the highest instrumental intensity,which is 9.4 on the Mercalli scale.The estimated intensity map shows a seismic intensity reaching up to IX near the rupture area,consistent with field survey results.The aftershocks(up to December 22,2023)are mostly distributed in the northwest direction within~20 km of the epicenter.This earthquake caused serious casualties and house collapses,which requires further investigations into the impact of this earthquake.
文摘The M6.2 earthquake in Jishishan,Gansu Province,on December 18,2023,caused extraordinary earthquake disasters.It was located in the northern part of the north−south seismic zone,which is a key area for earthquake monitoring in China.The newly built dense strong motion stations in this area provide unprecedented conditions for high-precision earthquake relocation,especially the earthquake focal depth.This paper uses the newly built strong motion and traditional broadband seismic networks to relocate the source locations of the M3.0 and above aftershocks and to invert their focal mechanisms.The horizontal error of earthquake location is estimated to be 0.5−1 km,and the vertical error is 1−2 km.The focal depth range of aftershocks is 9.6−14.6 km,distributed in a 12-km-long strip with SSE direction.Aftershocks in the south are more concentrated horizontally and vertically,while aftershocks in the north are more scattered.The focal mechanisms of the main shock and aftershocks are relatively consistent,and the P-axis orientation is consistent with the regional strain direction.There is a seismic blank area of M3.0 and above,about 3−5 km between the main shock and aftershocks.It is suggested that the energy released by the main shock rupture is concentrated in this area.Based on the earthquake location and focal mechanism of the main shock,it is inferred that the Northern Lajishan fault zone is the seismogenic structure of the main shock,and the main shock did not occur on the main fault,but on a secondary fault.The initial rupture depth and centroid depth of the main shock were 12.8 and 14.0 km,respectively.The source rupture depth may not be the main reason for the severe earthquake disaster.
文摘A compound varifocal lens based on electromagnetic drive technology is designed and fabricated, where the polydimethylsiloxane(PDMS) film acts as a driving component, while the PDMS biconvex lens and the plane-concave lens form a coaxial compound lens system. The plane-concave lens equipped with driving coils is installed directly above the PDMS lens surrounded by the annular magnet. When different currents are applied, the annular magnet moves up and down, driving the PDMS film to undergo elastic deformation, and then resulting in longitudinal movement of the PDMS lens. The position change of the PDMS lens changes the focal length of the compound lens system. To verify the feasibility and practicability of this design, a prototype of our compound lens system is fabricated in experiment. Our proposed compound lens shows that its zoom ability reaches 9.28 mm when the current ranges from -0.20 A to 0.21 A.
基金supported by the National Natural Science Foundation of China(Grants:41974022 and 42192531)the Educational Commission of Hubei Province of China(Grant:2020CFA109)+1 种基金the Special Fund of Hubei Luojia Laboratory(grant#220100002)Key Laboratory of Geospace Environment and Geodesy,Ministry of Education,Wuhan University(grant#210204)。
文摘Different from other normal modes of the Earth’s free oscillation that depend on all the six components(M_(rr),M_(tt),M_(pp),M_(rt),M_(rp),and M_(tp))of the centroid moment tensor,the amplitudes of the radial modes depend on the M_(rr)component(e.g.,scalar moment(M_(0)),dip(δ),and slip(λ))and hypocenter depth of the focal mechanism,and hence can be easily used to constrain these parameters of the focal mechanism.In this study,we use the superconducting gravimeter(SG)records after the 2011 Tohoku earthquake to analyze the radial modes_(0)S_(0)and_(1)S_(0).Based on the solutions of the focal mechanism provided by the GCMT and USGS,we can obtain the theoretical amplitudes of these two radial modes.Comparing the theoretical amplitudes with the observation amplitudes,it is found that there are obvious differences between the former and the latter,which means that the GCMT and USGS focal mechanisms cannot well represent the real focal mechanism of the 2011 event.Taking the GCMT solution as a reference and changing the depth and the three parameters of the M_(rr)moment,the scalar moment(M_(0))and the dip(δ)have significant influences,but the effects of the slip(λ)and the depth are minor.After comparisons,we provide a new constraint(M_(0)=5.8±0.09×10^(22)N·m,δ=10.1±0.08°,λ=88°,and depth=20 km)for the focal mechanism of the 2011 event.In addition,we further determine the center frequency(1.631567±2.6e^(-6)mHz)and quality factor(2046.4±50.1)of the_(1)S_(0)mode.
文摘This paper proposes a new network structure,namely the ProNet network.Retinal medical image segmentation can help clinical diagnosis of related eye diseases and is essential for subsequent rational treatment.The baseline model of the ProNet network is UperNet(Unified perceptual parsing Network),and the backbone network is ConvNext(Convolutional Network).A network structure based on depth-separable convolution and 1×1 convolution is used,which has good performance and robustness.We further optimise ProNet mainly in two aspects.One is data enhancement using increased noise and slight angle rotation,which can significantly increase the diversity of data and help the model better learn the patterns and features of the data and improve the model’s performance.Meanwhile,it can effectively expand the training data set,reduce the influence of noise and abnormal data in the data set on the model,and improve the accuracy and reliability of the model.Another is the loss function aspect,and we finally use the focal loss function.The focal loss function is well suited for complex tasks such as object detection.The function will penalise the loss carried by samples that the model misclassifies,thus enabling better training of the model to avoid these errors while solving the category imbalance problem as a way to improve image segmentation density and segmentation accuracy.From the experimental results,the evaluation metrics mIoU(mean Intersection over Union)enhanced by 4.47%,and mDice enhanced by 2.92% compared to the baseline network.Better generalization effects and more accurate image segmentation are achieved.
基金Supported by the self-funded project of Kunming Institute of Physics。
文摘A medium wave(MW)640×512(25μm)Mercury Cadmium Telluride(HgCdTe)polarimetric focal plane array(FPA)was demonstrated.The micro-polarizer array(MPA)has been carefully designed in terms of line grating structure optimization and crosstalk suppression.A monolithic fabrication process with low damage was explored,which was verified to be compatible well with HgCdTe devices.After monolithic integration of MPA,NETD<9.5 mK was still maintained.Furthermore,to figure out the underlying mechanism that dominat⁃ed the extinction ratio(ER),specialized MPA layouts were designed,and the crosstalk was experimentally vali⁃dated as the major source that impacted ER.By expanding opaque regions at pixel edges to 4μm,crosstalk rates from adjacent pixels could be effectively reduced to approximately 2%,and promising ERs ranging from 17.32 to 27.41 were implemented.
文摘Road traffic safety can decrease when drivers drive in a low-visibility environment.The application of visual perception technology to detect vehicles and pedestrians in infrared images proves to be an effective means of reducing the risk of accidents.To tackle the challenges posed by the low recognition accuracy and the substan-tial computational burden associated with current infrared pedestrian-vehicle detection methods,an infrared pedestrian-vehicle detection method A proposal is presented,based on an enhanced version of You Only Look Once version 5(YOLOv5).First,A head specifically designed for detecting small targets has been integrated into the model to make full use of shallow feature information to enhance the accuracy in detecting small targets.Second,the Focal Generalized Intersection over Union(GIoU)is employed as an alternative to the original loss function to address issues related to target overlap and category imbalance.Third,the distribution shift convolution optimization feature extraction operator is used to alleviate the computational burden of the model without significantly compromising detection accuracy.The test results of the improved algorithm show that its average accuracy(mAP)reaches 90.1%.Specifically,the Giga Floating Point Operations Per second(GFLOPs)of the improved algorithm is only 9.1.In contrast,the improved algorithms outperformed the other algorithms on similar GFLOPs,such as YOLOv6n(11.9),YOLOv8n(8.7),YOLOv7t(13.2)and YOLOv5s(16.0).The mAPs that are 4.4%,3%,3.5%,and 1.7%greater than those of these algorithms show that the improved algorithm achieves higher accuracy in target detection tasks under similar computational resource overhead.On the other hand,compared with other algorithms such as YOLOv8l(91.1%),YOLOv6l(89.5%),YOLOv7(90.8%),and YOLOv3(90.1%),the improved algorithm needs only 5.5%,2.3%,8.6%,and 2.3%,respectively,of the GFLOPs.The improved algorithm has shown significant advancements in balancing accuracy and computational efficiency,making it promising for practical use in resource-limited scenarios.