Deep neural network-based relational extraction research has made significant progress in recent years,andit provides data support for many natural language processing downstream tasks such as building knowledgegraph,...Deep neural network-based relational extraction research has made significant progress in recent years,andit provides data support for many natural language processing downstream tasks such as building knowledgegraph,sentiment analysis and question-answering systems.However,previous studies ignored much unusedstructural information in sentences that could enhance the performance of the relation extraction task.Moreover,most existing dependency-based models utilize self-attention to distinguish the importance of context,whichhardly deals withmultiple-structure information.To efficiently leverage multiple structure information,this paperproposes a dynamic structure attention mechanism model based on textual structure information,which deeplyintegrates word embedding,named entity recognition labels,part of speech,dependency tree and dependency typeinto a graph convolutional network.Specifically,our model extracts text features of different structures from theinput sentence.Textual Structure information Graph Convolutional Networks employs the dynamic structureattention mechanism to learn multi-structure attention,effectively distinguishing important contextual features invarious structural information.In addition,multi-structure weights are carefully designed as amergingmechanismin the different structure attention to dynamically adjust the final attention.This paper combines these featuresand trains a graph convolutional network for relation extraction.We experiment on supervised relation extractiondatasets including SemEval 2010 Task 8,TACRED,TACREV,and Re-TACED,the result significantly outperformsthe previous.展开更多
BACKGROUND The accurate classification of focal liver lesions(FLLs)is essential to properly guide treatment options and predict prognosis.Dynamic contrast-enhanced computed tomography(DCE-CT)is still the cornerstone i...BACKGROUND The accurate classification of focal liver lesions(FLLs)is essential to properly guide treatment options and predict prognosis.Dynamic contrast-enhanced computed tomography(DCE-CT)is still the cornerstone in the exact classification of FLLs due to its noninvasive nature,high scanning speed,and high-density resolution.Since their recent development,convolutional neural network-based deep learning techniques has been recognized to have high potential for image recognition tasks.AIM To develop and evaluate an automated multiphase convolutional dense network(MP-CDN)to classify FLLs on multiphase CT.METHODS A total of 517 FLLs scanned on a 320-detector CT scanner using a four-phase DCECT imaging protocol(including precontrast phase,arterial phase,portal venous phase,and delayed phase)from 2012 to 2017 were retrospectively enrolled.FLLs were classified into four categories:Category A,hepatocellular carcinoma(HCC);category B,liver metastases;category C,benign non-inflammatory FLLs including hemangiomas,focal nodular hyperplasias and adenomas;and category D,hepatic abscesses.Each category was split into a training set and test set in an approximate 8:2 ratio.An MP-CDN classifier with a sequential input of the fourphase CT images was developed to automatically classify FLLs.The classification performance of the model was evaluated on the test set;the accuracy and specificity were calculated from the confusion matrix,and the area under the receiver operating characteristic curve(AUC)was calculated from the SoftMax probability outputted from the last layer of the MP-CDN.RESULTS A total of 410 FLLs were used for training and 107 FLLs were used for testing.The mean classification accuracy of the test set was 81.3%(87/107).The accuracy/specificity of distinguishing each category from the others were 0.916/0.964,0.925/0.905,0.860/0.918,and 0.925/0.963 for HCC,metastases,benign non-inflammatory FLLs,and abscesses on the test set,respectively.The AUC(95%confidence interval)for differentiating each category from the others was 0.92(0.837-0.992),0.99(0.967-1.00),0.88(0.795-0.955)and 0.96(0.914-0.996)for HCC,metastases,benign non-inflammatory FLLs,and abscesses on the test set,respectively.CONCLUSION MP-CDN accurately classified FLLs detected on four-phase CT as HCC,metastases,benign non-inflammatory FLLs and hepatic abscesses and may assist radiologists in identifying the different types of FLLs.展开更多
Latent information is difficult to get from the text in speech synthesis.Studies show that features from speech can get more information to help text encoding.In the field of speech encoding,a lot of work has been con...Latent information is difficult to get from the text in speech synthesis.Studies show that features from speech can get more information to help text encoding.In the field of speech encoding,a lot of work has been conducted on two aspects.The first aspect is to encode speech frame by frame.The second aspect is to encode the whole speech to a vector.But the scale in these aspects is fixed.So,encoding speech with an adjustable scale for more latent information is worthy of investigation.But current alignment approaches only support frame-by-frame encoding and speech-to-vector encoding.It remains a challenge to propose a new alignment approach to support adjustable scale speech encoding.This paper presents the dynamic speech encoder with a new alignment approach in conjunction with frame-by-frame encoding and speech-to-vector encoding.The speech feature fromourmodel achieves three functions.First,the speech feature can reconstruct the origin speech while the length of the speech feature is equal to the text length.Second,our model can get text embedding fromspeech,and the encoded speech feature is similar to the text embedding result.Finally,it can transfer the style of synthesis speech and make it more similar to the given reference speech.展开更多
Impact dynamics of flexible solids is important in engineering practice. Obtaining its dynamic response is a challenging task and usually achieved by numerical methods. The objectives of the study are twofold. Firstly...Impact dynamics of flexible solids is important in engineering practice. Obtaining its dynamic response is a challenging task and usually achieved by numerical methods. The objectives of the study are twofold. Firstly, the discrete singular convolution (DSC) is used for the first time to analyze the impact dynamics. Secondly, the efficiency of various numerical methods for dynamic analysis is explored via an example of a flexible rod hit by a rigid ball. Three numerical methods, including the conventional finite element (FE) method, the DSC algorithm, and the spectral finite element (SFE) method, and one proposed modeling strategy, the improved spectral finite element (ISFE) method, are involved. Numerical results are compared with the known analytical solutions to show their efficiency. It is demonstrated that the proposed ISFE modeling strategy with a proper length of con- ventional FE yields the most accurate contact stress among the four investigated models. It is also found that the DSC algorithm is an alternative method for collision problems.展开更多
Venanico-Filho et al. developed an elegant matrix formulation for dynamic analysis by frequency domain (FD), but the convergence, causality and extended period need further refining. In the present paper, it was arg...Venanico-Filho et al. developed an elegant matrix formulation for dynamic analysis by frequency domain (FD), but the convergence, causality and extended period need further refining. In the present paper, it was argued that: (1) under reasonable assumptions (approximating the frequency response function by the discrete Fourier transform of the discretized unitary impulse response function), the matrix formulation by FD is equivalent to a circular convolution; (2) to avoid the wraparound interference, the excitation vector and impulse response must be padded with enough zeros; (3) provided that the zero padding requirement satisfied, the convergence and accuracy of direct time domain analysis, which is equivalent to that by FD, are guaranteed by the numerical integration scheme; (4) the imaginary part of the computational response approaching zero is due to the continuity of the impulse response functions.展开更多
Low dynamic range(LDR)images captured by consumer cameras have a limited luminance range.As the conventional method for generating high dynamic range(HDR)images involves merging multiple-exposure LDR images of the sam...Low dynamic range(LDR)images captured by consumer cameras have a limited luminance range.As the conventional method for generating high dynamic range(HDR)images involves merging multiple-exposure LDR images of the same scene(assuming a stationary scene),we introduce a learning-based model for single-image HDR reconstruction.An input LDR image is sequentially segmented into the local region maps based on the cumulative histogram of the input brightness distribution.Using the local region maps,SParam-Net estimates the parameters of an inverse tone mapping function to generate a pseudo-HDR image.We process the segmented region maps as the input sequences on long short-term memory.Finally,a fast super-resolution convolutional neural network is used for HDR image reconstruction.The proposed method was trained and tested on datasets including HDR-Real,LDR-HDR-pair,and HDR-Eye.The experimental results revealed that HDR images can be generated more reliably than using contemporary end-to-end approaches.展开更多
Much attention has been given to the Internet of Things (IoT) by citizens, industries, governments, and universities for applications like smart buildings, environmental monitoring, health care and so on. With IoT, ...Much attention has been given to the Internet of Things (IoT) by citizens, industries, governments, and universities for applications like smart buildings, environmental monitoring, health care and so on. With IoT, networkconnectivity is facilitated between smart devices from anyplace and anytime.IoT-based health monitoring systems are gaining popularity and acceptance forcontinuous monitoring and detect health abnormalities from the data collected.Electrocardiographic (ECG) signals are widely used for heart diseases detection.A novel method has been proposed in this work for ECG monitoring using IoTtechniques. In this work, a two-stage approach is employed. In the first stage, arouting protocol based on Dynamic Source Routing (DSR) and Routing byEnergy and Link quality (REL) for IoT healthcare platform is proposed for effi-cient data collection, and in the second stage, classification of ECG for Arrhythmia. Furthermore, this work has evaluated Support Vector Machine (SVM),Artificial Neural Network (ANN), and Convolution Neural Networks (CNNs)-based approach for ECG signals classification. Deep-ECG will use a deep CNNto extract critical features and then compare through evaluation of simple and fastdistance functions in order to obtain an efficient classification of heart abnormalities. For the identification of abnormal data, this work has proposed techniquesfor the classification of ECG data, which has been obtained from mobile watchusers. For experimental verification of the proposed methods, the Beth Israel Hospital (MIT/BIH) Arrhythmia and Massachusetts Institute of Technology (MIT)Database was used for evaluation. Results confirm the presented method’s superior performance with regards to the accuracy of classification. The CNN achievedan accuracy of 91.92% and has a higher accuracy of 4.98% for the SVM and2.68% for the ANN.展开更多
动态网络链路预测广泛的应用前景,使得其逐渐成为网络科学研究的热点.动态网络链路演化过程中具有复杂的空间相关性和时间依赖性,导致其链路预测任务极具挑战.提出一个基于时序图卷积的动态网络链路预测模型(dynamic network link predi...动态网络链路预测广泛的应用前景,使得其逐渐成为网络科学研究的热点.动态网络链路演化过程中具有复杂的空间相关性和时间依赖性,导致其链路预测任务极具挑战.提出一个基于时序图卷积的动态网络链路预测模型(dynamic network link prediction based on sequential graph convolution, DNLP-SGC).针对网络快照序列不能有效反映动态网络连续性的问题,采用边缘触发机制对原始网络权重矩阵进行修正,弥补了离散快照表示动态网络存在时序信息丢失的不足.从网络演化过程出发,综合考虑节点间的特征相似性以及历史交互信息,采用时序图卷积提取动态网络中节点的特征,该方法融合了节点时空依赖关系.进一步,采用因果卷积网络捕获网络演化过程中潜在的全局时序特征,实现动态网络链路预测.在2个真实的网络数据集上的实验结果表明,DNLP-SGC在precision, recall, AUC指标上均优于对比的基线模型.展开更多
针对单目3D目标检测在视角变化引起的物体大小变化以及物体遮挡等情况下效果不佳的问题,提出一种融合深度信息和实例分割掩码的新型单目3D目标检测方法。首先,通过深度-掩码注意力融合(DMAF)模块,将深度信息与实例分割掩码结合,以提供...针对单目3D目标检测在视角变化引起的物体大小变化以及物体遮挡等情况下效果不佳的问题,提出一种融合深度信息和实例分割掩码的新型单目3D目标检测方法。首先,通过深度-掩码注意力融合(DMAF)模块,将深度信息与实例分割掩码结合,以提供更准确的物体边界;其次,引入动态卷积,并利用DMAF模块得到的融合特征引导动态卷积核的生成,以处理不同尺度的物体;再次,在损失函数中引入2D-3D边界框一致性损失函数,调整预测的3D边界框与对应的2D检测框高度一致,以提高实例分割和3D目标检测任务的效果;最后,通过消融实验验证该方法的有效性,并在KITTI测试集上对该方法进行验证。实验结果表明,与仅使用深度估计图和实例分割掩码的方法相比,在中等难度下对车辆类别检测的平均精度提高了6.36个百分点,且3D目标检测和鸟瞰图目标检测任务的效果均优于D4LCN(Depth-guided Dynamic-Depthwise-Dilated Local Convolutional Network)、M3D-RPN(Monocular 3D Region Proposal Network)等对比方法。展开更多
胚蛋活性检测对疫苗生产的质量与安全至关重要。传统机器视觉检测方法严重依赖人为设计的特征提取算法,对图像质量和环境条件要求高,检测结果稳定性和容错性差,导致实际检测过程中的通用性受到限制,为克服这种缺陷,该研究提出一种基于改...胚蛋活性检测对疫苗生产的质量与安全至关重要。传统机器视觉检测方法严重依赖人为设计的特征提取算法,对图像质量和环境条件要求高,检测结果稳定性和容错性差,导致实际检测过程中的通用性受到限制,为克服这种缺陷,该研究提出一种基于改进YOLOv8的疫苗胚蛋活性检测模型。采用自主设计图像采集装置,采集孵化10~11 d的胚蛋图像,通过几何变换、颜色调整、图像增强等方式构建并扩充数据集;采用Shuffle Net V2替换YOLOv8模型的骨干网络,在保持准确率的同时显著减少了计算复杂度,能更好地部署到嵌入式设备中;在YOLOv8颈部网络的卷积层后添加动态蛇形卷积层,通过其自适应地聚焦于细长和迂回的局部结构,准确地捕捉管状结构的性质特征,从而提高胚蛋检测的准确率;使用EIOU(embedding intersection over union)损失函数,用于适应研究中边界框对齐和形状相似的场景,构建了符合试验中胚蛋图像的网络模型,以实现疫苗胚蛋活性快速、无损、批量检测。试验结果表明,改进YOLOv8模型精确率、召回率、平均精度均值(m AP_(50-95))分别达99.2%、98.2%、96.9%,对比原始YOLOv8模型分别提高了2.0、0.3、1.5个百分点,模型计算复杂度与推理时间相较与原模型分别降低60.9%、60.5%。说明此模型可以更好地实现疫苗胚蛋活性无损检测,为自动化批量检测提供理论依据。展开更多
文摘Deep neural network-based relational extraction research has made significant progress in recent years,andit provides data support for many natural language processing downstream tasks such as building knowledgegraph,sentiment analysis and question-answering systems.However,previous studies ignored much unusedstructural information in sentences that could enhance the performance of the relation extraction task.Moreover,most existing dependency-based models utilize self-attention to distinguish the importance of context,whichhardly deals withmultiple-structure information.To efficiently leverage multiple structure information,this paperproposes a dynamic structure attention mechanism model based on textual structure information,which deeplyintegrates word embedding,named entity recognition labels,part of speech,dependency tree and dependency typeinto a graph convolutional network.Specifically,our model extracts text features of different structures from theinput sentence.Textual Structure information Graph Convolutional Networks employs the dynamic structureattention mechanism to learn multi-structure attention,effectively distinguishing important contextual features invarious structural information.In addition,multi-structure weights are carefully designed as amergingmechanismin the different structure attention to dynamically adjust the final attention.This paper combines these featuresand trains a graph convolutional network for relation extraction.We experiment on supervised relation extractiondatasets including SemEval 2010 Task 8,TACRED,TACREV,and Re-TACED,the result significantly outperformsthe previous.
基金Supported by National Natural Science Foundation of China,No.91959118Science and Technology Program of Guangzhou,China,No.201704020016+1 种基金SKY Radiology Department International Medical Research Foundation of China,No.Z-2014-07-1912-15Clinical Research Foundation of the 3rd Affiliated Hospital of Sun Yat-Sen University,No.YHJH201901.
文摘BACKGROUND The accurate classification of focal liver lesions(FLLs)is essential to properly guide treatment options and predict prognosis.Dynamic contrast-enhanced computed tomography(DCE-CT)is still the cornerstone in the exact classification of FLLs due to its noninvasive nature,high scanning speed,and high-density resolution.Since their recent development,convolutional neural network-based deep learning techniques has been recognized to have high potential for image recognition tasks.AIM To develop and evaluate an automated multiphase convolutional dense network(MP-CDN)to classify FLLs on multiphase CT.METHODS A total of 517 FLLs scanned on a 320-detector CT scanner using a four-phase DCECT imaging protocol(including precontrast phase,arterial phase,portal venous phase,and delayed phase)from 2012 to 2017 were retrospectively enrolled.FLLs were classified into four categories:Category A,hepatocellular carcinoma(HCC);category B,liver metastases;category C,benign non-inflammatory FLLs including hemangiomas,focal nodular hyperplasias and adenomas;and category D,hepatic abscesses.Each category was split into a training set and test set in an approximate 8:2 ratio.An MP-CDN classifier with a sequential input of the fourphase CT images was developed to automatically classify FLLs.The classification performance of the model was evaluated on the test set;the accuracy and specificity were calculated from the confusion matrix,and the area under the receiver operating characteristic curve(AUC)was calculated from the SoftMax probability outputted from the last layer of the MP-CDN.RESULTS A total of 410 FLLs were used for training and 107 FLLs were used for testing.The mean classification accuracy of the test set was 81.3%(87/107).The accuracy/specificity of distinguishing each category from the others were 0.916/0.964,0.925/0.905,0.860/0.918,and 0.925/0.963 for HCC,metastases,benign non-inflammatory FLLs,and abscesses on the test set,respectively.The AUC(95%confidence interval)for differentiating each category from the others was 0.92(0.837-0.992),0.99(0.967-1.00),0.88(0.795-0.955)and 0.96(0.914-0.996)for HCC,metastases,benign non-inflammatory FLLs,and abscesses on the test set,respectively.CONCLUSION MP-CDN accurately classified FLLs detected on four-phase CT as HCC,metastases,benign non-inflammatory FLLs and hepatic abscesses and may assist radiologists in identifying the different types of FLLs.
基金supported by National Key R&D Program of China (2020AAA0107901).
文摘Latent information is difficult to get from the text in speech synthesis.Studies show that features from speech can get more information to help text encoding.In the field of speech encoding,a lot of work has been conducted on two aspects.The first aspect is to encode speech frame by frame.The second aspect is to encode the whole speech to a vector.But the scale in these aspects is fixed.So,encoding speech with an adjustable scale for more latent information is worthy of investigation.But current alignment approaches only support frame-by-frame encoding and speech-to-vector encoding.It remains a challenge to propose a new alignment approach to support adjustable scale speech encoding.This paper presents the dynamic speech encoder with a new alignment approach in conjunction with frame-by-frame encoding and speech-to-vector encoding.The speech feature fromourmodel achieves three functions.First,the speech feature can reconstruct the origin speech while the length of the speech feature is equal to the text length.Second,our model can get text embedding fromspeech,and the encoded speech feature is similar to the text embedding result.Finally,it can transfer the style of synthesis speech and make it more similar to the given reference speech.
基金Supported by the National Natural Science Foundation of China(50830201)the Priority Academic Program Development of Jiangsu Higher Education Institutions~~
文摘Impact dynamics of flexible solids is important in engineering practice. Obtaining its dynamic response is a challenging task and usually achieved by numerical methods. The objectives of the study are twofold. Firstly, the discrete singular convolution (DSC) is used for the first time to analyze the impact dynamics. Secondly, the efficiency of various numerical methods for dynamic analysis is explored via an example of a flexible rod hit by a rigid ball. Three numerical methods, including the conventional finite element (FE) method, the DSC algorithm, and the spectral finite element (SFE) method, and one proposed modeling strategy, the improved spectral finite element (ISFE) method, are involved. Numerical results are compared with the known analytical solutions to show their efficiency. It is demonstrated that the proposed ISFE modeling strategy with a proper length of con- ventional FE yields the most accurate contact stress among the four investigated models. It is also found that the DSC algorithm is an alternative method for collision problems.
文摘Venanico-Filho et al. developed an elegant matrix formulation for dynamic analysis by frequency domain (FD), but the convergence, causality and extended period need further refining. In the present paper, it was argued that: (1) under reasonable assumptions (approximating the frequency response function by the discrete Fourier transform of the discretized unitary impulse response function), the matrix formulation by FD is equivalent to a circular convolution; (2) to avoid the wraparound interference, the excitation vector and impulse response must be padded with enough zeros; (3) provided that the zero padding requirement satisfied, the convergence and accuracy of direct time domain analysis, which is equivalent to that by FD, are guaranteed by the numerical integration scheme; (4) the imaginary part of the computational response approaching zero is due to the continuity of the impulse response functions.
基金This study was supported by the Basic Science Research Program through the National Research Foundation of Korea(NRF)funded by the Ministry of Education(NRF-2018R1D1A1B07049932).
文摘Low dynamic range(LDR)images captured by consumer cameras have a limited luminance range.As the conventional method for generating high dynamic range(HDR)images involves merging multiple-exposure LDR images of the same scene(assuming a stationary scene),we introduce a learning-based model for single-image HDR reconstruction.An input LDR image is sequentially segmented into the local region maps based on the cumulative histogram of the input brightness distribution.Using the local region maps,SParam-Net estimates the parameters of an inverse tone mapping function to generate a pseudo-HDR image.We process the segmented region maps as the input sequences on long short-term memory.Finally,a fast super-resolution convolutional neural network is used for HDR image reconstruction.The proposed method was trained and tested on datasets including HDR-Real,LDR-HDR-pair,and HDR-Eye.The experimental results revealed that HDR images can be generated more reliably than using contemporary end-to-end approaches.
文摘Much attention has been given to the Internet of Things (IoT) by citizens, industries, governments, and universities for applications like smart buildings, environmental monitoring, health care and so on. With IoT, networkconnectivity is facilitated between smart devices from anyplace and anytime.IoT-based health monitoring systems are gaining popularity and acceptance forcontinuous monitoring and detect health abnormalities from the data collected.Electrocardiographic (ECG) signals are widely used for heart diseases detection.A novel method has been proposed in this work for ECG monitoring using IoTtechniques. In this work, a two-stage approach is employed. In the first stage, arouting protocol based on Dynamic Source Routing (DSR) and Routing byEnergy and Link quality (REL) for IoT healthcare platform is proposed for effi-cient data collection, and in the second stage, classification of ECG for Arrhythmia. Furthermore, this work has evaluated Support Vector Machine (SVM),Artificial Neural Network (ANN), and Convolution Neural Networks (CNNs)-based approach for ECG signals classification. Deep-ECG will use a deep CNNto extract critical features and then compare through evaluation of simple and fastdistance functions in order to obtain an efficient classification of heart abnormalities. For the identification of abnormal data, this work has proposed techniquesfor the classification of ECG data, which has been obtained from mobile watchusers. For experimental verification of the proposed methods, the Beth Israel Hospital (MIT/BIH) Arrhythmia and Massachusetts Institute of Technology (MIT)Database was used for evaluation. Results confirm the presented method’s superior performance with regards to the accuracy of classification. The CNN achievedan accuracy of 91.92% and has a higher accuracy of 4.98% for the SVM and2.68% for the ANN.
文摘动态网络链路预测广泛的应用前景,使得其逐渐成为网络科学研究的热点.动态网络链路演化过程中具有复杂的空间相关性和时间依赖性,导致其链路预测任务极具挑战.提出一个基于时序图卷积的动态网络链路预测模型(dynamic network link prediction based on sequential graph convolution, DNLP-SGC).针对网络快照序列不能有效反映动态网络连续性的问题,采用边缘触发机制对原始网络权重矩阵进行修正,弥补了离散快照表示动态网络存在时序信息丢失的不足.从网络演化过程出发,综合考虑节点间的特征相似性以及历史交互信息,采用时序图卷积提取动态网络中节点的特征,该方法融合了节点时空依赖关系.进一步,采用因果卷积网络捕获网络演化过程中潜在的全局时序特征,实现动态网络链路预测.在2个真实的网络数据集上的实验结果表明,DNLP-SGC在precision, recall, AUC指标上均优于对比的基线模型.
文摘针对单目3D目标检测在视角变化引起的物体大小变化以及物体遮挡等情况下效果不佳的问题,提出一种融合深度信息和实例分割掩码的新型单目3D目标检测方法。首先,通过深度-掩码注意力融合(DMAF)模块,将深度信息与实例分割掩码结合,以提供更准确的物体边界;其次,引入动态卷积,并利用DMAF模块得到的融合特征引导动态卷积核的生成,以处理不同尺度的物体;再次,在损失函数中引入2D-3D边界框一致性损失函数,调整预测的3D边界框与对应的2D检测框高度一致,以提高实例分割和3D目标检测任务的效果;最后,通过消融实验验证该方法的有效性,并在KITTI测试集上对该方法进行验证。实验结果表明,与仅使用深度估计图和实例分割掩码的方法相比,在中等难度下对车辆类别检测的平均精度提高了6.36个百分点,且3D目标检测和鸟瞰图目标检测任务的效果均优于D4LCN(Depth-guided Dynamic-Depthwise-Dilated Local Convolutional Network)、M3D-RPN(Monocular 3D Region Proposal Network)等对比方法。
文摘胚蛋活性检测对疫苗生产的质量与安全至关重要。传统机器视觉检测方法严重依赖人为设计的特征提取算法,对图像质量和环境条件要求高,检测结果稳定性和容错性差,导致实际检测过程中的通用性受到限制,为克服这种缺陷,该研究提出一种基于改进YOLOv8的疫苗胚蛋活性检测模型。采用自主设计图像采集装置,采集孵化10~11 d的胚蛋图像,通过几何变换、颜色调整、图像增强等方式构建并扩充数据集;采用Shuffle Net V2替换YOLOv8模型的骨干网络,在保持准确率的同时显著减少了计算复杂度,能更好地部署到嵌入式设备中;在YOLOv8颈部网络的卷积层后添加动态蛇形卷积层,通过其自适应地聚焦于细长和迂回的局部结构,准确地捕捉管状结构的性质特征,从而提高胚蛋检测的准确率;使用EIOU(embedding intersection over union)损失函数,用于适应研究中边界框对齐和形状相似的场景,构建了符合试验中胚蛋图像的网络模型,以实现疫苗胚蛋活性快速、无损、批量检测。试验结果表明,改进YOLOv8模型精确率、召回率、平均精度均值(m AP_(50-95))分别达99.2%、98.2%、96.9%,对比原始YOLOv8模型分别提高了2.0、0.3、1.5个百分点,模型计算复杂度与推理时间相较与原模型分别降低60.9%、60.5%。说明此模型可以更好地实现疫苗胚蛋活性无损检测,为自动化批量检测提供理论依据。