Physiological curve extraction of the human ear based on the improved YOLACT

Cited by: 2

Abstract: In applications such as human ear shape clustering, three-dimensional (3D) ear modeling, and custom-made headphones, the key physiological curves of the human ear and the positions of its key points must be located accurately. As an important biometric feature, the ear is also the subject of morphological analysis and classification, which is of considerable value to ear-related medical work. However, because the morphological structure of the ear is complex, it is difficult to establish a general standard for it. This study divides the ear into three regions (helix, antihelix, and concha) and performs instance segmentation and key physiological curve extraction on them. Traditional edge extraction methods are sensitive to variations in illumination and posture. Moreover, the color distribution within a single ear image is relatively uniform, so the transitions among the three regions may not be obvious, and traditional edge extraction methods adapt poorly when extracting the key physiological curves. To address this problem, this study proposes an improved YOLACT (You Only Look At CoefficienTs) instance segmentation model based on the ResNeSt backbone and a "screening mask" strategy, which improves the original YOLACT model in two respects: localization and segmentation. The ResNeSt-based YOLACT model was trained on labeled ear images from the USTB-Helloear image set. In the prediction stage, the original cropping mask strategy is replaced by the proposed screening mask strategy to preserve the integrity of the edges of the segmented regions.
These improvements increase the accuracy of curve detection and extraction, so the model can accurately segment the different regions of the ear and extract the key physiological curves. Compared with other methods, the proposed method achieves better segmentation accuracy on the test image set and is more robust to posture variations of the ear.
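The abstract does not say how the screening mask strategy works, so the sketch below illustrates only the baseline step it replaces. In the original YOLACT, an instance mask is assembled as the sigmoid of a linear combination of prototype masks and is then cropped to the predicted bounding box. The toy prototypes, coefficients, box, and all function names here are assumptions made for illustration; none of them come from the paper.

```python
import math

def assemble_mask(prototypes, coeffs):
    """Sigmoid of a per-pixel linear combination of prototype maps (YOLACT-style)."""
    height, width = len(prototypes[0]), len(prototypes[0][0])
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            logit = sum(c * p[y][x] for c, p in zip(coeffs, prototypes))
            row.append(1.0 / (1.0 + math.exp(-logit)))
        mask.append(row)
    return mask

def crop_to_box(mask, box):
    """Zero every pixel outside box = (x1, y1, x2, y2); x2 and y2 are exclusive."""
    x1, y1, x2, y2 = box
    return [[v if (x1 <= x < x2 and y1 <= y < y2) else 0.0
             for x, v in enumerate(row)]
            for y, row in enumerate(mask)]

def binarize(mask, threshold=0.5):
    """Turn mask probabilities into a 0/1 mask."""
    return [[1 if v > threshold else 0 for v in row] for row in mask]

def area(binary_mask):
    """Count foreground pixels."""
    return sum(sum(row) for row in binary_mask)

# Hypothetical toy data: a 4x6 image whose object covers rows 1-2, columns 1-4.
HEIGHT, WIDTH = 4, 6
proto_a = [[4.0 if (1 <= y <= 2 and 1 <= x <= 4) else -4.0 for x in range(WIDTH)]
           for y in range(HEIGHT)]
proto_b = [[1.0] * WIDTH for _ in range(HEIGHT)]
coeffs = [1.0, -0.5]  # hypothetical mask coefficients for one instance

full_mask = assemble_mask([proto_a, proto_b], coeffs)
full_binary = binarize(full_mask)

# A predicted box that is one column too narrow on the right side (assumed).
tight_box = (1, 1, 4, 3)
cropped_binary = binarize(crop_to_box(full_mask, tight_box))

print(area(full_binary))     # 8 object pixels before cropping
print(area(cropped_binary))  # 6 object pixels after cropping
```

In this toy case the slightly tight box removes two object pixels along the right edge. Truncation of this kind is consistent with the abstract's stated reason for dropping the cropping step, namely preserving the integrity of region edges.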

Authors: YUAN Li; XIA Tong; ZHANG Xiao-shuang (School of Automation, University of Science and Technology Beijing, Beijing 100083, China)

Affiliation: School of Automation, University of Science and Technology Beijing

Source: Chinese Journal of Engineering (工程科学学报); indexed in EI, CSCD, and the Peking University Core Journals list; 2022, No. 8, pp. 1386-1395 (10 pages)

Funding: National Natural Science Foundation of China (61472031).

Keywords: human ear; physiological curve extraction; instance segmentation; improved YOLACT; ResNeSt

Classification code: TP391.41 [Automation and Computer Technology: Computer Application Technology]

