期刊文献+

基于改进YOLACT实例分割网络的人耳关键生理曲线提取 被引量:2

Physiological curve extraction of the human ear based on the improved YOLACT
下载PDF
导出
摘要 在人耳形状聚类、3D人耳建模、个人定制耳机等相关工作中,获取人耳的一些关键生理曲线和关键点的准确位置非常重要.传统的边缘提取方法对光照和姿势变化非常敏感.本文提出了一种基于ResNeSt和筛选模板策略的改进YOLACT实例分割网络,分别从定位和分割两方面对原始YOLACT算法进行改进,通过标注人耳数据集,训练改进的YOLACT模型,并在预测阶段使用改进的筛选模板策略,可以准确地分割人耳的不同区域并提取关键的生理曲线.相较于其他方法,本文方法在测试图像集上显示出更好的分割精度,且对人耳姿态变化时具有一定的鲁棒性. In related work, such as human ear shape clustering, three-dimensional human ear modeling, and personal customized headphones, the key physiological curves of the human ear and the accurate positions of key points need to be determined. Moreover, as an important biological feature, the morphological analysis and classification of the human ear are of considerable value for medical work related to the human ear. However, because of the complex morphological structure of the human ear, the generation of a general standard for the morphological structure of the human ear is difficult. This study divided the morphological structure of the human ear into three regions, namely, helix, antihelix, and concha, for instance segmentation and key physiological curve extraction. Traditional edge extraction methods are sensitive to illumination and posture variations. Moreover, the color distribution of one human ear image is relatively consistent. Thus, the transition among the three regions may not be obvious, which will cause poor adaptability for traditional edge extraction methods when extracting the key physiological curves of the human ear. To address this problem, this study proposed an improved YOLACT(You Only Look At CoefficienTs) instance segmentation model based on the ResNeSt backbone and the “screening mask” strategy, which improves the original YOLACT model from two aspects, namely, localization and segmentation. Our ResNeStbased YOLACT model was trained with labeled ear images from the USTB-Helloear image set. In the prediction stage, the original cropping mask strategy was discarded and replaced with our proposed screening mask strategy to ensure the integrity of the edges of the segmentation area. These improvements enhance the accuracy of curve detection and extraction and can accurately segment different regions of the human ear and extract key physiological curves. Compared with other methods, our proposed method shows better segmentation accuracy on the test image set and is more robust to posture variations of the human ear.
作者 袁立 夏桐 张晓爽 YUAN Li;XIA Tong;ZHANG Xiao-shuang(School of Automation,University of Science and Technology Beijing,Beijing 100083,China)
出处 《工程科学学报》 EI CSCD 北大核心 2022年第8期1386-1395,共10页 Chinese Journal of Engineering
基金 国家自然科学基金资助项目(61472031)。
关键词 人耳 生理曲线提取 实例分割 改进YOLACT ResNeSt human ear physiological curve extraction instance segmentation improved YOLACT ResNeSt
  • 相关文献

参考文献3

二级参考文献21

  • 1杨月如,吴红斌.耳廓的解剖学研究[J].解剖学杂志,1988(1):56-58. 被引量:12
  • 2钟小丽,谢菠荪.衣服、耳廓对肩部反射及头相关传输函数的综合影响(英文)[J].声学技术,2006,25(2):113-118. 被引量:8
  • 3谢菠荪,钟小丽,饶丹,梁志强.头相关传输函数数据库及其特性分析[J].中国科学(G辑),2006,36(5):464-479. 被引量:19
  • 4GB/T2428-1998成年人头面部尺寸[S].
  • 5Batteau D W.The role of the pinna in human localization[J].Proc.R.Soc.London,Ser.B.1967,168:158-180.
  • 6Lopez-Poveda E A,Meddis R.A physical model of sound diffraction and reflections in the human concha[J].J.Acoust.Soc.Am,1996,100:3248-3259.
  • 7Gardner M B,Gardner R S.Problem of localization in the median plane:Effect of pinna cavity occlusion[J].J.Aconst.Soc.Am.1974,53:400-408.
  • 8Hebrank J,Wright D.Spectral cues used in the location of sound sources on the median plane[J].J.Acuust.Soc.Am.1974,56:1829-1834.
  • 9Musicant A,Buffer R.The influence of pinnae-based spectral cues on sound localization[J].J.Aconst.Soc.Am,1984,75:1195-1200.
  • 10Asano F,Suzuki Y,Sone T.Role of spectral cues in median plane localization[J].J.Aconst.Soc.Am,1990,85:159-168.

共引文献43

同被引文献15

引证文献2

二级引证文献1

相关作者

内容加载中请稍等...

相关机构

内容加载中请稍等...

相关主题

内容加载中请稍等...

浏览历史

内容加载中请稍等...
;
使用帮助 返回顶部