基于分形特征的音频检索被引量：2

Fractal Feature-based Audio Retrieval

下载PDF

导出

摘要提出利用分形几何抽取音频特征的全局化音频检索,将其学习阶段计算音频数据库中每个音频的分维作为特征向量,保存在音频特征数据库中,并建立索引。其检索阶段则首先计算查询音频的分维,然后从音频数据库中快速找出分维最相似的若干音频对象。分维刻画了音频的内在属性如自相似性,使其具有片段检索对匹配的起点不敏感、抗噪音、检索速度快等优点。用FRACTAL,MFCC和SOLAR3种方法对数据集分别检索,实验结果表明基于分维的音频检索在性能和时间复杂度上有显著优势。 The fractal geometry-based feature extraction is proposed for audio retrieval system. During the learning process, the system computes the fractal dimension as the feature vector for each audio in audio database and then saves it in the feature vector database. In the retrieval process, the fractal dimension for the query audio is firstly extracted, by which the most similar audios from the audio database are retrieved. The fractal dimension is intrinsic for each audio such as self-similarity so as to make it not sensitive to noise and position of the audio fragment to be retrieved from the long audio. It also retrieves the audios quickly. Compared with FRACTAL, MFCC and SOLAR, the experimental results validate that the proposed approach advances in performance and time complexity.

作者李坚毛先领文贵华

机构地区华南理工大学计算机应用工程研究所华南理工大学计算机学院

出处《计算机工程》 CAS CSCD 北大核心 2008年第11期211-213,共3页 Computer Engineering

关键词音频检索分形音频特征 audio retrieval fractal audio feature

分类号 TP18 [自动化与计算机技术—控制理论与控制工程]

引文网络
相关文献

参考文献5

1柳群英.基于内容的音频信息检索技术[J].现代情报,2005,25(6):91-93. 被引量：7
2卢坚,陈毅松,孙正兴,张福炎.基于隐马尔可夫模型的音频自动分类[J].软件学报,2002,13(8):1593-1597. 被引量：47
3Boshoff H F V. A Fast Box Counting Algorithm for Determining the Fractal Dimension of Sampled Continuous Functions[C]// Proceedings of the 1992 South African Symposium on Communications and Signal Processing. [S. l.]: ACM Press, 1992.
4Hoiem D, Sukthankar R. SOLAR: Sound Object Localization and Retrieval in Complex Audio Environments[C]//Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing. [S. l.]: IEEE Press, 2005.
5Kim K, Kim Se-Young, Jeon Jae-Kuk. et al. Quick Audio Retrieval Using Multiple Feature Vectors[J]. IEEE Transactions on Consumer Electronics, 2006, 52(1 ): 200-205.

二级参考文献25

1[1]Feiten, B., Frank, R., Ungvary, T. Organization of sounds with neural nets. In: Proceedings of the 1991 International Computer Music Conference, International Computer Music Association. San Francisco, 1991. 441～444.
2[2]Feiten, B., Günzel, S. Automatic indexing of a sound database using self-organizing neural nets. Computer Music Journal, 1994,18(3):53～65.
3[3]Wold, E., Blum, T., Keislar, D., et al. Content-Based classification, search and retrieval of audio. IEEE Multimedia Magazine, 1996,3(3):27～36.
4[4]Foote, J.T. Content-Based retrieval of music and audio. Multimedia Storage and Archiving Systems II, 1997,32(29):138～147.
5[5]Li, S.Z. Content-Based classification and retrieval of audio using the nearest feature line method. IEEE Transactions on Speech and Audio Processing, 2000,8(5):619～625.
6[6]Li, S.Z., Guo, Guo-dong. Content-Based audio classification and retrieval using SVM learning. In: Proceedings of the 1st IEEE Pacific-Rim Conference on Multimedia. 2000.
7[7]Jiang, Hao, Lin, Tony, Zhang, Hong-jiang. Video segmentation with the support of audio segmentation and classification. In: Proceedings of the IEEE International Conference on Multimedia and Expo (ICME 2000), Vol 3. NY: IEEE, 2000. 1507～1510.
8[8]He, Li-wei, Sanocki, E., Gupta, A., et al. Auto-Summarization of audio-video presentations. In: Proceedings of the 7th ACM International Conference on Multimedia. Orlando: ACM Press, 1999. 489～498.
9[9]Patel, N., Sethi, I. Audio characterization for video indexing. In: Proceedings of the SPIE on Storage and Retrieval for Still Image and Video Databases, Vol 2670. 1996. 373～384.
10[10]Liu, Zhu, Huang, J., Wang, Y. Classification of TV programs based on audio information using hidden Markov model. In: Proceedings of the IEEE Signal Processing Society 1998 Workshop on Multimedia Signal Processing. IEEE, 1998. 27～32.

共引文献52

1齐俊英,孙劲光,高爱东.基于内容的音频自动分类方法[J].辽宁工程技术大学学报（自然科学版）,2005,24(z1):170-172. 被引量：5
2郑继明,李瑞仙,蒲兴成.基于单状态HMM的音频分类方法研究[J].计算机应用,2009,29(2):392-394.
3陈姗姗.未来广播中的音频检索技术[J].视听界（广播电视技术）,2010(3):62-64.
4柳群英.基于内容的音频信息检索技术[J].现代情报,2005,25(6):91-93. 被引量：7
5郑贵滨,韩纪庆,李海峰,郑铁然.基于分段的实时声频检索方法[J].声学学报,2006,31(2):101-108. 被引量：5
6郭兴吉,范秉琪.基于特征的音频比对技术[J].河南师范大学学报（自然科学版）,2006,34(2):35-38. 被引量：15
7郑贵滨,韩纪庆.基于直方图的树与链表相结合的音频索引方法[J].哈尔滨工业大学学报,2006,38(11):1915-1918. 被引量：1
8季春.音频信息检索技术的发展及应用[J].现代情报,2007,27(1):157-160. 被引量：9
9郭兴吉.隐马尔科夫模型在音频波形识别中的应用研究[J].福建电脑,2007,23(3):13-14.
10黄光球,汪晓海.基于BP-HMM的网络入侵检测方法研究[J].计算机工程,2007,33(10):131-133. 被引量：2

同被引文献13

1Bigerelle M, Iost A.Fractal dimension and classification of music[J]. Chaos, Solitons and Fractals, 2000, 11 ( 14 ) : 2179-2 192.
2Traina C,Jr,Traina A.Fast features selection using fractal dimen- sion[C]//Proc of XV Brazilian Database Symposium on Data- base.Berlin: Springer, 2000 : 158-171.
3Godin R, Missaoui R, Alaoui H.Incremental concept formation algorithms based on Galois(concept) lattices[J].Computational Intelligence, 1995, 11(2) : 246-267.
4郭平,陈其鑫,王艳霞.基于分形维数的属性约简[J].计算机科学,2007,34(9):189-190. 被引量：5
5闫光辉,李战怀.两阶段无监督顺序前向分形属性规约算法[J].计算机研究与发展,2008,45(11):1955-1964. 被引量：4
6孔旭,关佶红.以声谱图相似度为度量的波形音乐检索[J].计算机工程与应用,2009,45(13):136-141. 被引量：7
7刘亚多,李伟,李晓强,汪竹蓉,冯瑞.压缩域鲁棒音乐指纹算法研究[J].电子学报,2010,38(5):1172-1176. 被引量：9
8李晓丽,杜振龙.模糊粗糙集在音频检索中的应用[J].计算机工程与应用,2010,46(15):124-126. 被引量：1
9张建华,汪鑫.基于内容音频检索综述[J].商情,2012(2):215-217. 被引量：2
10黄秋兰,程耀东,陈刚.分布式存储系统的哈希算法研究[J].计算机工程与应用,2014,50(1):1-4. 被引量：17

引证文献2

1张燕,唐振民,李燕萍.面向推荐系统的音乐特征抽取[J].计算机工程与应用,2011,47(5):130-133. 被引量：8
2叶循澹.基于多级索引的音频特征检索比对算法[J].电子技术与软件工程,2018(11):171-172.

二级引证文献8

1陈雅茜.音乐推荐系统及相关技术研究[J].计算机工程与应用,2012,48(18):9-16. 被引量：14
2谭学清,何珊.音乐个性化推荐系统研究综述[J].现代图书情报技术,2014(9):22-32. 被引量：23
3刘红星,吴九汇,张俊,张林,郭峰.基于时频域分形维数差的声品质评价新方法[J].噪声与振动控制,2018,38(A02):526-531. 被引量：3
4Sun Nan,Liu Borui,Liu Meiran,Cui Jizhe.Application of Personalized Recommendation System in Music Platform[J].管理科学与研究（中英文版）,2017,6(1):1-7.
5翟姗姗,孙雪莹,李进华.基于社交体验的移动APP持续使用意愿研究——以网易云音乐为例[J].现代情报,2019,39(2):128-135. 被引量：20
6薛钧星,孔钧昊.基于CatBoost算法的个人歌曲喜好预测[J].电脑编程技巧与维护,2022(1):110-111. 被引量：1
7刘彦会.协同过滤算法在微信推荐小程序的应用[J].武夷学院学报,2024,43(6):51-57.
8冯鹏宇,陈平华,申建芳.融合LSTM和注意力机制的音乐分类推荐方法[J].计算机科学与应用,2020,10(12):2280-2290.

1巨小澎.浅谈建立电台音频数据库[J].世界广播电视,2000,14(3):27-30.
2欧阳浩,肖建华.基于网格的最小生成树聚类算法[J].计算机与现代化,2006(12):81-82. 被引量：3
3许俊.音频数据库的设计与实现[J].计算机与现代化,2000(4):73-76. 被引量：4
4徐俭.浅议MPEG-4标准及其应用[J].电视工程,2006,0(1):2-6.
5徐俭.浅议MPEG-4标准及其应用[J].有线电视技术,2006,13(1):40-45. 被引量：1
6崔新春,贺洁,秦小麟.基于盲源分离的多重音频数据库水印算法[J].电子学报,2012,40(1):78-83. 被引量：5
7王池社,张燕.基于内容的音频数据库的构建与应用[J].微计算机信息,2010,26(33):12-13.
8马驰,张红云,苗夺谦,张学东.改进的多阈值动态二值化算法[J].计算机工程,2006,32(6):203-205. 被引量：25
9陈兆乾,周志华,姜远,陈世福.一种具有抗噪音能力的增量式混合学习算法[J].计算机研究与发展,1999,36(6):675-680. 被引量：1
10吴震菊.基于S变换融合Canny算子的汽车轮廓提取[J].计算机与现代化,2014(4):29-32. 被引量：2

计算机工程

2008年第11期

浏览历史

内容加载中请稍等...

基于分形特征的音频检索被引量：2

参考文献5

二级参考文献25

共引文献52

同被引文献13

引证文献2

二级引证文献8

相关作者

相关机构

相关主题

浏览历史

基于分形特征的音频检索 被引量：2

参考文献5

二级参考文献25

共引文献52

同被引文献13

引证文献2

二级引证文献8

相关作者

相关机构

相关主题

浏览历史

基于分形特征的音频检索被引量：2