基于双哈希索引的高效语音生物哈希安全检索算法

Efficient Speech Biological Hashing Secure Retrieval Algorithm Based on Double Hash Index

下载PDF

导出

摘要针对语音数据在信道传输与云端存储时的安全性问题,以及由于语音数据数目大、维数高、空间复杂度高带来的检索效率问题,提出了一种基于双哈希索引的高效语音生物哈希安全检索算法。首先,在服务端分别提取语音信号的频谱通量与峭度因子特征并将两种特征融合,利用Bagging分类对语音信号的差分哈希分类,并基于分类结果构建密钥分配索引表;然后,根据密钥分配索引表建立具有单一映射密钥的生物特征模板,并将其量化构造生物哈希,得到哈希索引;同时,采用混合域置乱加密算法对原始语音加密,构建密文语音库;最后,将哈希索引与密文语音库上传至云端并构建云端生物哈希索引表。在移动端,采用归一化汉明距离进行匹配检索。实验结果表明:本文算法的匹配阈值区间为(0.2694, 0.4173),说明该检索算法能够灵活选取匹配阈值,具有较好的鲁棒性和区分性;检索过程中单条语音平均检索时间仅为9.4957×10^(–4)s,并且经过15种内容保持操作后的查全率与查准率均为100%,说明该算法具有较好的检索性能,可以满足各种环境下的语音检索需求;同时提出的加密算法密钥空间大小为1060,说明能够抵御穷举密钥攻击、保证语音数据的安全;此外,构建的生物特征模板具有良好的多样性、安全性和可撤销性。 Aiming at the security of speech data in channel transmission and cloud storage,as well as the problems of retrieval efficiency caused by the large number,high dimension and high spatial complexity of speech data,an efficient speech biological hashing secure retrieval algorithm based on double hash index is proposed.Firstly,the spectral flux and kurtosis factor features of speech signal are extracted in the server terminal,and then the two features are fused,Bagging classification is used to classify speech signals by differential hashing,and the key distribution index table is constructed based on the classification results;then,according to the key distribution index table,the biometric template with a single mapping key is established,and its biometric hash is quantized to obtain the hash index;at the same time,the mixed do-main scrambling encryption is used to encrypt the original speech and construct the encrypted speech database;finally,the hash index and encrypted speech database are uploaded to the cloud and the biological hash index table is constructed.In the mobile terminal,using normalized hamming distance for matching retrieval.The experimental results show that the matching threshold interval obtained by the algorithm is(0.2694,0.4173),which shows that the retrieval system can flexi-bly select the matching threshold and has good robustness and discrimination;the average retrieval time of a single speech in the retrieval process is only 9.4957×10^(–4)s,and the recall and precision after 15 kinds of content preservation operations are 100%,it shows that the algorithm has good retrieval performance and can meet the needs of speech retrieval in various environments;at the same time,the size of the encryption algorithm key space is 1060,which shows that it can resist ex-haustive key attack and ensure the security of speech data;in addition,the constructed biometric templates have good di-versity,security and revocability.

作者黄羿博陈德怀张秋余 HUANG Yibo;CHEN Dehuai;ZHANG Qiuyu(College of Physics and Electronic Engineering,Northwest Normal University,Lanzhou 730070,China;School of Computer and Communication,Lanzhou University of Technology,Lanzhou 730050,China)

机构地区西北师范大学物理与电子工程学院兰州理工大学计算机与通信学院

出处《信息安全学报》 CSCD 2024年第2期69-83,共15页 Journal of Cyber Security

基金甘肃省科技计划项目资助甘肃省自然科学基金(No.21JR7RA120) 国家自然科学基金(No.61862041)资助。

关键词安全语音检索双哈希索引生物特征模板生物哈希密文语音 secure speech retrieval double hash index biometric template biological hashing encrypted speech

分类号 TP391.3 [自动化与计算机技术—计算机应用技术] TN912.3 [电子电信—通信与信息系统]

引文网络
相关文献

参考文献5

1牛夏牧,焦玉华.感知哈希综述[J].电子学报,2008,36(7):1405-1411. 被引量：97
2毋立芳,马玉琨,周鹏,郑伟诗.生物特征模板保护综述[J].仪器仪表学报,2016,37(11):2407-2420. 被引量：15
3黄羿博,王勇,张秋余,陈腾飞.基于混沌测量矩阵的生物哈希密文语音检索[J].华中科技大学学报（自然科学版）,2020,48(12):32-37. 被引量：5
4郑铁然,韩纪庆.基于音节Lattice的汉语语音检索技术及其索引去冗余方法[J].声学学报,2008,33(6):526-533. 被引量：7
5曾珂,禹思敏,胡迎春,张泽清.基于3D-LSCM的图像混沌加密算法[J].电子技术应用,2020,46(1):86-91. 被引量：13

二级参考文献65

1王欢良,韩纪庆,李海峰.基于特征似然度加权和维数缩减的Robust语音端点检测[J].声学学报,2007,32(1):62-68. 被引量：7
2Abberley D, Renals S, Cook G. Retrieval of broadcast news documents with the THISL system. In: Proc. ICASSP98, Seattle, 1998:3781--3784
3Beth L, Pedro M, Om D. Word and subword indexing approaches for reducing the effects of OOV queries on spoken audio. In: Proc. HLT2002, San Diego, 2002
4Ng K, Zue V W. Subword-based approaches for spoken document retrieval. Speech Communication, 2000; 32: 157-- 186
5Wechsler M, Schauble P. Speech retrieval based on automatic indexing. In: Proc.MIRO '95, Glasgow, 1995
6Foote J T et al. Unconstrained keyword spotting using phone lattices with application to spoken document retrieval. Computer Speech and Language, 1997; 2: 207-- 224
7Cardillo P S, Clements M, Miller M S. Phonetic searching vs. LVCSR: How to find what you really want in audio archives. International Journal of Speech Technology, 2002; 5:9--22
8Seide F et al. Vocabulary independent search in spontaneous speech. In: Proc. ICASSP'04, Montreal, 2004: I253--I256
9Yu P, Seide P. A hybrid word / phoneme-based approach for improved vocabulary-independent search in spontaneous speech. In: Proc. INTERSPEECH-2004, 3eju Island, korea, 2004:293--296
10Woodland P C et al. Effects of out of vocabulary words in spoken document retrieval. In: Proe. SIGIR, Athens, Greece, 2000:372--374

共引文献132

1刘笑楠,张文云,高艳娜.局部置乱结合双随机相位编码的双虹膜身份模板保护方法[J].仪器仪表学报,2020(6):233-239. 被引量：5
2邓子云.基于Scrapy的网站增量式爬取功能的研制与应用[J].湖南工业职业技术学院学报,2022,22(6):25-29.
3韩琦,王志芳,牛夏牧,李琼.针对索引图像的人脸区域分级加密算法[J].电子学报,2008,36(B12):25-29. 被引量：2
4王阿川,陈海涛.基于离散余弦变换的鲁棒感知图像哈希技术[J].中国安全科学学报,2009,19(4):91-96. 被引量：9
5刘亚多,李伟,李晓强,汪竹蓉,冯瑞.压缩域鲁棒音乐指纹算法研究[J].电子学报,2010,38(5):1172-1176. 被引量：9
6古今,郭立,梁惠,程龙.一种高效鲁棒的语音感知认证算法[J].小型微型计算机系统,2010,31(7):1461-1465. 被引量：1
7王霅煜,涂惠燕.基于内容的语音课件关键词检索系统:设计与实现[J].计算机应用与软件,2011,28(4):120-123. 被引量：1
8张磊,陆冬,项学智.改进的Katz算法及其在基于Lattice识别系统中的应用[J].模式识别与人工智能,2011,24(2):249-254.
9孙锐,闫晓星,高隽.基于视皮层全局感知特征的顽健图像散列方法[J].通信学报,2011,32(6):60-66.
10欧阳杰,高金花,文振焜,张盟,刘朋飞,杜以华.融合HVS计算模型的视频感知哈希算法研究[J].中国图象图形学报,2011,16(10):1883-1889. 被引量：7

1黄羿博,王宁,张秋余.基于卢氏特征安全模板的语音生物哈希检索算法[J].华中科技大学学报（自然科学版）,2023,51(11):60-66.
2史桂梅,雷红彦,严晓芸,王庚,沙琼玥,史春波,马少元,李岳,马晓明.青海省2015—2019年职业性尘肺病现况与疾病负担[J].环境与职业医学,2023,40(11):1278-1282.
3李旭,孙文松,刘莹,杨正书.北五味子有效成分含量与土壤因子的相关性分析[J].园艺与种苗,2024,44(1):68-71.
4孙术桓.朝阳市灌木林修复技术[J].辽宁林业科技,2024(1):76-78.
5王娟,曹树金,王志红,彭碧涛.面向探索式搜索的领域知识图谱构建及实验探索[J].图书情报工作,2024,68(3):105-116.
6赵耿,马英杰,董有恒.混沌密码理论研究与应用新进展[J].信息网络安全,2024(2):203-216.
7王娟,努尔买买提·黑力力.基于字典分级和属性加权的密文排序检索方案[J].新疆大学学报（自然科学版中英文）,2024,41(2):246-256.

信息安全学报

2024年第2期

浏览历史

内容加载中请稍等...

基于双哈希索引的高效语音生物哈希安全检索算法

参考文献5

二级参考文献65

共引文献132

相关作者

相关机构

相关主题

浏览历史