Efficient Speech Biological Hashing Secure Retrieval Algorithm Based on Double Hash Index
Abstract To address the security of speech data during channel transmission and cloud storage, as well as the retrieval-efficiency problems caused by the large volume, high dimensionality, and high space complexity of speech data, an efficient speech biological hashing secure retrieval algorithm based on a double hash index is proposed. First, on the server side, the spectral flux and kurtosis factor features of the speech signal are extracted and fused, Bagging classification is applied to the differential hashes of the speech signals, and a key distribution index table is built from the classification results. Next, according to the key distribution index table, a biometric template with a single mapping key is established and quantized to construct the biological hash, which yields the hash index. Meanwhile, a mixed-domain scrambling encryption algorithm encrypts the original speech to build the encrypted speech database. Finally, the hash index and the encrypted speech database are uploaded to the cloud, where the cloud biological hash index table is constructed. On the mobile side, matching retrieval uses the normalized Hamming distance. Experimental results show that the matching threshold interval of the algorithm is (0.2694, 0.4173), so the matching threshold can be chosen flexibly and the algorithm has good robustness and discrimination. The average retrieval time for a single speech clip is only 9.4957×10^(–4) s, and recall and precision after 15 kinds of content-preserving operations are both 100%, indicating good retrieval performance that can meet speech retrieval needs in various environments. The key space of the proposed encryption algorithm is 10^(60), which indicates that it can resist exhaustive key-search attacks and protect the speech data. In addition, the constructed biometric templates have good diversity, security, and revocability.
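The matching step named in the abstract, normalized Hamming distance with a threshold, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the 8-bit toy hashes, the speech identifiers, and the threshold value 0.35 (chosen only because it lies inside the reported interval (0.2694, 0.4173)) are all assumptions made for illustration.

```python
from typing import Dict, Optional, Sequence, Tuple


def normalized_hamming(a: Sequence[int], b: Sequence[int]) -> float:
    """Fraction of bit positions at which two equal-length binary hashes differ."""
    if len(a) != len(b) or len(a) == 0:
        raise ValueError("hashes must be non-empty and of equal length")
    return sum(x != y for x, y in zip(a, b)) / len(a)


def retrieve(
    query: Sequence[int],
    index: Dict[str, Sequence[int]],
    threshold: float = 0.35,  # assumed value inside the reported interval
) -> Optional[Tuple[str, float]]:
    """Return (id, distance) of the closest hash below the threshold, else None."""
    best: Optional[Tuple[str, float]] = None
    for speech_id, stored_hash in index.items():
        d = normalized_hamming(query, stored_hash)
        if d < threshold and (best is None or d < best[1]):
            best = (speech_id, d)
    return best


# Hypothetical toy index: real biological hashes would be far longer.
index = {
    "speech_001": [1, 0, 1, 1, 0, 0, 1, 0],
    "speech_002": [0, 1, 0, 0, 1, 1, 0, 1],
}
query = [1, 0, 1, 1, 0, 0, 1, 1]  # differs from speech_001 in one bit
print(retrieve(query, index))
```

A linear scan is used here for clarity only; the abstract's index tables suggest the candidate set is narrowed before matching, a step this sketch does not model.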
Authors HUANG Yibo (黄羿博); CHEN Dehuai (陈德怀); ZHANG Qiuyu (张秋余) (College of Physics and Electronic Engineering, Northwest Normal University, Lanzhou 730070, China; School of Computer and Communication, Lanzhou University of Technology, Lanzhou 730050, China)
Source Journal of Cyber Security (《信息安全学报》), CSCD, 2024, No. 2, pp. 69-83 (15 pages)
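Funding Supported by the Gansu Province Science and Technology Program, the Natural Science Foundation of Gansu Province (No. 21JR7RA120), and the National Natural Science Foundation of China (No. 61862041).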
Keywords secure speech retrieval; double hash index; biometric template; biological hashing; encrypted speech