为解决目前深度仿造检测方法对于跨数据集的检测性能难以提高的问题,提出基于注意力机制和一致性损失相结合的深度伪造人脸检测方法(method based on attention mechanism and consistency loss,MAMCL)。采用多注意力机制,迫使网络捕捉...为解决目前深度仿造检测方法对于跨数据集的检测性能难以提高的问题,提出基于注意力机制和一致性损失相结合的深度伪造人脸检测方法(method based on attention mechanism and consistency loss,MAMCL)。采用多注意力机制,迫使网络捕捉到更细微的局部异常。采用基于注意力机制的擦除方式,鼓励模型深入挖掘之前忽略的区域。设计一致性模块获取伪造图像中普遍存在的不一致细节特征,并应用一致性损失引导模型更加关注伪造细节。在面部取证++(FaceForensics++,FF++)数据集上进行实验,准确率达到96.38%,受试者工作特征曲线(receiver operating characteristic curve,ROC)的曲线下面积达到99.34%,在泛化性能测试中也取得了良好的效果。通过消融实验,证明了每个模块的有效性。结果表明,提出的检测方法能够较为准确地检测深度伪造人脸,且具有良好的泛化性能,可以作为应对当前人脸伪造威胁的有效检测手段。展开更多
In recent years,various speech embedding methods based on deep learning have been proposed and have shown better performance in speaker verification.Those new technologies will inevitably promote the development of fo...In recent years,various speech embedding methods based on deep learning have been proposed and have shown better performance in speaker verification.Those new technologies will inevitably promote the development of forensic speaker verification.We propose a new forensic speaker verification method based on embeddings trained with loss function called generalized end-to-end(GE2E)loss.First,a long short-term memory(LSTM)based deep neural network(DNN)is trained as the embedding extractor,then the cosine similarity scores between embeddings from same speaker comparison pairs and different speaker comparison pairs are trained to represent within-speaker model and between-speaker model respectively,and finally,the cosine similarity scores between the questioned embeddings and enrolled embeddings are evaluated in the above two models to get the likelihood ratio(LR)value.On the subset of LibriSpeech,test-other-500,we achieve a new state of the art.Both all the same speaker comparison pairs and different speaker comparison pairs get correct results and can provide considerable strong evidence strength for courts.展开更多
文摘为解决目前深度仿造检测方法对于跨数据集的检测性能难以提高的问题,提出基于注意力机制和一致性损失相结合的深度伪造人脸检测方法(method based on attention mechanism and consistency loss,MAMCL)。采用多注意力机制,迫使网络捕捉到更细微的局部异常。采用基于注意力机制的擦除方式,鼓励模型深入挖掘之前忽略的区域。设计一致性模块获取伪造图像中普遍存在的不一致细节特征,并应用一致性损失引导模型更加关注伪造细节。在面部取证++(FaceForensics++,FF++)数据集上进行实验,准确率达到96.38%,受试者工作特征曲线(receiver operating characteristic curve,ROC)的曲线下面积达到99.34%,在泛化性能测试中也取得了良好的效果。通过消融实验,证明了每个模块的有效性。结果表明,提出的检测方法能够较为准确地检测深度伪造人脸,且具有良好的泛化性能,可以作为应对当前人脸伪造威胁的有效检测手段。
基金Supported by the National Key Research and Development Projects(2017YFC0821000)Guangzhou Science and Technology Project(2019030004)Key Lab of Forensic Science,Ministry of Justice,China(KF202117)。
文摘In recent years,various speech embedding methods based on deep learning have been proposed and have shown better performance in speaker verification.Those new technologies will inevitably promote the development of forensic speaker verification.We propose a new forensic speaker verification method based on embeddings trained with loss function called generalized end-to-end(GE2E)loss.First,a long short-term memory(LSTM)based deep neural network(DNN)is trained as the embedding extractor,then the cosine similarity scores between embeddings from same speaker comparison pairs and different speaker comparison pairs are trained to represent within-speaker model and between-speaker model respectively,and finally,the cosine similarity scores between the questioned embeddings and enrolled embeddings are evaluated in the above two models to get the likelihood ratio(LR)value.On the subset of LibriSpeech,test-other-500,we achieve a new state of the art.Both all the same speaker comparison pairs and different speaker comparison pairs get correct results and can provide considerable strong evidence strength for courts.