实验室概况


实验室主要研究人工智能系统的安全性与人工智能技术在内容安全中的应用。主要研究方向包括(1)人工智能模型安全:深度神经网络版权溯源和完整性检验、模型水印和指纹的性能界限分析。(2)人工智能生成内容安全:具备可迁移性的深度伪造内容识别、生成多媒体内容模型的版权安全性分析。(3)生物特征信息的分析:基于大型预训练的唇读模型、不依赖预设内容的唇语特征身份识别等。实验室近年在相关领域国际顶刊顶会如IEEE TIFS,IEEE TCSVT,ICCV,IJCAI,AAAI等发表论文50余篇,多次承担国家自然科学基金研究,参与华为、奇安信、蚂蚁等横向课题数项。

最新成果

Lip Feature Disentanglement for Visual Speaker Authentication in Natural Scenes

Recent studies have shown that lip shape and movement can be used as an effective biometric feature for speaker authentication. By using random prompt text scheme, lip-based authentication system can also achieve good liveness de- tection performance in laboratory scenarios. However, due to the increasingly widespread mobile application, the authentication system may face additional practical difficulties such as complex background, limited user samples, etc., which will degrade the authentication performance derived by current methods. To confront the above problems, a new deep neural network, i.e. the Triple-feature Disentanglement Network for Visual Speaker Authentication (TDVSA-Net), is proposed in this paper to extract discriminative and disentangled lip features for visual speaker authentication in the random prompt text scenario. Three de- coupled lip features, including the content feature inferring the speech content, the physiological lip feature describing the static lip shape and appearance and the behavioral lip feature depicting the unique pattern in lip movements during utterance, are extracted by TDVSA-Net and fed into corresponding mod- ules to authenticate both the prompt text and the speaker’s identity. Experiment results have demonstrated that compared with several SOTA visual speaker authentication methods, the proposed TDVSA-Net can extract more discriminative and robust lip features which boost the content recognition and identity authentication performance against both human imposters and DeepFake attacks.