JOURNAL ARTICLE

Deepfake Speech Recognition and Detection.

Published In: International Journal of Pattern Recognition & Artificial Intelligence, 2023, v. 37, n. 9. P. 1 1 of 3
Database: Academic Search Ultimate 2 of 3
Authored By: Chang, Hung-Chang 3 of 3

Abstract

Deepfake technology, especially deep voice, which has been derived from artificial intelligence in recent years, is potentially harmful, and the public is not yet wary. However, many speech synthesis models measure the degree of true restitution by Mean Opinion Rating (MOS), a subjective assessment of naturalness and quality of speech by human subjects, but in future it will be difficult to distinguish the interlocutor's identity through the screen. For this reason, this study addresses the threat posed by this new technology by combining representational learning and 0transfer learning in two sub-systems: a recognition system and a voice print system. The recognition system is responsible for the detection of which voice is a fake voice generated by speech conversion or speech synthesis techniques, while the acoustic system is responsible for the verification of the speaker's identity through acoustic features. In the speech recognition system, we use the representation learning method and the transfer classification method. We use X-vector data for training, and then fine-tune the model using four types of marker data to learn the representation vectors of real and fake voice, and use support vector machine to classify real and fake voice in the back-end to reduce the negative effect of the new technique. [ABSTRACT FROM AUTHOR]

Additional Information

Source:International Journal of Pattern Recognition & Artificial Intelligence. 2023/07, Vol. 37, Issue 9, p1
Document Type:Article
Subject Area:Computer Science
Publication Date:2023
ISSN:0218-0014
DOI:10.1142/S0218001423500155
Accession Number:169947256
Copyright Statement:Copyright of International Journal of Pattern Recognition & Artificial Intelligence is the property of World Scientific Publishing Company and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)

Looking to go deeper into this topic? Look for more articles on EBSCOhost.