JOURNAL ARTICLE

An Efficient Speaker Identification System with High-Level Feature Extraction and Database Dimensionality Reduction.

Published In: Ingénierie des Systèmes d'Information, 2025, v. 30, n. 9. P. 2473 1 of 3
Database: Business Source Ultimate 2 of 3
Authored By: Shatti, Ahmed Hussein; Mohamed-Kazim, Haider A.; Saraj, Rusul Noori; Aldhahab, Ahmed 3 of 3

Abstract

Speaker identification is a biometric technology that leverages distinct characteristics obtained from vocal utterances to verify users' identities. Later advancements in multiple fields have raised the importance of speaker identification systems, particularly in security applications. The challenging task in speaker identification systems is how accurately to extract discriminative features from the speech signal. This paper presents a novel approach method that integrates the two-dimensional discrete multi-wavelet analysis-based critical sampling scheme (2D-DMWT-CS) with the principal component analysis (PCA) to employ a reliable and efficient speaker identification system. The proposed method incorporates four phases: preprocessing, feature extraction, dimensionality reduction, and training and classification. During the preprocessing phase, successive refinement techniques such as duration division, silence removal, resampling, and dimension reshaping are applied to the databases. All databases speech samples are then analyzed using the 2D-DMWT-CS. The resultant discriminative features of the wavelet analysis are further processed by the PCA during the supplementary dimensionality reduction phase. The latter provides high-level, hierarchically ordered features that come with a substantial benefit for enhancing the classification accuracy of the convolutional neural network (CNN). The suggested approach was validated by testing and evaluating the framework over many individuals using their speech identities in four online datasets: RAVDESS, TIMIT, ELSDSR and SALU-AC. The achievement results, in terms of the recognition rate, were 97.19% for the TIMIT database, 97.96% for the RAVDESS database, and 98.91% for the ELSDR database, which are higher results than those in the state-of-the-art literature. The reliable and efficient identification rates with high accuracy and fast learning with reducing dimensionality, are the main contributions of this work. [ABSTRACT FROM AUTHOR]

Additional Information

Source:Ingénierie des Systèmes d'Information. 2025/09, Vol. 30, Issue 9, p2473
Document Type:Article
Subject Area:Applied Sciences
Publication Date:2025
ISSN:1633-1311
DOI:10.18280/isi.300921
Accession Number:189599960
Copyright Statement:Copyright of Ingénierie des Systèmes d'Information is the property of International Information & Engineering Technology Association (IIETA) and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)

Looking to go deeper into this topic? Look for more articles on EBSCOhost.