JOURNAL ARTICLE
Enhancing music recognition using deep learning-powered source separation technology for cochlear implant users.
Published In: Journal of the Acoustical Society of America, 2024, v. 155, n. 3. P. 1694 1 of 3
Database: Academic Search Ultimate 2 of 3
Authored By: Chang, Yuh-Jer; Han, Ji-Yan; Chu, Wei-Chung; Li, Lieber Po-Hung; Lai, Ying-Hui 3 of 3
Abstract
This article focuses on enhancing music listening experiences for cochlear implant (CI) users through a deep learning-based source separation system with a self-fitting function. The study employed the Demucs_augment model, which integrates noise augmentation during training, to separate music into components (vocals, rhythm, harmony) and allows users to customize the mixing proportions according to personal preference. Objective evaluations using source-to-distortion ratio (SDR), source-to-interference ratio (SIR), and source-to-artifact ratio (SAR) demonstrated that the proposed system outperformed the baseline Demucs model, particularly in noisy environments. Subjective listening tests with vocoder simulations further confirmed that personalized self-fitting significantly improved music appreciation compared to fixed vocal-to-instrument ratios. While computational complexity currently limits real-time implementation on CI devices, the study suggests that the proposed approach holds promise for future clinical application to improve music perception among CI users.
Additional Information
- Source:Journal of the Acoustical Society of America. 2024/03, Vol. 155, Issue 3, p1694
- Document Type:Article
- Subject Area:Music
- Publication Date:2024
- ISSN:0001-4966
- DOI:10.1121/10.0025057
- Accession Number:176343120
- Copyright Statement:Copyright of Journal of the Acoustical Society of America is the property of American Institute of Physics and its content may not be copied or emailed to multiple sites without the copyright holder's express written permission. Additionally, content may not be used with any artificial intelligence tools or machine learning technologies. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Looking to go deeper into this topic? Look for more articles on EBSCOhost.