Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise.
J Acoust Soc Am. 2011 Apr; 129(4):2227-36.

Abstract

When a target-speech/masker mixture is processed with the signal-separation technique, ideal binary mask (IBM), intelligibility of target speech is remarkably improved in both normal-hearing listeners and hearing-impaired listeners. Intelligibility of speech can also be improved by filling in speech gaps with un-modulated broadband noise. This study investigated whether intelligibility of target speech in the IBM-treated target-speech/masker mixture can be further improved by adding a broadband-noise background. The results of this study show that following the IBM manipulation, which remarkably released target speech from speech-spectrum noise, foreign-speech, or native-speech masking (experiment 1), adding a broadband-noise background with the signal-to-noise ratio no less than 4 dB significantly improved intelligibility of target speech when the masker was either noise (experiment 2) or speech (experiment 3). The results suggest that since adding the noise background shallows the areas of silence in the time-frequency domain of the IBM-treated target-speech/masker mixture, the abruption of transient changes in the mixture is smoothed and the perceived continuity of target-speech components becomes enhanced, leading to improved target-speech intelligibility. The findings are useful for advancing computational auditory scene analysis, hearing-aid/cochlear-implant designs, and understanding of speech perception under "cocktail-party" conditions.
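The two manipulations described in the abstract can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' procedure: the function names, the 0 dB local criterion, and the use of Gaussian white noise as the "broadband noise" are all hypothetical choices, and the time-frequency analysis and resynthesis needed to apply the mask to a real waveform are omitted.

```python
import math
import random


def ideal_binary_mask(target_energy, masker_energy, lc_db=0.0):
    """Return a 0/1 mask with one entry per time-frequency unit.

    A unit is kept (1) when the local target-to-masker ratio in dB exceeds
    lc_db. The 0 dB default is an assumption, not a value from the abstract.
    """
    mask = []
    for t_row, m_row in zip(target_energy, masker_energy):
        row = []
        for t, m in zip(t_row, m_row):
            if m <= 0.0:
                keep = t > 0.0          # no masker energy: keep any target energy
            elif t <= 0.0:
                keep = False            # no target energy: discard the unit
            else:
                keep = 10.0 * math.log10(t / m) > lc_db
            row.append(1 if keep else 0)
        mask.append(row)
    return mask


def mean_power(x):
    """Average power of a sequence of samples."""
    return sum(v * v for v in x) / len(x)


def add_noise_at_snr(signal, snr_db, seed=0):
    """Add Gaussian white noise scaled so the signal-to-noise ratio is snr_db.

    White noise stands in for the broadband-noise background (an assumption).
    """
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in signal]
    wanted_noise_power = mean_power(signal) / (10.0 ** (snr_db / 10.0))
    gain = math.sqrt(wanted_noise_power / mean_power(noise))
    return [s + gain * n for s, n in zip(signal, noise)]
```

In this sketch, `signal` would be the IBM-processed mixture after resynthesis, and `snr_db=4.0` corresponds to the lowest signal-to-noise ratio at which the abstract reports a benefit.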

Authors

Department of Machine Intelligence, Peking University, Beijing 100871, China. (No affiliation information is available for the other authors.)

Pub Type(s)

Journal Article
Research Support, Non-U.S. Gov't

Language

eng

PubMed ID

21476677

Citation

Cao, Shuyang, et al. "Improvement of Intelligibility of Ideal Binary-masked Noisy Speech By Adding Background Noise." The Journal of the Acoustical Society of America, vol. 129, no. 4, 2011, pp. 2227-36.
Cao S, Li L, Wu X. Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise. J Acoust Soc Am. 2011;129(4):2227-36.
Cao, S., Li, L., & Wu, X. (2011). Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise. The Journal of the Acoustical Society of America, 129(4), 2227-36. https://doi.org/10.1121/1.3559707
Cao S, Li L, Wu X. Improvement of Intelligibility of Ideal Binary-masked Noisy Speech By Adding Background Noise. J Acoust Soc Am. 2011;129(4):2227-36. PubMed PMID: 21476677.
TY  - JOUR
T1  - Improvement of intelligibility of ideal binary-masked noisy speech by adding background noise.
AU  - Cao, Shuyang
AU  - Li, Liang
AU  - Wu, Xihong
PY  - 2011/4/12/entrez
PY  - 2011/4/12/pubmed
PY  - 2011/8/17/medline
SP  - 2227
EP  - 36
JF  - The Journal of the Acoustical Society of America
JO  - J Acoust Soc Am
VL  - 129
IS  - 4
N2  - When a target-speech/masker mixture is processed with the signal-separation technique, ideal binary mask (IBM), intelligibility of target speech is remarkably improved in both normal-hearing listeners and hearing-impaired listeners. Intelligibility of speech can also be improved by filling in speech gaps with un-modulated broadband noise. This study investigated whether intelligibility of target speech in the IBM-treated target-speech/masker mixture can be further improved by adding a broadband-noise background. The results of this study show that following the IBM manipulation, which remarkably released target speech from speech-spectrum noise, foreign-speech, or native-speech masking (experiment 1), adding a broadband-noise background with the signal-to-noise ratio no less than 4 dB significantly improved intelligibility of target speech when the masker was either noise (experiment 2) or speech (experiment 3). The results suggest that since adding the noise background shallows the areas of silence in the time-frequency domain of the IBM-treated target-speech/masker mixture, the abruption of transient changes in the mixture is smoothed and the perceived continuity of target-speech components becomes enhanced, leading to improved target-speech intelligibility. The findings are useful for advancing computational auditory scene analysis, hearing-aid/cochlear-implant designs, and understanding of speech perception under "cocktail-party" conditions.
SN  - 1520-8524
UR  - https://www.unboundmedicine.com/medline/citation/21476677/Improvement_of_intelligibility_of_ideal_binary_masked_noisy_speech_by_adding_background_noise_
DB  - PRIME
DP  - Unbound Medicine
ER  -