Speech recognition in noise: estimating effects of compressive nonlinearities in the basilar-membrane response.Ear Hear. 2007 Sep; 28(5):682-93.EH
This experiment was designed to estimate effects of cochlear nonlinearities on tonal and speech masking for individuals with normal hearing who have a range of quiet thresholds. Physiological and psychophysical evidence indicates that for signals close to the characteristic frequency (CF) of a place on the basilar membrane, the normal growth of response of the basilar membrane is linear at lower stimulus levels and compressed at medium to higher stimulus levels. In contrast, at moderate to high CFs, the basilar membrane responds more linearly to stimuli at frequencies well below the CF regardless of input level. Thus, the hypothesis tested was that masker effectiveness would change as a function of stimulus level consistent with the underlying basilar membrane response. Specifically, with a fixed-level speech signal and a speech-shaped masker that ranges from low to higher levels, the resulting response of the basilar membrane to the masker would be linear at lower levels and compressed at medium to higher levels. This would result in relatively less effective masking at higher masker levels. It was further hypothesized that the transition from linear to compressed responses to both tones and maskers would occur at higher levels for listeners with higher quiet thresholds than for listeners with lower quiet thresholds.
Tonal thresholds and speech recognition in noise were measured as a function of masker level. A 10-msec, 2.0-kHz tone was presented in a lower frequency masker ranging from 40 to 85 dB SPL. Moderate-level speech was presented in interrupted noise at six levels ranging from 47 to 77 dB SPL. To minimize differences in speech audibility that could arise during the "off" periods of the interrupted noise, a low-level steady-state "threshold-matching noise" was also present during measurement of speech recognition. Subjects were 30 adults with normal hearing with a 20-dB range of average quiet thresholds.
Tonal breakpoints (i.e., the levels corresponding to the transitions from linear to nonlinear responses) were significantly correlated with quiet thresholds, whereas slopes measured above the breakpoints were not. Speech recognition in noise was consistent with the hypothesis that the response of the basilar membrane to the masker was linear at lower levels and compressed at medium to higher levels, resulting in less effective masking at higher masker levels. That is, at lower masker levels, as masker level increased, mean observed speech scores declined as predicted using the articulation index, an audibility-based model. With further increases in masker level, mean scores declined less than predicted. Moreover, for subjects with higher quiet thresholds, masker effectiveness remained constant for a wider range of masker levels than for subjects with lower quiet thresholds, consistent with the hypothesis that the transition from linear to compressed responses occurred at higher levels. Finally, significant negative correlations were obtained between individual subjects' tonal and speech measures.
Results from tonal and speech tasks were consistent with basilar membrane nonlinearities and consistent with changes in nonlinearities with minor threshold elevations, providing support for their role in the understanding of speech in noise with increases in noise level.