HCS 7367 Speech Perception Lab - Fall 2004

Books

Kent, R.D. & Read, C. (2001). The Acoustic Analysis of Speech. (Singular).

Stevens, K.N. (1999). Acoustic Phonetics (Current Studies in Linguistics). M.I.T. Press.

Hardcastle, W. and Laver, J. (1997). The Handbook of Phonetic Sciences. Blackwell.

Lass, N.J. (Ed.) (1996). Principles of Experimental Phonetics. St. Louis: Mosby-Year Book Inc.

Papers

Assmann, P.F. and Summerfield, A.Q. (2004). The perception of speech under adverse conditions. In S. Greenberg, W.A. Ainsworth, A.N. Popper and R.R. Fay (Eds.) Speech Processing in the Auditory System. Volume 14, Springer Handbook of Auditory Research.

Assmann, P.F. (1999). Fundamental frequency and the intelligibility of competing voices. Proceedings of the 14th International Congress of Phonetic Sciences, San Francisco, Aug. 1-7, 1999, pp. 179-182.

Assmann, P.F. and Katz, W.F. (2000). Time-varying spectral change in the vowels of children and adults. J. Acoust. Soc. Am. 108(4): 1856-1866.

Assmann, P.F. and Nearey, T.M. (1987). Perception of front vowels: The role of harmonics in the first formant region. J. Acoust. Soc. Am. 81, 520-534.

Assmann, P.F. and Summerfield, A. Q. (1990). Modeling the perception of concurrent vowels: Vowels with different fundamental frequencies. J. Acoust. Soc. Am. 88: 680-697.

Bradlow, A.R., Bent, T. (2002). The clear speech effect for non-native listeners. J. Acoust. Soc. Am. 112(1): 272-284.

Blumstein SE, Stevens KN. (1979). Acoustic invariance in speech production: evidence from measurements of the spectral characteristics of stop consonants. J. Acoust. Soc. Am. 66(4): 1001-1017.

Bronkhorst AW, Plomp R. (1992). Effect of multiple speechlike maskers on binaural speech recognition in normal and impaired hearing. J. Acoust. Soc. Am. 92(6): 3132-3139.

Childers, D. and Lee, C. (1991). Vocal quality factors: Analysis, synthesis, and perception. J. Acoust. Soc. Am. 90, 2394-2410.

Darwin, C.J. and Carlyon, R.P. (1995). Auditory Grouping. In The Handbook of Perception and Cognition, Volume 6, Hearing, edited by B.C.J. Moore (Academic, London).

Darwin, C.J. (1990). Environmental influences on speech perception. In Advances in Speech, Hearing and Language Processing, Volume 1, pp. 219-241.

Delgutte, B. (1999). Auditory neural processing of speech. In The Handbook of Phonetic Sciences, edited by WJ Hardcastle and J Laver, Blackwell: Oxford.

Delgutte, B., Hammond, B.M., Kalluri, S., Litvak, L.M., and Cariani, P. (1996). Neural encoding of temporal envelope and temporal interactions in speech. In Proceedings of the Workshop on the Auditory Basis of Speech Perception, Eds W. Ainsworth and S. Greenberg.

Deeks JM, Carlyon RP (2004). Simulations of cochlear implant hearing using filtered harmonic complexes: Implications for concurrent sound segregation. J. Acoust. Soc. Am. 115, 1737-1746.

Dorman, M.F., Loizou, P.C. and Rainey, D. (1997). Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputs. J. Acoust. Soc. Am. 102, 2403-2410.

Driver, J. (1996). Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading. Nature 381: 66-67.

Evans, BG, Iverson P (2004). Vowel normalization for accent: An investigation of best exemplar locations in northern and southern British English sentences. J. Acoust. Soc. Am. 115(1): 352-361.

Festen, J.M. & Plomp, R. (1990). Effects of fluctuating noise and interfering speech on the speech-reception threshold for impaired and normal hearing. J. Acoust. Soc. Am. 88(4): 1725-1736.

Fishman, K., Shannon, R.V. and Slattery, W. (1997). Speech recognition as a function of the number of electrodes used in the SPEAK cochlear implant speech processor. J. Speech Hearing Res. 40, 1201-1215.

Fourakis, M., Geers, A. and Tobey, E. (1993). An acoustic metric for assessing the change in vowel production by profoundly hearing-impaired children. J. Acoust. Soc. Am. 94, 2544-2552.

Fruchter, D. and Sussman, H. (1997). The perceptual relevance of locus equations. J. Acoust. Soc. Am. 102, 2997-3008.

Geers, A., Brenner, C., & Davidson, L. (2003). Factors associated with development of speech perception skills in children implanted by age five. Ear and Hearing, 24(1), 24-36.

Grant KW, Walden BE, Seitz PF (1998). Auditory-visual speech recognition by hearing-impaired subjects: consonant recognition, sentence recognition, and auditory-visual integration. J Acoust Soc Am 103(5): 2677-2690.

Glasberg, B.R. & Moore, B.C. (1989). Psychoacoustic abilities of subjects with unilateral and bilateral cochlear hearing impairments and their relationship to the ability to understand speech. Scandinavian Audiology Supplementum 32: 1-25.

Glasberg, B. R., & Moore, B. C. J. (1990). Derivation of auditory filter shapes from notched-noise data. Hearing Research, 47, 103-138.

Greenberg, S. (1996). Auditory processing of speech. In Lass (1996).

Greenberg, S. (1996). Understanding speech understanding: Towards a unified theory of speech perception. In: Proceedings of the ESCA Workshop on the Auditory Basis of Speech Perception, Eds. W. Ainsworth and S. Greenberg.

Hillenbrand J, Getty LA, Clark MJ and Wheeler, K (1995). Acoustic characteristics of American English vowels. J. Acoust. Soc. Am. 97, 3099-3111.

Hillenbrand, J. and Nearey, T. (1999). Identification of resynthesized /hVd/ utterances: Effects of formant contour. J. Acoust. Soc. Am., 105, 3509-3520.

Hillenbrand JM, Clark MJ, Nearey TM. (2001). Effects of consonant environment on vowel formant patterns. J Acoust Soc Am. 109(2): 748-763.

Jenkins, J.J., Strange, W., and Trent, S.A. (1999). Context-independent dynamic information for the perception of coarticulated vowels. J. Acoust. Soc. Am. 106(1): 438-448.

Jenkins, J.J., Strange, W., & Miranda, S. (1994). Vowel identification in mixed-speaker silent-center syllables. J. Acoust. Soc. Am. 95: 1030-1043.

Karsten, S.A. and Turner, C.W. (2000). Binaural speech recognition and the Stenger effect. Journal of Speech, Hearing and Language Research 43: 926-933.

Katz, W.F. and Assmann, P.F. (2001). Identification of children's and adults' vowels: Intrinsic fundamental frequency, fundamental frequency dynamics, and presence of voicing. Journal of Phonetics 29, 23-51.

Kawahara, H. (1996). Speech Transformation using Adaptive Interpolation of Time-frequency Representation and All-Pass Filters. ATR Technical Report #200, http://www.hip.atr.co.jp/publications/Abstruct/Abstruct.200.html

Kawahara, H. (1997). Speech representation and transformation using adaptive interpolation of weighted spectrum: vocoder revisited. Proc. ICASSP-97.

Kawahara, H. (1998). Wavelet transform, Gabor transform and splines in a very high-quality speech transformation method, STRAIGHT, from IJCAI-CASA workshop on wavelets. http://www.sys.wakayama-u.ac.jp/~kawahara/wavelet/straightwavelet.ps.gz

Kent, R.D., Dembowski, J. and Lass, N.J. (1996). The acoustic characteristics of American English. In Lass (1996), Chapter 5, pp. 185-225.

Kewley-Port, D. (1983). Time-varying features as correlates of place of articulation in stop consonants. J. Acoust. Soc. Am., 73, 322-335.

Kewley-Port, D., & Zheng, Y. (1998). Auditory models of formant frequency discrimination for isolated vowels. J. Acoust. Soc. Am. 103(3), 1654-1666.

Klatt, D.H. (1980). Software for cascade/parallel formant synthesizer. J. Acoust. Soc. Am. 67, 971-995.

Klatt, D. H. (1989). Review of selected models of speech perception. In W. Marslen-Wilson (Ed.) Lexical representation and process, pp.169-226. Cambridge, MA : MIT Press.

Klatt, D.H. and Klatt, L.C. (1990). Analysis, synthesis, and perception of voice quality variations among female and male talkers. J. Acoust. Soc. Am. 87: 820-857.

Krause JC, Braida LD. (2002). Investigating alternative forms of clear speech: the effects of speaking rate and speaking mode on intelligibility. J. Acoust. Soc. Am. 112(5): 2165-2172.

Kuhl, P.K. (1993). Innate predispositions and the effects of experience in speech perception: The native language magnet theory. In Developmental Neurocognition: Speech and face processing in the first year of life, edited by de Boysson-Bardies, B., de Schonen, S., Jusczyk, P., MacNeilage, P., & Morton, J. Dordrecht: Kluwer (pp. 259-274).

Lee, S., Potamianos, A. and Narayanan, S. (1999). Acoustics of children's speech: Developmental changes of temporal and spectral parameters. J. Acoust. Soc. Am. 105(3): 1455-1468.

Leek, M.R., Dorman, M.F., & Summerfield, Q. (1987). Minimum spectral contrast for vowel identification by normal-hearing and hearing-impaired listeners. J. Acoust. Soc. Am. 81, 148-154.

Liberman, A.M. and Mattingly, I.G. (1989). A specialization for speech perception. Science, 243, 489-494.

Liberman, A.M., & Mattingly, I.G. (1985). The motor theory of speech perception revised. Cognition, 21, 1-36.

Lippmann, R.P. (1996). Speech perception by humans and machines. In: Proceedings of the Workshop on the Auditory Basis of Speech Perception, Eds W. Ainsworth and S. Greenberg.

Lively, SE & Pisoni, DB (1997). On prototypes and phonetic categories: A critical assessment of the perceptual magnet effect in speech perception. Journal of Experimental Psychology: Human Perception and Performance 23(6), 1665-1679.

Loizou, P. and Poroy, O. (2001). Minimum spectral contrast needed for vowel identification by normal-hearing and cochlear implant listeners. J. Acoust. Soc. Am. 110(3), 1619-1627.

Lotto, AJ, Kluender, KR & Holt, LL (1998). Depolarizing the perceptual magnet effect. J. Acoust. Soc. Am. 103(6), 3648-3655.

Lunner T, Hellgren J, Arlinger S, Elberling C. (1998). Non-linear signal processing in digital hearing aids. Scand Audiol Suppl. 49: 40-49.

Mackersie C.L. (2002). The relationship between pure-tone sequential stream segregation and perceptual separation of male and female talkers by listeners with hearing loss. Proceedings of the 7th International Conference on Spoken Language Processing, Denver CO, Sept 16-20, 2002, pp. 429-432.

Meddis, R. (1988). Simulation of auditory-neural transduction: Further studies. J. Acoust. Soc. Am., 83, 1056-1063.

McCandliss, B.D., Fiez, J.A., Protopapas, A., Conway, M., & McClelland, J.L. (2002). Success and failure in teaching the [r]-[l] contrast to Japanese adults: Tests of a Hebbian model of plasticity and stabilization in spoken language perception. Cognitive, Affective, & Behavioral Neuroscience, 2(2), 89-108.

Miller, J.D. (1989). Auditory-perceptual interpretation of the vowel. J. Acoust. Soc. Am. 85: 2114-2134.

Monsen, R.B. (1976). Normal and reduced phonological space: The production of English vowels by deaf adolescents. Journal of Phonetics, 4, 189-198.

Moore, B. C. J., & Glasberg, B. R. (1987). Formulae describing frequency selectivity as a function of frequency and level, and their use in calculating excitation patterns. Hearing Research, 28, 209-225.

Moore, JK (2002). Maturation of human auditory cortex: Implications for speech perception. Ann. Otol. Rhinol. Laryngol. 111: 7-10.

Munro K, Lutman M. (2003). The effect of speech presentation level on measurement of auditory acclimatization to amplified speech. J. Acoust. Soc. Am. 114(1): 484-495.

Nabelek, A. (1988). Identification of vowels in quiet, noise, and reverberation: Relationships with age and hearing loss. J. Acoust. Soc. Am. 84, 476-484.

Nabelek, A.K., Czyzewski, Z, & Krishnan, L.A. (1992). The influence of talker differences on vowel identification by normal-hearing and hearing-impaired listeners. J. Acoust. Soc. Am. 1228-1246.

Nearey, T.M. (1989). Static, dynamic, and relational properties in vowel perception. J. Acoust. Soc. Am. 85: 2088-2113.

Nearey, T. and Assmann, P. (1986). Modeling the role of inherent spectral change in vowel identification. J. Acoust. Soc. Am. 80, 1297-1308.

Nelson PB, Jin SH (2004). Factors affecting speech understanding in gated interference: Cochlear implant users and normal-hearing listeners. J. Acoust. Soc. Am. 115(5): 2286-2294.

Osberger, M.J. (1987). Training effects on vowel production by two profoundly hearing-impaired speakers. J. Sp. Hear. Res., 30, 241-251.

Patterson, R. D., Allerhand, M. H., & Giguere, C. (1995). Time-domain modeling of peripheral auditory processing: A modular architecture and a software platform. J. Acoust. Soc. Am. 98(4), 1890-1894.

Pena M, Bonatti LL, Nespor M, Mehler J (2002). Signal-driven computations in speech processing. Science 298(5593): 604-607.

Perry, T.L., Ohde, R.N., Ashmead, D.H. (2001). The acoustic bases for gender identification from children's voices. J. Acoust. Soc. Am. 109: 2988-2998.

Picheny, M.A., Durlach, N.I. and Braida, L.D. (1986). Speaking clearly for the hard of hearing II: Acoustic characteristics of clear and conversational speech. J. Speech Hear. Res. 29, 434-446.

Qin MK, Oxenham AJ (2003). Effects of simulated cochlear-implant processing on speech reception in fluctuating maskers. J. Acoust. Soc. Am. 114: 446-454.

Shannon, R. V., Zeng, F-G., and Wygonski, J. (1998). Speech recognition with altered spectral distribution of envelope cues. J. Acoust. Soc. Am. 103, 2467-2476.

Sheffert SM, Pisoni DB, Fellowes JM, Remez RE. (2002). Learning to recognize talkers from natural, sinewave, and reversed speech samples. J Exp Psychol Hum Percept Perform. 28(6): 1447-1469.

Smits, R., ten Bosch, L., and Collier, R. (1996a). Evaluation of various sets of acoustic cues for the perception of prevocalic stop consonants. I. Perception experiment. J. Acoust. Soc. Am. 100: 3852-3864.

Smits, R., ten Bosch, L., and Collier, R. (1996b). Evaluation of various sets of acoustic cues for the perception of prevocalic stop consonants. II. Modeling and evaluation. J. Acoust. Soc. Am. 100: 3865-3881.

Stickney, G. and Assmann, P.F. (2001). Acoustic and linguistic factors in the perception of bandpass-filtered speech. J. Acoust. Soc. Am. 109(3): 1157-1165.

Strange, W. (1989). Evolving theories of vowel perception. J. Acoust. Soc. Am. 85: 2081-2087.

Strange, W., and Bohn, O. (1998). Dynamic specification of coarticulated German vowels: Perceptual and acoustical studies. J. Acoust. Soc. Am. 104(1): 488-504.

Summerfield, Q. (1991). Visual perception of phonetic gestures. In: Modularity and the Motor Theory of Speech Perception. Edited by Mattingly, I. and Studdert-Kennedy, M., Ch. 6, pp. 117-138. L. Erlbaum & Associates: Hillsdale, N.J.

Summerfield, Q. and Assmann, P.F. (1989). Auditory enhancement and the perception of concurrent vowels. Perception and Psychophysics 45, 529-536.

Summerfield, Q., Culling, J.F. and Assmann, P.F. (1996). The perception of speech under adverse conditions: Contributions of spectro-temporal peaks, periodicity, and inter-aural timing to perceptual robustness. In: Proceedings of the Workshop on the Auditory Basis of Speech Perception, Eds W. Ainsworth and S. Greenberg.

Summers, W., & Leek, M. (1992). The role of spectral and temporal cues in vowel identification by listeners with impaired hearing. Journal of Speech and Hearing Research, 35, 1189-1199.

Sussman HM, McCaffrey HA, Matthews SA (1991). An investigation of locus equations as a source of relational invariance for stop place categorization. J. Acoust. Soc. Am. 90(3): 1309-1325.

Sussman HM, Fruchter D, Hilbert J, Sirosh J. (1998). Linear correlates in the speech signal: the orderly output constraint. Behav Brain Sci 21(2): 241-259.

Van Tasell, D., Fabry, D., & Thibodeau, L. (1987). Vowel identification and vowel masking patterns of hearing-impaired subjects. J. Acoust. Soc. Am. 81, 1586-1597.

Werker, J.F. and Tees, R.C. (1992). The organization and reorganization of human speech perception. Annual Review of Neuroscience 15: 377-402.

Werker, J. (1989). Becoming a native listener. American Scientist 77, 54-59.

Werker, J.F. & Polka, L. (1993). Developmental changes in speech perception: new challenges and new directions. Journal of Phonetics, 21, 83-101.
 
Aslin RN, Werker JF, Morgan JL. (2002). Innate phonetic boundaries revisited. J. Acoust. Soc. Am. 112(4): 1257-1260.

Binder J. (2000). Comment on: The new neuroanatomy of speech perception. Brain 123(12): 2371-2372.

Chambers KE, Onishi KH, Fisher C. (2003). Infants learn phonotactic regularities from brief auditory experience. Cognition 87(2): B69-B77.

de Boer, B. and Kuhl, P.K. (2003). Investigating the role of infant-directed speech with a computer model. Acoustics Research Letters Online 4(4): 112-134.

Doupe A.J., Kuhl P.K. (1999). Birdsong and human speech: common themes and mechanisms. Annual Rev. Neurosci. 22: 567-631.

Fenn KM, Nusbaum HC, Margoliash D (2003). Consolidation during sleep of perceptual learning of spoken language. Nature 425: 614-615.

Flege, JE, MacKay, I.R.A., Meador, D. (1999). Native Italian speakers' perception and production of English vowels. J. Acoust. Soc. Am. 106 (5): 2973-2987.

Flege, JE, Schirru, C. and MacKay, I.R.A. (2003). Interaction between the native and second language phonetic subsystems. Speech Communication 40: 467-491.

Fowler C., Brown J., and Mann V. (2000). Contrast effects do not underlie effects of preceding liquids on stop-consonant identification by humans. Journal of Experimental Psychology: Human Perception and Performance. 26(3):877-888.

Fu Q.J., Shannon R.V., Wang X. (1998). Effects of noise and spectral resolution on vowel and consonant recognition: acoustic and electric hearing. J. Acoust. Soc. Am. 104: 3586-3596.

Goldstein, MF, King, AP and West, MJ (2003). Social interaction shapes babbling: Testing parallels between birdsong and speech. PNAS 100(13): 8030-8035.

Gordon-Salant, S.; Fitzgibbons, P.J. (1997). Selected cognitive factors and speech recognition performance among young and elderly listeners. Journal of Speech, Language, and Hearing Research 40(2): 423-431.

Iverson P, Kuhl PK, Akahane-Yamada R, Diesch E, Tohkura Y, Kettermann A, Siebert C. (2003). A perceptual interference account of acquisition difficulties for non-native phonemes. Cognition 87: B47-B57.

Ho, A.K., Iansek, R., and Bradshaw J.L. (1999). Regulation of parkinsonian speech volume: the effect of interlocuter distance. J Neurol Neurosurg Psychiatry 67: 199-202.

Johnson, K., Strand, E.A. and D'Imperio, M. (1999). Auditory-visual integration of talker gender in vowel perception. Journal of Phonetics 27, 359-384.

Johnson, K (1990). The role of perceived speaker identity in F0 normalization of vowels. Journal of the Acoustical Society of America, 88, 642-654.

Johnson K (1997). Speech perception without speaker normalization: An exemplar model. In: Talker variability in speech processing, edited by K. Johnson and J. Mullennix. Academic Press.

Johnson EK, Jusczyk PW, Cutler A, Norris D. (2003). Lexical viability constraints on speech segmentation by infants. Cognitive Psychology 46(1): 65-97.

Jusczyk PW, Luce PA (2002). Speech Perception and Spoken Word Recognition: Past and Present. Ear and Hearing 23(1): 2-40.

Kataoka R, Warren DW, Zajac DJ, Mayo R, Lutz RW. (2001). The relationship between spectral characteristics and perceived hypernasality in children. J Acoust Soc Am. 109: 2181-2189.

Katz, W.F. and Bharadwaj, S. (2001). Coarticulation in fricative-vowel syllables produced by children and adults: a preliminary report. Clinical Linguistics and Phonetics 15: 139-143.

Kent, R.D. and Read, C. (1992). The acoustic analysis of speech. Chapters 5 & 6. Singular: San Diego.

Kluender, K.R. and Lotto, A.J. (1999). Virtues and perils of an empiricist approach to speech perception. J. Acoust. Soc. Am. 105: 503-511.

Kraus N, McGee TJ, Koch DB (1998). Speech sound representation, perception and plasticity: A neurophysiologic perspective. Audiology and Neuro-Otology 3: 168-182.

Kraus N., McGee T.J., Koch D.B. (1998). Speech sound perception and learning: biologic bases. Scand. Audiol. Suppl. 49: 7-17.

Koch, DB, McGee, TJ, Bradlow, AR, Kraus, N (1999). Acoustic-Phonetic Approach toward Understanding Neural Processes and Speech Perception. J Am Acad Audiol 10: 304-318.

Kuhl PK. (1991). Human adults and human infants show a "perceptual magnet effect" for the prototypes of speech categories, monkeys do not. Perception and Psychophysics 50(2): 93-107.

Kuhl PK. (2000). A new view of language acquisition. PNAS 97(22): 11850-11857.

Kuhl PK. (2003). Human speech and birdsong: Communication and the social brain. PNAS 100(17): 9645-9646.

Kuhl P.K., Andruski J.E., Chistovich I.A., Chistovich L.A., Kozhevnikova E.V., Ryskina V.L., Stolyarova E.I., Sundberg U., Lacerda F. (1997). Cross-language analysis of phonetic units in language addressed to infants. Science 277: 684-686.

Ladefoged, P., & Broadbent, D. E. (1957). Information conveyed by vowels. Journal of the Acoustical Society of America, 29, 98-104.

Liberman, A. M., & Whalen, D. H. (2000). On the relation of speech to language. Trends in Cognitive Sciences, 4, 187-196.

Lippmann, R.P. (1997). Speech recognition by machines and humans, Speech Commun. 22 (1): 1-15.

Liberman, A.M., Cooper, F.S., Shankweiler, D.P., and Studdert-Kennedy, M. (1967). Perception of the speech code. Psychol. Rev. 74: 431-461.

Loizou, P. (1998). Mimicking the Human Ear: An overview of signal processing techniques used for cochlear implants. IEEE Signal Processing Magazine 15(5), 101-130.

Lotto, A.J., Kluender, K.R. and Holt, L.L. (1997). "Perceptual compensation for coarticulation by Japanese quail (Coturnix coturnix japonica)," J. Acoust. Soc. Am. 102, 1134-1140.

Lotto A.J., Kluender K.R., Holt L.L. (1998). General contrast effects in speech perception: Effect of preceding liquid on stop consonant identification. Perception and Psychophysics 60: 602-619.

Magnuson, J.S., & Nusbaum, H.C. (1993). Talker differences and perceptual normalization. Journal of the Acoustical Society of America, 93, 2371.

MacKay, I.R.A., Flege, JE, Piske T, Schirru, C. (2001). Category restructuring during second-language speech acquisition. J. Acoust. Soc. Am. 110(1): 516-528.

Marian V, Spivey M, Hirsch J. Shared and separate systems in bilingual language processing: Converging evidence from eyetracking and brain imaging. Brain Lang. 86(1): 70-82.

Massaro D.W., Cohen M.M. (1999). Speech perception in perceivers with hearing loss: synergy of multiple modalities. J. Speech Lang. Hear. Res. 42: 21-41.

Maye, J., Werker, J. F. & Gerken, L. A. (2002). Infant sensitivity to distributional information can affect phonetic discrimination. Cognition 82, 101-111.

Meddis, R. (1988). Simulation of auditory-neural transduction: Further studies. J. Acoust. Soc. Am., 83, 1056-1063.

McCandliss, B.D., Fiez, J.A., Protopapas, A., Conway, M., & McClelland, J.L. (2002). Success and failure in teaching the [r]-[l] contrast to Japanese adults: Tests of a Hebbian model of plasticity and stabilization in spoken language perception. Cognitive, Affective, & Behavioral Neuroscience, 2(2), 89-108.

Miller, J.D. (1989). Auditory-perceptual interpretation of the vowel. J. Acoust. Soc. Am. 85: 2114-2134.

Miller J.L., Liberman A.M. (1979). Some effects of later-occurring information on the perception of stop consonant and semivowel. Percept Psychophys 25: 457-465.

Monsen, R.B. (1976). Normal and reduced phonological space: The production of English vowels by deaf adolescents. Journal of Phonetics, 4, 189-198.

Moore, B.C.J. (1995). Perceptual consequences of cochlear damage. (Ch. 7: Speech perception by people with cochlear damage). Oxford Univ. Press: Oxford.

Moore, B.C.J. (1997). Aspects of auditory processing related to speech perception. In  W.J. Hardcastle and J. Laver (eds.). The Handbook of Phonetic Sciences. Blackwell: Oxford.

Moore, B.C.J. (2003). Speech processing for the hearing-impaired: successes, failures, and implications for speech mechanisms. Speech Communication 41: 81-91.

Moore, B. C. J., & Glasberg, B. R. (1987). Formulae describing frequency selectivity as a function of frequency and level, and their use in calculating excitation patterns. Hearing Research, 28, 209-225.

Moore, JK (2002). Maturation of human auditory cortex: Implications for speech perception. Ann. Otol. Rhinol. Laryngol. 111: 7-10.

Mullennix, J.W., Pisoni, D.B., & Martin, C.S. (1989). Some effects of talker variability on spoken word recognition. Journal of the Acoustical Society of America, 85, 365-378.

Nabelek, A. (1988). Identification of vowels in quiet, noise, and reverberation: Relationships with age and hearing loss. J. Acoust. Soc. Am. 84, 476-484.

Nabelek, A.K., Czyzewski, Z, & Krishnan, L.A. (1992). The influence of talker differences on vowel identification by normal-hearing and hearing-impaired listeners. J. Acoust. Soc. Am. 1228-1246.

Nearey, T.M. (1989). Static, dynamic, and relational properties in vowel perception. J. Acoust. Soc. Am. 85: 2088-2113.

Nearey, T.M. (1997). Speech perception as pattern recognition. J. Acoust. Soc. Am. 101: 3241-3254.

Nearey, T. and Assmann, P. (1986). Modeling the role of inherent spectral change in vowel identification. J. Acoust. Soc. Am.  80, 1297-1308.

Nelson PB, Jin SH, Carney AE, Nelson DA. (2003). Understanding speech in modulated interference: cochlear implant users and normal-hearing listeners.

Nittrouer S. (2001). Challenging the notion of innate phonetic boundaries. J. Acoust. Soc. Am. 110(3): 1598-1605.

Nittrouer S. (2002). Innate phonetic boundaries revisited: author reply. J. Acoust. Soc. Am. 112(4): 1261-1264.

Nooteboom, S. (1997). The prosody of speech: melody and rhythm. In  W.J. Hardcastle and J. Laver (eds.). The Handbook of Phonetic Sciences. Blackwell: Oxford.

Norris, D., McQueen, J.M. & Cutler, A. (2000). Merging information in speech recognition: Feedback is never necessary. Behavioral and Brain Sciences 23, 299-370.

Osberger, M.J. (1987). Training effects on vowel production by two profoundly hearing-impaired speakers. J. Sp. Hear. Res., 30, 241-251.

Patterson, R. D., Allerhand, M. H., & Giguere, C. (1995). Time-domain modeling of peripheral auditory processing: A modular architecture and a software platform. J. Acoust. Soc. Am. 98(4), 1890-1894.

Picheny, M.A., Durlach, N.I. and Braida, L.D. (1986). "Speaking clearly for the hard of hearing II: Acoustic characteristics of clear and conversational speech", J. Speech Hear. Res. 29, 434-446.

Robinson, K. (1998). Implications of developmental plasticity for the language acquisition of deaf children with cochlear implants. Int. J. Pediatr. Otorhinolaryngol. 46: 71-80.

Salvi, R.J., Lockwood, A.H., Frisina, R.D., Coad, M.L., Wack, D.S. and Frisina, D.R. (2002). PET imaging of the normal human auditory system: responses to speech in quiet and in background noise. Hear. Res. 170(1-2): 96-106.

Scott SK, Blank CC, Rosen S, Wise RJ. (2000). Identification of a pathway for intelligible speech in the left temporal lobe. Brain 123(12): 2400-2406.

Scott, S.K. and Wise, R.J.S. (2003). Functional imaging and language: A critical guide to methodology and analysis. Speech Communication 41: 7-21.

Shannon R.V., Zeng F-G., Kamath V., Wygonski J., Ekelid M. (1995). Speech recognition with primarily temporal cues. Science 270: 303-304.

Shannon, R. V., Zeng, F-G., and Wygonski, J. (1998). Speech recognition with altered spectral distribution of envelope cues. J. Acoust. Soc. Am.  103, 2467-2476.

Sheffert SM, Pisoni DB, Fellowes JM, Remez RE. (2002). Learning to recognize talkers from natural, sinewave, and reversed speech samples. J Exp Psychol Hum Percept Perform. 28(6): 1447-1469.

Smits, R., ten Bosch, L., and Collier, R. (1996a). Evaluation of various sets of acoustic cues for the perception of prevocalic stop consonants. I. Perception experiment. J. Acoust. Soc. Am. 100: 3852-3864.

Smits, R., ten Bosch, L., and Collier, R. (1996b). Evaluation of various sets of acoustic cues for the perception of prevocalic stop consonants. II. Modeling and evaluation. J. Acoust. Soc. Am. 100: 3865-3881.

Sommers, M.S., Nygaard, L.C., & Pisoni, D.B. (1994). Stimulus variability of spoken word recognition: 1. Effects of variability of speaking rate and overall amplitude. Journal of the Acoustical Society of America, 96, 1314-1324.

Steinschneider M, Fishman YI, and Arezzo JC (2003). Representation of the voice onset time (VOT) speech parameter in population responses within primary auditory cortex of the awake monkey. J. Acoust. Soc. Am. 114: 307-321.

Stevens, K.N. (1997). Articulatory-Acoustic-Auditory Relationships. In  W.J. Hardcastle and J. Laver (eds.). The Handbook of Phonetic Sciences. Blackwell: Oxford.

Stevens, K.N. (2002). Toward a model for lexical access based on acoustic landmarks and distinctive features. J. Acoust. Soc. Am. 111(4): 1872-1891.

Stickney, G. and Assmann, P.F. (2001). Acoustic and linguistic factors in the perception of bandpass-filtered speech. J. Acoust. Soc. Am. 109(3): 1157-1165.

Strange, W. (1989). Evolving theories of vowel perception. J. Acoust. Soc. Am. 85: 2081-2087.

Strange, W., and Bohn, O. (1998). Dynamic specification of coarticulated German vowels: Perceptual and acoustical studies. J. Acoust. Soc. Am. 104(1): 488-504.

Summerfield, Q. (1991). Visual perception of phonetic gestures. In: Modularity and the Motor Theory of Speech Perception. Edited by Mattingly, I. and Studdert-Kennedy, M., Ch. 6, pp. 117-138. L. Erlbaum & Associates: Hillsdale, N.J.

Summerfield, Q. and Assmann, P.F. (1989). Auditory enhancement and the perception of concurrent vowels. Perception and Psychophysics 45, 529-536.

Summerfield, Q., Culling, J.F. and Assmann, P.F. (1996). The perception of speech under adverse conditions: Contributions of spectro-temporal peaks, periodicity, and inter-aural timing to perceptual robustness. In: Proceedings of the Workshop on the Auditory Basis of Speech Perception, Eds W. Ainsworth and S. Greenberg.

Summers, W., & Leek, M. (1992). The role of spectral and temporal cues in vowel identification by listeners with impaired hearing. Journal of Speech and Hearing Research, 35, 1189-1199.

Sussman HM, Fruchter D, Hilbert J, Sirosh J. (1998). Linear correlates in the speech signal: the orderly output constraint. Behav Brain Sci 21(2): 241-259. Discussion, pp. 260-299.

Van Tasell, D., Fabry, D., & Thibodeau, L. (1987). Vowel identification and vowel masking patterns of hearing-impaired subjects. J. Acoust. Soc. Am. 81, 1586-1597.

Werker, J.F. and Tees, R.C. (1992). The organization and reorganization of human speech perception. Annual Review of Neuroscience 15: 377-402.

Werker, J. (1989). Becoming a native listener. American Scientist 77, 54-59.

Werker, J.F. & Polka, L. (1993). Developmental changes in speech perception: new challenges and new directions. Journal of Phonetics, 21, 83-101.

Wingfield, A., Poon, L. W., Lombardi, L., & Lowe, D. (1985). Speed of processing in normal aging: Effects of speech rate, linguistic structure, and processing time. Journal of Gerontology 40, 579-585.

 

