Research Interests
1. Information Retrieval System
- Multimedia/Music Information Retrieval, Spoken Document Retrieval
- Keyword Spotting/Detection, Data Mining, Dialog System
2. Human Behavior Signal Processing
- In-Vehicle Human Interface, Robust Speech Recognition for In-Vehicle System
- Signal Processing/Estimation Under Cognitive Stress/Distraction
3. Speech Processing/Recognition
- Large Vocabulary Continuous Speech Recognition, ASR for Wireless/VoIP
- Environmental Robust Processing for ASR, Missing-Feature Theory
4. Statistical Signal Processing, Pattern Recognition, Neural Network
|
Recent and Current Works
1. SpeechFind system (SpeechFind.utdallas.edu)
- Spoken document retrieval system of National Gallery of the Spoken Word
- System integration and programming
- Algorithm development for environmental robustness
2. Feature compensation method based on Gaussian mixture model
- Aiming at robust speech recognition in additive background noise
- Adaptive estimation of parameters using PCMM(Parallel Combined Mixture Model) scheme
- Computational reduction via multiple-model interpolation and mixture-sharing
- Discriminability improvement applying MCE(Minimum Classification Error) training
3. Mask estimation for missing-feature algorithm
- Aiming at robust speech recognition in unknown noisy environments
- Mask estimation based on Bayesian classifier
- Environment-independent performance via model training using the colored-noise
|
Education and Experiences
1. Experiences
Research Assistant Professor, EE, University of Texas at Dallas, Sept.2007-Present
Research Associate, EE, University of Texas at Dallas, Sept.2005-Aug.2007
-Supervised by Prof. John H. L. Hansen
Post-doctoral Fellow, ECE, Carnegie Mellon University, Sept.2004-Aug.2005
Post-doctoral Fellow, EE, Korea University, Sept.2003-Aug.2004
2. Education
Ph.D., Electronics Engineering, Korea University, Seoul, Korea, Aug. 2003
-Thesis: Model-based Feature Compensation for Robust Speech Recognition in Adverse Environments
-Advised by Prof. Hanseok Ko
M.S., Electronics Engineering, Korea University, Seoul, Korea, Aug. 1998
-Thesis: Spectral Subtraction based on Phonemic and Auditory Properties
-Advised by Prof. Hanseok Ko
B.S., Electronics Engineering, Korea University, Seoul, Korea, Feb. 1996
|
Publications
1. Journal Papers
[1] Wooil Kim and J.H.L. Hansen, "Missing-Feature Reconstruction by Leveraging Temporal Spectral Correlation for Robust Speech Recognition in Background Noise Conditions," IEEE Transactions on Audio, Speech, and Language Processing, 2010.
[2] Wooil Kim and J.H.L. Hansen, "Phonetic Distance Based Confidence Measure," IEEE Signal Processing Letters, vol.17, no.2, pp.117-120, Feb. 2010.
[3] Wooil Kim and J.H.L. Hansen, "Time-Frequency Correlation Based Missing-Feature Reconstruction for Robust Speech Recognition in Band-Restricted Conditions," IEEE Transactions on Audio, Speech, and Language Processing, vol.17, no.7, pp.1292-1304, Sept. 2009.
[4] Wooil Kim and J.H.L. Hansen, "Feature Compensation in the Cepstral Domain Employing Model Combination," Speech Communication, vol.51, no.2, pp.83-96, Feb. 2009.
[5] Wooil Kim and J.H.L. Hansen, "Feature Compensation Employing Multiple Environmental Models for Robust In-Vehicle Speech Recognition," IEICE Trans. Information and Systems, Vol. E91-D, No. 3, pp. 430-438, March 2008.
[6] Wooil Kim and H. Ko, "Noise Variance Estimation for Kalman Filtering of Noisy Speech," IEICE Trans. Information and Systems, Vol. E84-D, No. 1, pp. 155-160, Jan, 2001.
[7] Wooil Kim, S. Kang, and H. Ko, "Spectral subtraction based on phonetic dependency and masking effects," IEE Proc.-Vision Image and Signal Processing, Vol. 147, No. 5 , pp.423-427, Oct. 2000.
2. Conference Papers (Peer-Reviewed)
[1] Wooil Kim and J. H. L. Hansen, "Angry Emotion Detection from Real-Life Conversational Speech by Leveraging Content Structure," ICASSP-2010, Dallas, U.S.A., March 2010.
[2] Wooil Kim and J. H. L. Hansen, "Mask Estimation Employing Posterior-Based Representative Mean for Missing-Feature Speech Recognition with Time-Varying Background Noise," IEEE ASRU-2009, pp. 194-198, Merano, Italy, Dec. 2009.
[3] Wooil Kim and J. H. L. Hansen, "Variational Model Composition for Robust Speech Recognition with Time-Varying Background Noise," Interspeech-2009, pp. 2399-2402, Brighton, UK, Sept. 2009.
[4] Wooil Kim and J. H. L. Hansen, "Robust Angry Speech Detection Employing TEO-Based Discriminative Classifier Combination," Interspeech-2009, pp. 2019-2022, Brighton, UK, Sept. 2009.
[5] Wooil Kim and J. H. L. Hansen, "Missing-Feature Method for Speaker Recognition in Band-Restricted Conditions," Interspeech-2008, pp.1909-1912, Brisbane, Australia, Sept. 2008.
[6] J.H.L. Hansen, Wooil Kim, and P. Angkititrakul, "Advances in Human-Machine Systems for In-Vehicle Environments," IEEE HSCMA-2008: Hands-free Speech Communication and Microphone Arrays, pp. 128-131, Trento, Italy, May 2008.
[7] Wooil Kim and J. H. L. Hansen, "Advances in Spoken Document Retrieval for the U. S. Collaborative Digitization Program," IEEE ASRU-2007, pp.687-692, Kyoto, Japan, Dec. 2007.
[8] Wooil Kim and J. H. L. Hansen, "Advances in SpeechFind: Transcript Reliability Estimation Employing Confidence Measure based on Discriminative Sub-word Model for SDR," Interspeech-2007, pp.2409-2412, Antwerp, Belgium, Aug. 2007.
[9] Wooil Kim, M. Akbacak and J. H. L. Hansen, "Advances in SpeechFind: CRSS-UTD Spoken Document Retrieval System," ACM SIGIR 2007 Workshop, Amsterdam, Netherlands, July 2007.
[10] Wooil Kim and J. H. L. Hansen, "Missing-Feature Reconstruction for Band-Limited Speech Recognition in Spoken Document Retrieval," Interspeech-2006, pp.2306-2309, Pittsburgh, U.S.A., Sep. 2006.
[11] Wooil Kim and R. M. Stern, "Band-Independent Mask Estimation for Missing-Feature Reconstruction in the Presence of Unknown Background Noise," ICASSP-2006, pp.305-308, Toulouse, France, May 2006.
[12] Wooil Kim, R. M. Stern and H. Ko, "Environment-Independent Mask Estimation for Missing-Feature Reconstruction," Interspeech-2005, pp.2637-2640, Lisbon, Portugal, Sep. 2005.
[13] Wooil Kim, O. Kwon and H. Ko, "PCMM-based Feature Compensation Schemes Using Model Interpolation and Mixture Sharing," ICASSP-2004, pp.989-992, Montreal, Canada, May 2004.
[14] Wooil Kim, S. Ahn and H. Ko, "Feature Compensation Scheme Based on Parallel Combined Mixture Model," Eurospeech-2003, pp. 677-680, Geneva, Switherland, Sep. 2003.
[15] Wooil Kim and H. Ko, "Improved Acoustic Modeling Based on Selective Data-Driven PMC," ICASSP-2002, Student Forum, Orlando, U.S.A., May 2002.
[16] Wooil Kim, T. Kim, S. Ahn and H. Ko, "Model Based Stress Decision Method," Eurospeech-2001, Vol. 1, pp.107-110, Alborg, Denmark, Sep. 2001.
3. Book Chapter
[1] Wooil Kim and J. H. L. Hansen, "SpeechFind: Advances in Rich Content Based Spoken Document Retrieval," Chapter 17 of Handbook of Research on Digital Libraries: Design, Development, and Impact, pp.173-187, IGI Global, 2009.
[2] Wooil Kim and J. H. L. Hansen, "Feature Compensation Employing Model Combination for Robust Speech Recognition in In-Vehicle Environment," Chapter 19 of In-Vehicle Corpus and Signal Processing for Driver Behavior, pp.233-243, Springer, 2008.
4. Etc.
|