Home | About me | Research | Publications | CV | Misc | CRSS | UT Dallas

Publications

Journal Articles
  • Seyedmahdad Mirsamadi and John H.L. Hansen, "Multi-domain adversarial training of neural network acoustic models for distant speech recognition", in Speech Communication, Vol. 106, pp. 21--30, Jan. 2019.. (pdf)

  • Seyedmahdad Mirsamadi and John H.L. Hansen, "A generalized nonnegative tensor factorization approach for distant speech recognition with distributed microphones", in IEEE/ACM Transactions on Audio, Speech, and Language Processing, Vol. 24, No. 10, Oct. 2016. (pdf)

  • Seyedmahdad Mirsamadi, Shabnam Ghaffarzadegan, Hamid Sheikhzadeh, Seyed Mohammad Ahadi, and Amir Hossein Rezaie, "Efficient frequency domain implementation of non-causal multichannel blind deconvoltion for convolutive mixtures of speech", in IEEE Transactions on Audio, Speech and Language Processing, Vol. 20, No. 8, Oct. 2012. (pdf)

  • Conference Papers
  • Seyedmahdad Mirsamadi and John H.L. Hansen, "On multi-domain training and adaptation of end-to-end RNN acoustic models for distant speech recognition", Interspeech 2017. (pdf)

  • Seyedmahdad Mirsamadi, Emad Barsoum and Cha Zhang "Automatic speech emotion recognition using recurrent neural networks with local attention", ICASSP 2017. (pdf)

  • Seyedmahdad Mirsamadi and Ivan Tashev, "Causal speech enhancement combining data-driven learning and suppression rule estimation", Interspeech 2016. (pdf)

  • Seyedmahdad Mirsamadi and John H.L. Hansen, "A study on deep neural network acoustic model adaptation for robust far-field speech recognition", in Proc. Interspeech 2015, Dresden, Germany, Sep. 6-10, 2015. (pdf)

  • Seyedmahdad Mirsamadi and John H.L. Hansen, "Multichannel feature enhancement in distributed microphone arrays for robust distant speech recognition in smart rooms", in IEEE Spoken Language Technology (SLT) workshop, 2014. (pdf)

  • Seyedmahdad Mirsamadi and John H.L. Hansen, "Multichannel speech dereverberation based on convolutive nonnegative tensor factorization for ASR applications", in Proc. Interspeech 2014, Max Atria, Singapore, Sep. 14-18, 2014, pp. 2828--2832. (pdf)