Microphone array speech processing

Overview

We are working on microphone array processing and distant speech recognition, aiming to create hands-free, voice-enabled interfaces for home automation control. The user will be able to control appliances and perform actions without having to move from his/her place, by using their voice. For this purpose, microphone array processing is employed, with microphones placed on walls and ceiling. Our research is focused on acoustic speaker localization, voice activity detection, acoustic event detection, speech enhancement/beamforming, activation keyword spotting and distant speech recognition. We have also collected a distant speech database in Greek that is publicly available: ATHENA database

People

Publications

Software

Some of our tools are publicly available via GitHub:

  • Multi-channel speech enhancementPlease cite:Z. I. Skordilis, A. Tsiami, P. Maragos, G. Potamianos, L. Spelgatti and R. Sannino,
    Multichannel Speech Enhancement Using MEMS Microphones,
    Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing 2015, Brisbane, Australia, (ICASSP-2015).andS. Lefkimmiatis and P. Maragos,
    A Generalized Estimation for Linear and and Nonlinear Microphone Array Post-Filters Speech Communication, vol.49, pp.657-666, 2007.
  • Sweet Home Listen: A distant speech recognition system for home automation controlPlease cite:A. Katsamanis, I. Rodomagoulakis, G. Potamianos, P. Maragos and A. Tsiami,
    Robust Far-Field Spoken Command Recognition for Home Automation Combining Adaptation and Multichannel Processing ,
    Proc. Int’l. Conf. on Acoustics, Speech and Signal Processing (ICASSP-2014), Florence, Italy, May 2014.

Data

We have collected a real distant speech corpus in Greek that is publicly available.
The description and reference for the database is:

  • A. Tsiami, I. Rodomagoulakis, P. Giannoulis, A. Katsamanis, G. Potamianos and P. Maragos,
    ATHENA: A Greek Multi-Sensory Database for Home Automation Control ,
    Proc. 15th Annual Conf. of International Speech Communication Association (INTERSPEECH-2014), Singapore, Sep. 2014.
    For more information visit: ATHENA database
2018-09-21T07:43:55+00:00