We work on microphone array processing and distant speech recognition, with the goal of building hands-free, voice-enabled interfaces for home automation control. Users will be able to control appliances and trigger actions by voice, without moving from where they are. To this end, we employ arrays of microphones placed on the walls and ceiling. Our research focuses on acoustic speaker localization, voice activity detection, acoustic event detection, speech enhancement/beamforming, activation keyword spotting, and distant speech recognition. We have also collected a distant speech database in Greek, the ATHENA database, which is publicly available.
Z. I. Skordilis, A. Tsiami, P. Maragos, G. Potamianos, L. Spelgatti and R. Sannino, Multichannel Speech Enhancement Using MEMS Microphones,
Proc. IEEE Int’l Conf. on Acoustics, Speech, and Signal Processing (ICASSP-2015), Brisbane, Australia, Apr. 2015.
A. Tsiami, I. Rodomagoulakis, P. Giannoulis, A. Katsamanis, G. Potamianos and P. Maragos, ATHENA: A Greek Multi-Sensory Database for Home Automation Control,
Proc. 15th Annual Conf. of International Speech Communication Association (INTERSPEECH-2014), Singapore, Sep. 2014.
P. Giannoulis, A. Tsiami, I. Rodomagoulakis, A. Katsamanis, G. Potamianos, P. Maragos, The ATHENA-RC system for speech activity detection and speaker localization in the DIRHA smart home, Proc. 4th Joint Workshop on Hands-free Speech Communication and Microphone Arrays (HSCMA-2014).
I. Rodomagoulakis, G. Potamianos and P. Maragos, Advances In Large Vocabulary Continuous Speech Recognition In Greek: Modeling And Nonlinear Features,
Proc. 21st European Signal Processing Conference (EUSIPCO-2013), Marrakech, Morocco, Sep. 2013.
Some of our tools are publicly available via GitHub:
Multi-channel speech enhancement
Please cite:
Z. I. Skordilis, A. Tsiami, P. Maragos, G. Potamianos, L. Spelgatti and R. Sannino, Multichannel Speech Enhancement Using MEMS Microphones,
Proc. IEEE Int’l Conf. on Acoustics, Speech, and Signal Processing (ICASSP-2015), Brisbane, Australia, Apr. 2015.
and
S. Lefkimmiatis and P. Maragos, A Generalized Estimation Approach for Linear and Nonlinear Microphone Array Post-Filters,
Speech Communication, vol. 49, pp. 657-666, 2007.
Sweet Home Listen: A distant speech recognition system for home automation control
Please cite:
A. Katsamanis, I. Rodomagoulakis, G. Potamianos, P. Maragos and A. Tsiami, Robust Far-Field Spoken Command Recognition for Home Automation Combining Adaptation and Multichannel Processing,
Proc. Int’l. Conf. on Acoustics, Speech and Signal Processing (ICASSP-2014), Florence, Italy, May 2014.
Data
We have collected a real distant speech corpus in Greek that is publicly available.
The database is described in the following reference:
A. Tsiami, I. Rodomagoulakis, P. Giannoulis, A. Katsamanis, G. Potamianos and P. Maragos, ATHENA: A Greek Multi-Sensory Database for Home Automation Control,
Proc. 15th Annual Conf. of International Speech Communication Association (INTERSPEECH-2014), Singapore, Sep. 2014.
For more information visit: ATHENA database