Publications



2014

Pavlakos, Georgios; Theodorakis, Stavros; Pitsikalis, Vassilis; Katsamanis, Athanasios; Maragos, Petros

Kinect-based multimodal gesture recognition using a two-pass fusion scheme Conference

2014 IEEE International Conference on Image Processing, ICIP 2014, 2014, ISBN: 9781479957514.

Tags: HMMs, Multimodal fusion, multimodal gesture recognition, speech recognition

2013

Rodomagoulakis, I; Giannoulis, P; Skordilis, Z I; Maragos, P; Potamianos, G

Experiments on far-field multichannel speech processing in smart homes Conference

2013 18th International Conference on Digital Signal Processing, DSP 2013, 2013, ISBN: 9781467358057.

Tags: Array processing, Microphone arrays, Smart homes, Speech enhancement, speech recognition, Voice activity detection

2011

Dimitriadis, Dimitrios; Maragos, Petros; Potamianos, Alexandros

On the effects of filterbank design and energy computation on robust speech recognition Journal Article

IEEE Transactions on Audio, Speech and Language Processing, 19 (6), pp. 1504–1516, 2011, ISSN: 15587916.

Tags: Bandpass filters, cepstrum analysis, error analysis, parameter estimation, Robustness, Spectral analysis, speech processing, speech recognition, time-frequency analysis

2009

Dimitriadis, D; Metallinou, A; Konstantinou, I; Goumas, G; Maragos, P; Koziris, N

GridNews: A distributed automatic Greek broadcast transcription system Conference

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, 2009, ISSN: 15206149.

Tags: Computer architecture, Distributed database systems, Multimedia systems, speech recognition, User interface

2006

Katsamanis, A; Papandreou, G; Pitsikalis, V; Maragos, P

Multimodal fusion by adaptive compensation for feature uncertainty with application to audiovisual speech recognition Conference

Proc. 14th European Signal Processing Conference (EUSIPCO-2006), Florence, Italy, Sep. 2006, ISSN: 22195491.

Tags: Adaptive compensation, Audio visual speech recognition, Complementary features, Environmental conditions, Feature measurement, Feature uncertainty, Measurement Noise, Multi-modal fusion, Multiple streams, Probabilistic framework, signal processing, speech recognition, Uncertainty analysis

2002

Dimitriadis, D; Maragos, P; Potamianos, A

Modulation features for speech recognition Conference

International Conference on Acoustics, 1, 2002.

Tags: acoustic features, acoustic processing, ASR systems, automatic speech recognition, cepstral analysis, feature extraction, hidden Markov models, HMM-based word recognition, mel-frequency cepstrum, modulation, modulation type, nonlinear phenomena, nonlinear systems, robust methods, speech analysis, speech production, speech recognition, speech signals, time-varying models, time-varying phenomena, time-varying systems, TIMIT database

Pitsikalis, V; Maragos, P

Speech analysis and feature extraction using chaotic models Conference

International Conference on Acoustics, 1, 2002.

Tags: acoustic signal processing, cepstral analysis, cepstrum, chaos, chaos theory, chaotic models, feature extraction, generalized hybrid set, hidden Markov models, HMM, multidimensional phase space, multidimensional signal processing, nonlinear acoustic features, nonlinear dynamic systems, nonlinear dynamical systems, short-time acoustic features, speech analysis, speech processing, speech production, speech recognition, speech signal modeling, speech signals, speech synthesis, word recognition

Dimitriadis, D; Maragos, P; Potamianos, A

Modulation features for speech recognition Journal Article

International Conference on Acoustics, 1, pp. I-377–I-380, 2002.

Tags: acoustic features, acoustic processing, ASR systems, automatic speech recognition, cepstral analysis, feature extraction, hidden Markov models, HMM-based word recognition, mel-frequency cepstrum, modulation, modulation type, nonlinear phenomena, nonlinear systems, robust methods, speech analysis, speech production, speech recognition, speech signals, time-varying models, time-varying phenomena, time-varying systems, TIMIT database