Alexandros Potamianos 2024-07-04T15:46:29+00:00


Alexandros Potamianos received the Diploma in Electrical and Computer Engineering from the National Technical University of Athens, Greece, in 1990. He received the M.S. and Ph.D. degrees in Engineering Sciences from Harvard University, Cambridge, MA, USA, in 1991 and 1995, respectively. He received the M.B.A. degree from the Stern School of Business, NYU, in 2002.

From 1991 to June 1993, he was a research assistant at the Robotics Lab, Harvard University. From 1993 to 1995, he was a research assistant at the Digital Signal Processing Lab at Georgia Tech. From 1995 to 1999, he was a Senior Technical Staff Member at the Speech and Image Processing Lab, AT&T Shannon Labs, Florham Park, NJ. From 1999 to 2002, he was a Technical Staff Member and Technical Supervisor at the Multimedia Communications Lab at Bell Labs, Lucent Technologies, Murray Hill, NJ. From 1999 to 2001, he was an adjunct Assistant Professor at the Department of Electrical Engineering of Columbia University, New York, NY. From 2003 to 2013, he was an adjunct Associate Professor at the Department of Electronic and Computer Engineering of the Technical University of Crete, Chania, Greece. In the summer of 2013, he joined the School of Electrical and Computer Engineering at the National Technical University of Athens, Athens, Greece, as an Associate Professor.

His current research interests include speech processing, analysis, synthesis and recognition, dialog and multimodal systems, lexical semantics, nonlinear signal processing, natural language understanding, artificial intelligence and multimodal child-computer interaction.

Prof. Potamianos has authored or co-authored over 110 papers in professional journals and conferences (citations: 2700, h-index: 25, according to Google Scholar as of Sept. 2013). He is a co-author of the paper “Creating conversational interfaces for children”, which received a 2005 IEEE Signal Processing Society Best Paper Award. He is a co-editor of the book “Multimodal Processing and Interaction: Audio, Video, Text”, Springer, 2008. He holds four patents. He has been a member of the IEEE Signal Processing Society since 1992 and a senior member since 2010. He is currently serving his third term on the IEEE Speech and Language Technical Committee and his first term on the IEEE Multimedia Signal Processing Committee.


Recent Research Projects



A Zlatintsi, P Koutras, G Evangelopoulos, N Malandrakis, N Efthymiou, K Pastra, A Potamianos, P Maragos

COGNIMUSE: a multimodal video database annotated with saliency, events, semantics and emotion with application to summarization Journal Article

EURASIP Journal on Image and Video Processing, 54, pp. 1–24, 2017.


G Karamanolakis, E Iosif, A Zlatintsi, A Pikrakis, A Potamianos

Audio-based Distributional Semantic Models for Music Auto-tagging and Similarity Measurement Conference

Proc. MultiLearn2017: Multimodal Processing, Modeling and Learning for Human-Computer/Robot Interaction Workshop, in conjunction with European Signal Processing Conference, Kos, Greece, 2017.



G Karamanolakis, E Iosif, A Zlatintsi, A Pikrakis, A Potamianos

Audio-Based Distributional Representations of Meaning Using a Fusion of Feature Encodings Conference




P Koutras, A Zlatintsi, E Iosif, A Katsamanis, P Maragos, A Potamianos

Predicting Audio-Visual Salient Events Based on Visual, Audio and Text Modalities for Movie Summarization Conference

Proc. IEEE Int'l Conf. Acous., Speech, and Signal Processing, Quebec, Canada, 2015.


A Zlatintsi, E Iosif, P Maragos, A Potamianos

Audio Salient Event Detection and Summarization using Audio and Text Modalities Conference

Nice, France, 2015.


A Zlatintsi, P Koutras, N Efthymiou, P Maragos, A Potamianos, K Pastra

Quality Evaluation of Computational Models for Movie Summarization Conference

Costa Navarino, Messinia, Greece, 2015.


P. Koutras, A. Zlatintsi, E. Iosif, A. Katsamanis, P. Maragos, A. Potamianos

Predicting audio-visual salient events based on visual, audio and text modalities for movie summarization Conference

Proceedings - International Conference on Image Processing, ICIP, 2015-December, 2015, ISSN: 15224880.



A Zlatintsi, P Maragos, A Potamianos, G Evangelopoulos

A Saliency-Based Approach to Audio Event Detection and Summarization Conference

Proc. European Signal Processing Conference, Bucharest, Romania, 2012.



Dimitrios Dimitriadis, Petros Maragos, Alexandros Potamianos

On the effects of filterbank design and energy computation on robust speech recognition Journal Article

IEEE Transactions on Audio, Speech and Language Processing, 19 (6), pp. 1504–1516, 2011, ISSN: 15587916.



Dimitrios Dimitriadis, Alexandros Potamianos, Petros Maragos

A comparison of the squared energy and Teager-Kaiser operators for short-term energy estimation in additive noise Journal Article

IEEE Transactions on Signal Processing, 57 (7), pp. 2569–2581, 2009, ISSN: 1053587X.


G Evangelopoulos, A Zlatintsi, G Skoumas, K Rapantzikos, A Potamianos, P Maragos, Y Avrithis

Video Event Detection and Summarization Using Audio, Visual and Text Saliency Conference

Taipei, Taiwan, 2009.


G Evangelopoulos, A Zlatintsi, G Skoumas, K Rapantzikos, A Potamianos, P Maragos, Y Avrithis

Video Event Detection and Summarization using Audio, Visual and Text Saliency Conference

ICASSP, (2), 2009, ISBN: 9781424423545.



G. Evangelopoulos, K. Rapantzikos, A. Potamianos, P. Maragos, A. Zlatintsi, Y. Avrithis

Movie summarization based on audiovisual saliency detection Conference

Proceedings - International Conference on Image Processing, ICIP, 2008, ISSN: 15224880.


Georgios Evangelopoulos, Konstantinos Rapantzikos, Petros Maragos, Yannis Avrithis, Alexandros Potamianos

Audiovisual Attention Modeling and Salient Event Detection Book Chapter

Maragos, Petros; Potamianos, Alexandros; Gros, Patrick (Ed.): Multimodal Processing and Interaction: Audio, Video, Text, pp. 1–21, Springer US, Boston, MA, 2008, ISBN: 978-0-387-76316-3.



Dimitrios Dimitriadis, Petros Maragos, Alexandros Potamianos

Auditory Teager Energy Cepstrum Coefficients for Robust Speech Recognition Conference

Proc. of European Speech Processing Conference, (2), 2005.



D Dimitriadis, P Maragos, A Potamianos

Modulation features for speech recognition Journal Article

International Conference on Acoustics, 1, pp. I–377–I–380, 2002.


D Dimitriadis, P Maragos, A Potamianos

Modulation features for speech recognition Conference

International Conference on Acoustics, 1, 2002.



Alexandros Potamianos, Petros Maragos

Time-frequency distributions for automatic speech recognition Journal Article

IEEE Transactions on Speech and Audio Processing, 9 (3), pp. 196–200, 2001.



Alexandros Potamianos, Petros Maragos

Speech analysis and synthesis using an AM–FM modulation Journal Article

Speech Communication, 28 (3), pp. 195–209, 1999.


Petros Maragos, Alexandros Potamianos

Fractal dimensions of speech sounds: Computation and application to automatic speech recognition Journal Article

The Journal of the Acoustical Society of America, 105 (3), pp. 1925–1932, 1999, ISSN: 0001-4966.


Alexandros Potamianos, Petros Maragos

Speech analysis and synthesis using an AM–FM modulation Conference

Speech Communication, 28 (3), 1999.



Petros Maragos, Alexandros Potamianos

On Using Fractal Features of Speech Sounds in Automatic Speech Recognition Conference

Eurospeech, 1997.



A Potamianos, P Maragos

Speech formant frequency and bandwidth tracking using multiband energy demodulation Journal Article

1995 International Conference on Acoustics, Speech, and Signal Processing, 1, pp. 784–787, 1996, ISSN: 1520-6149.



Petros Maragos, Alexandros Potamianos

Higher Order Differential Energy Operators Journal Article

IEEE Signal Processing Letters, 2 (8), pp. 152–154, 1995, ISSN: 15582361.


P. Maragos, A. Potamianos, B. Santhanam

Instantaneous Energy Operators: Applications to Speech Processing and Communications Conference

Proc. IEEE Workshop on Nonlinear Signal and Image Processing, Halkidiki, Greece, pp. 955–958, June 1995.


A. Potamianos, P. Maragos

Speech formant frequency and bandwidth tracking using multiband energy demodulation Conference

1995 International Conference on Acoustics, Speech, and Signal Processing, 1, 1995, ISSN: 1520-6149.



H M Hanson, P Maragos, A Potamianos

A system for finding speech formants and modulations via energy separation Journal Article

IEEE Transactions on Speech and Audio Processing, 2 (3), pp. 436–443, 1994, ISSN: 1063-6676.


A Potamianos, P Maragos

A Comparison of the Energy Operator and Hilbert Transform Approaches for Signal and Speech Demodulation Journal Article

Signal Processing, 37 (1), pp. 95–120, 1994.


A Potamianos, P Maragos

Applications of Speech Processing Using an AM–FM Modulation Model and Energy Operators Conference

Proc. European Signal Process. Conf., 1994.



H. M. Hanson, P. Maragos, A. Potamianos

Finding Speech Formants and Modulations via Energy Separation: With an Application to a Vocoder Conference

Proc. Int’l Conf. on Acoustics, Speech, and Signal Processing (ICASSP-93), Minneapolis, MN, 1993.
