Dr. Martin Wöllmer

  • Home
  • BMW
  • audEERING
  • Publications
  • Slope
  • Funny Farm
  • Home
  • BMW
  • audEERING
  • Publications
  • Slope
  • Funny Farm
Picture
From 2008 until 2012 I worked as a speech processing researcher at the Institute for Human-Machine Communication at Technische Universität München (TUM). Our research group focussed on automatic speech and emotion recognition for intelligent dialog systems.

In my PhD thesis, I developed techniques for context-sensitive classification of speech signals. More specifically, I examined how so-called Long Short-Term Memory (LSTM) neural networks can be used to improve automatic speech and emotion recognition.

LSTM is a machine learning algorithm that learns how much contextual information should be exploited in order to classify a given speech fragment. It was invented in 1997 by Sepp Hochreiter and Jürgen Schmidhuber at TUM. I was the first one to apply LSTM for continuous speech recognition and presented my research results at several international conferences.

With the breaktrough of "Deep Learning" methods, the usage of neural networks for speech classification tasks became more and more popular in the last few years. Since 2016, also the big players in speech processing are using LSTM for speech recognition: The technology is now used by Google and Microsoft for new products and can be also found in Apple's SIRI and Amazon's Alexa (see Wikipedia).

PhD Thesis:
  • Martin Wöllmer: "Context-Sensitive Machine Learning for Intelligent Human Behavior Analysis", PhD Thesis, Technische Universität München, 2013.

Journal Articles:
  • Martin Wöllmer, Björn Schuller: "Probabilistic Speech Feature Extraction with Context-Sensitive Bottleneck Neural Networks", in Neurocomputing (NEUCOM), Elsevier, vol. 132, pp. 113-120, 2014.
  • Martin Wöllmer, Felix Weninger, Tobias Knaup, Björn Schuller, Congkai Sun, Kenji Sagae, Louis-Philippe Morency: "YouTube Movie Reviews: Sentiment Analysis in an Audiovisual Context", in IEEE Intelligent Systems Magazine, IEEE, vol. 28, no. 3, pp. 46-53, 2013.
  • Martin Wöllmer, Felix Weninger, Jürgen Geiger, Björn Schuller, Gerhard Rigoll: "Noise Robust ASR in Reverberated Multisource Environments Applying Convolutive NMF and Long Short-Term Memory", in Computer Speech and Language (CSL), Special Issue on Speech Separation and Recognition in Multisource Environments, Elsevier, vol. 27, no. 3, pp. 780-797, 2013.
  • Martin Wöllmer, Moritz Kaiser, Florian Eyben, Björn Schuller, Gerhard Rigoll: "LSTM-Modeling of Continuous Emotions in an Audiovisual Affect Recognition Framework", in Image and Vision Computing (IMAVIS), Special Issue on Affect Analysis in Continuous Input, Elsevier, vol. 31, no. 2, pp. 153-163, 2013.
  • Martin Wöllmer, Björn Schuller, Gerhard Rigoll: "Keyword Spotting Exploiting Long Short-Term Memory", in Speech Communication (SPECOM), Elsevier, vol. 55, no. 2, pp. 252-265, 2013.
  • Martin Wöllmer, Erik Marchi, Stefano Squartini, Björn Schuller: "Multi-Stream LSTM-HMM Decoding and Histogram Equalization for Noise Robust Keyword Spotting", in Cognitive Neurodynamics (CODY), Springer, vol. 5, no. 3, pp. 253-264, 2011.
  • Martin Wöllmer, Felix Weninger, Florian Eyben, Björn Schuller: "Computational Assessment of Interest in Speech - Facing the Real-Life Challenge", in Künstliche Intelligenz (KI), Special Issue on Emotion and Computing, Springer, vol. 25, no. 3, pp. 225-234, 2011.
  • Martin Wöllmer, Christoph Blaschke, Thomas Schindl, Björn Schuller, Berthold Färber, Stefan Mayer, Benjamin Trefflich: "On-line Driver Distraction Detection using Long Short-Term Memory", in IEEE Transactions on Intelligent Transportation Systems (TITS), IEEE, vol. 12, no. 2, pp. 574-582, 2011.
  • Martin Wöllmer, Björn Schuller, Anton Batliner, Stefan Steidl, Dino Seppi: "Tandem Decoding of Children's Speech for Keyword Detection in a Child-Robot Interaction Scenario", in ACM Transactions on Speech and Language Processing (TSLP), Special Issue on Speech and Language Processing of Children's Speech for Child-machine Interaction Applications, ACM, vol. 7, no. 4, Article 12, 2011.
  • Martin Wöllmer, Florian Eyben, Alex Graves, Björn Schuller, Gerhard Rigoll: "Bidirectional LSTM Networks for Context-Sensitive Keyword Detection in a Cognitive Virtual Agent Framework", in Cognitive Computation, Special Issue on Non-Linear and Non-Conventional Speech Processing, Springer, vol. 2, no. 3, pp. 180-190, 2010.
  • Martin Wöllmer, Björn Schuller, Florian Eyben, Gerhard Rigoll: "Combining Long Short-Term Memory and Dynamic Bayesian Networks for Incremental Emotion-Sensitive Artificial Listening", in IEEE Journal of Selected Topics in Signal Processing (JSTSP), Special Issue on Speech Processing for Natural Interaction with Intelligent Environments, IEEE, vol. 4, no. 5, pp. 867-881, 2010.
  • Martin Wöllmer, Marc Al-Hames, Florian Eyben, Björn Schuller, Gerhard Rigoll: "A Multidimensional Dynamic Time Warping Algorithm for Efficient Multimodal Fusion of Asynchronous Data Streams", in Neurocomputing (NEUCOM), Elsevier, vol. 73, no. 1-3, pp. 366-380, 2009.
  • Jürgen Geiger, Felix Weninger, Jort Gemmeke, Martin Wöllmer, Björn Schuller, Gerhard Rigoll: "Memory-Enhanced Neural Networks and NMF for Robust ASR", in IEEE/ACM Transactions on Audio, Speech and Language Processing (TASLP, to appear), IEEE/ACM, 2014.
  • Felix Weninger, Jürgen Geiger, Martin Wöllmer, Björn Schuller, Gerhard Rigoll: "Feature Enhancement by Deep LSTM Networks for ASR in Reverberant Multisource Environments",  in Computer Speech and Language (CSL, to appear), Elsevier, 2014.
  • Angeliki Metallinou, Martin Wöllmer, Athanasios Katsamanis, Florian Eyben, Björn Schuller, Shrikanth Narayanan: "Context-Sensitive Learning for Enhanced Audiovisual Emotion Classification", in IEEE Transactions on Affective Computing (TAC), IEEE, vol. 3, no. 2, pp. 184-198, 2012.
  • Marc Schröder, Elisabetta Bevacqua, Roddy Cowie, Florian Eyben, Hatice Gunes, Dirk Heylen, Mark ter Maat, Gary McKeown, Sathish Pammi, Maja Pantic, Catherine Pelachaud, Björn Schuller, Etienne de Sevin, Michel Valstar, Martin Wöllmer: "Building Autonomous Sensitive Artificial Listeners", in IEEE Transactions on Affective Computing (TAC), IEEE, vol. 3, no. 2, pp. 165-183, 2012.
  • Emanuele Principi, Rudy Rotili, Martin Wöllmer, Florian Eyben, Stefano Squartini, Björn Schuller: "Real-Time Activity Detection in a Multi-Talker Reverberated Environment", in Cognitive Computation, Special Issue on Cognitive and Emotional Information Processing for Human-Machine Interaction, vol. 4, no. 4, pp. 386-397, Springer, 2012.
  • Florian Eyben, Martin Wöllmer, Björn Schuller: "A Multi-Task Approach to Continuous Five-Dimensional Affect Sensing in Natural Speech", in ACM Transactions on Interactive Intelligent Systems (TIIS), Special Issue on Affective Interaction in Natural Environments, ACM, vol. 2, no. 1, Article 6, 2012.
  • Florian Eyben, Martin Wöllmer, Alex Graves, Björn Schuller, Ellen Douglas-Cowie, Roddy Cowie: "On-line Emotion Recognition in a 3-D Activation-Valence-Time Continuum using Acoustic and Linguistic Cues", in Journal on Multimodal User Interfaces (JMUI), Special Issue on Real-time Affect Analysis and Interpretation: Closing the Loop in Virtual Agents, Springer, vol. 3, no. 1-2, pp. 7-19, 2010.
  • Florian Eyben, Martin Wöllmer, Tony Poitschke, Björn Schuller, Christoph Blaschke, Berthold Färber, Nhu Nguyen-Thien: "Emotion on the Road - Necessity, Acceptance, and Feasibility of Affective Computing in the Car", in Advances in Human Computer Interaction (AHCI), Special Issue on "Emotion-Aware Natural Interaction", Hindawi Publishing, Article ID 263593, 17 pages, 2010.
  • Björn Schuller, Bogdan Vlasenko, Florian Eyben, Martin Wöllmer, André Stuhlsatz, Andreas Wendemuth, Gerhard Rigoll: "Cross-Corpus Acoustic Emotion Recognition: Variances and Strategies", in IEEE Transactions on Affective Computing (TAC), IEEE, vol. 1, no. 2, pp. 119-131, 2010.
  • Björn Schuller, Martin Wöllmer, Tobias Moosmayr, Gerhard Rigoll: "Recognition of Noisy Speech: A Comparative Survey of Robust Model Architectures and Feature Enhancement", in EURASIP Journal on Audio, Speech, and Music Processing (JASMP), Article ID 942617, 17 pages, 2009.
  • Björn Schuller, Ronald Müller, Florian Eyben, Jürgen Gast, Benedikt Hörnler, Martin Wöllmer, Gerhard Rigoll, Anja Höthker, Hitoshi Konosu: "Being Bored? Recognising Natural Interest by Extensive Audiovisual Integration for Real-Life Application", in Image and Vision Computing Journal (IMAVIS), Special Issue on Visual and Multimodal Analysis of Human Spontaneous Behavior, Elsevier, vol. 27, no. 12, pp. 1760-1774, 2009.

Conference papers:
2013
  • Martin Wöllmer, Björn Schuller, Gerhard Rigoll: "Probabilistic ASR Feature Extraction Applying Context-Sensitive Connectionist Temporal Classification Networks", in Proc. of ICASSP 2013, IEEE, pp. 7125-7129, Vancouver, Canada, 2013.
  • Martin Wöllmer, Zixing Zhang, Felix Weninger, Björn Schuller, Gerhard Rigoll: "Feature Enhancement by Bidirectional LSTM Networks for Conversational Speech Recognition in Highly Non-Stationary Noise", in Proc. of ICASSP 2013, IEEE, pp. 6822-6826, Vancouver, Canada, 2013.
  • Felix Weninger, Claudia Wagner, Martin Wöllmer, Björn Schuller, Louis-Philippe Morency: "Speaker Trait Characterization in Web Videos: Uniting Speech, Language, and Facial Features", in Proc. of ICASSP 2013, IEEE, Vancouver, Canada, 2013.
  • Jürgen Geiger, Felix Weninger, Antti Hurmalainen, Jort F. Gemmeke, Martin Wöllmer, Björn Schuller, Gerhard Rigoll, Tuomas Virtanen: "The TUM+TUT+KUL Approach to the CHiME Challenge 2013: Multi-Stream ASR Exploiting BLSTM Networks and Sparse NMF", in Proc. of 2nd CHiME Speech Separation and Recognition Challenge held in conjunction with ICASSP 2013, IEEE, pp. 25-30, Vancouver, Canada, 2013.
  • Felix Weninger, Jürgen Geiger, Martin Wöllmer, Björn Schuller, Gerhard Rigoll: "The Munich Feature Enhancement Approach to the 2013 CHiME Challenge Using BLSTM Recurrent Neural Networks", in Proc. of 2nd CHiME Speech Separation and Recognition Challenge held in conjunction with ICASSP 2013, IEEE, pp. 86-90, Vancouver, Canada, 2013.
2012
  • Martin Wöllmer, Florian Eyben, Björn Schuller, Gerhard Rigoll: "Temporal and Situational Context Modeling for Improved Dominance Recognition in Meetings", in Proc. of Interspeech 2012, ISCA, pp. 350-353, Portland, Oregon, USA, 2012.
  • Felix Weninger, Martin Wöllmer, Björn Schuller: "Combining Bottleneck-BLSTM and Semi-Supervised Sparse NMF for Recognition of Conversational Speech in Highly Instationary Noise", in Proc. of Interspeech 2012, ISCA, pp. 302-305, Portland, Oregon, USA, 2012.
  • Martin Wöllmer, Moritz Kaiser, Florian Eyben, Felix Weninger, Björn Schuller, Gerhard Rigoll: "Fully Automatic Audiovisual Emotion Recognition: Voice, Words, and the Face", in Proc. of 10th ITG conference on Speech Communication, Braunschweig, Germany, 2012.
  • Martin Wöllmer, Angeliki Metallinou, Nassos Katsamanis, Björn Schuller, Shrikanth Narayanan: "Analyzing the Memory of BLSTM Neural Networks for Enhanced Emotion Classification in Dyadic Spoken Interactions", in Proc. of ICASSP 2012, IEEE, pp. 4157-4160, Kyoto, Japan, 2012.
  • Felix Weninger, Martin Wöllmer, Jürgen Geiger, Björn Schuller, Jort Gemmeke, Antti Hurmalainen, Tuomas Virtanen, Gerhard Rigoll: "Non-Negative Matrix Factorization for Highly Noise-Robust ASR: to Enhance or to Recognize?", in Proc. of ICASSP 2012, IEEE, pp. 4681-4684, Kyoto, Japan, 2012.
  • Felix Weninger, Martin Wöllmer, Björn Schuller: "Sparse, Hierarchical and Semi-Supervised Base Learning for Monaural Enhancement of Conversational Speech", in Proc. of 10th ITG conference on Speech Communication, Braunschweig, Germany, 2012.
  • Cyril Joder, Felix Weninger, Martin Wöllmer, Björn Schuller: "The TUM Cumulative DTW Approach for the Mediaeval 2012 Spoken Web Search Task", in Proc. of MediaEval Workshop 2012, Pisa, Italy, 2012.
  • Emanuele Principi, Rudy Rotili, Martin Wöllmer, Stefano Squartini, Björn Schuller: "Dominance Detection in a Reverberated Acoustic Scenario", in Proc. of International Symposium on Neural Networks (ISNN), Special Session on "Advances in Cognitive and Emotional Information Processing", Springer, Shenyang, China, 2012.
  • Wenjing Han, Zixing Zhang, Jun Deng, Martin Wöllmer, Felix Weninger, Björn Schuller: "Towards Distributed Recognition of Emotion from Speech", in Proc. of International Symposium on Communications, Control, and Signal Processing (ISCCSP), Special Session "Interactive Behaviour Analysis", IEEE, Rome, Italy, 2012.
2011
  • Martin Wöllmer, Björn Schuller, Gerhard Rigoll: "A Novel Bottleneck-BLSTM Front-End for Feature-Level Context Modeling in Conversational Speech Recognition", in Proc. of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2011), IEEE, pp. 36-41, Waikoloa, Big Island, Hawaii, 2011.
  • Martin Wöllmer, Björn Schuller: "Enhancing Spontaneous Speech Recognition with BLSTM Features", in Proc. of NOLISP 2011, ISCA Tutorial and Research Workshop on Non-Linear Speech Processing, EURASIP, pp. 17-24, Las Palmas de Gran Canaria, Spain, 2011.
  • Martin Wöllmer, Björn Schuller, Gerhard Rigoll: "Feature Frame Stacking in RNN-based Tandem ASR Systems - Learned vs. Predefined Context", in Proc. of Interspeech 2011, ISCA, pp. 1233-1236, Florence, Italy, 2011.
  • Martin Wöllmer, Felix Weninger, Florian Eyben, Björn Schuller: "Acoustic-Linguistic Recognition of Interest in Speech with Bottleneck-BLSTM Nets", in Proc. of Interspeech 2011, ISCA, pp. 77-80 Florence, Italy, 2011.
  • Martin Wöllmer, Felix Weninger, Stefan Steidl, Anton Batliner, Björn Schuller: "Speech-based Non-prototypical Affect Recognition for Child-Robot Interaction in Reverberated Environments", in Proc. of Interspeech 2011, ISCA, pp. 3113-3116, Florence, Italy, 2011.
  • Felix Weninger, Jürgen Geiger, Martin Wöllmer, Björn Schuller, Gerhard Rigoll: "The Munich 2011 CHiME Challenge Contribution: NMF-BLSTM Speech Enhancement and Recognition for Reverberated Multisource Environments", in Proc. of International Workshop on Machine Listening in Multisource Environments (CHiME 2011), Special Session "The PASCAL 'CHiME' Speech Separation and Recognition Challenge", ISCA, Florence, Italy, 2011.
  • Martin Wöllmer, Florian Eyben, Björn Schuller, Gerhard Rigoll: "A Multi-Stream ASR Framework for BLSTM Modeling of Conversational Speech", in Proc. of ICASSP 2011, IEEE, pp. 4860-4863, Prague, Czech Republic, 2011.
  • Martin Wöllmer, Erik Marchi, Stefano Squartini, Björn Schuller: "Robust Multi-Stream Keyword and Non-Linguistic Vocalization Detection for Computationally Intelligent Virtual Agents", in Proc. of 8th International Symposium on Neural Networks (ISNN 2011), Special Session "Computational Intelligence Algorithms for Advanced Human-Machine Interaction", IEEE Computational Intelligence Society, D. Liu et al. (Eds.): ISNN 2011 Part II, LNCS 6676, Springer Heidelberg, pp. 496-505, Guilin, China, 2011.
  • Zixing Zhang, Felix Weninger, Martin Wöllmer, Björn Schuller: "Unsupervised Learning in Cross-Corpus Acoustic Emotion Recognition", in Proc. of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2011), IEEE, pp. 523-528, Waikoloa, Big Island, Hawaii, 2011.
  • Florian Eyben, Martin Wöllmer, Michel Valstar, Hatice Gunes, Björn Schuller, Maja Pantic: "String-based Audiovisual Fusion of Behavioural Events for the Assessment of Dimensional Affect", in Proc. of 9th International IEEE Conference on Face and Gesture Recognition 2011 (FG 2011), IEEE, Santa Barbara, California, USA, 2011.
  • Felix Weninger, Martin Wöllmer, Björn Schuller: "Automatic Assessment of Singer Traits in Popular Music: Gender, Age, Height and Race", in Proc. of 12th International Society for Music Information Retrieval Conference (ISMIR), pp. 37-42, Miami, Florida, USA, 2011.
  • Björn Schuller, Martin Wöllmer, Florian Eyben, Gerhard Rigoll, Dejan Arsić: "Semantic Speech Tagging: Towards Combined Analysis of Speaker Traits", in Proc. of 42nd Conference on Semantic Audio, Audio Engineering Society (AES), pp. 89-97, Illmenau, Germany, 2011.
  • Marc Schröder, Sathish Pammi, Hatice Gunes, Maja Pantic, Michel Valstar, Roddy Cowie, Gary McKeown, Dirk Heylen, Mark ter Maat, Florian Eyben, Björn Schuller, Martin Wöllmer, Elisabetta Bevacqua, Catherine Pelachaud, Etienne de Sevin: "Come and Have an Emotional Workout with Sensitive Artificial Listeners!", in Proc. of 9th International IEEE Conference on Face and Gesture Recognition 2011 (FG 2011), IEEE, Santa Barbara, California, USA, 2011.
  • Felix Weninger, Björn Schuller, Martin Wöllmer, Gerhard Rigoll: "Localization of Non-Linguistic Events in Spontaneous Speech by Non-Negative Matrix Factorization and Long Short-Term Memory", in Proc. of ICASSP 2011, IEEE, pp. 5840-5843, Prague, Czech Republic, 2011.
2010
  • Martin Wöllmer, Angeliki Metallinou, Florian Eyben, Björn Schuller, Shrikanth Narayanan: "Context-Sensitive Multimodal Emotion Recognition from Speech and Facial Expression using Bidirectional LSTM Modeling", in Proc. of Interspeech 2010, ISCA, pp. 2362-2365, Makuhari, Japan, 2010.
  • Martin Wöllmer, Florian Eyben, Björn Schuller, Gerhard Rigoll: "Recognition of Spontaneous Conversational Speech using Long Short-Term Memory Phoneme Predictions", in Proc. of Interspeech 2010, ISCA, pp. 1946-1949, Makuhari, Japan, 2010.
  • Martin Wöllmer, Yang Sun, Florian Eyben, Björn Schuller: "Long Short-Term Memory Networks for Noise Robust Speech Recognition", in Proc. of Interspeech 2010, ISCA, pp. 2966-2969, Makuhari, Japan, 2010.
  • Florian Eyben, Martin Wöllmer, Björn Schuller: "openSMILE - The Munich Versatile and Fast Open-Source Audio Feature Extractor", in Proc. of ACM Multimedia, ACM, pp. 1459-1462, Firenze, Italy, 2010.
  • Martin Wöllmer, Nikolaj Klebert, Björn Schuller: "Switching Linear Dynamic Models for Recognition of Emotionally Colored and Noisy Speech", in Proc. of 9th ITG conference on Speech Communication, Bochum, Germany, 2010.
  • Martin Wöllmer, Florian Eyben, Björn Schuller, Gerhard Rigoll: "Spoken Term Detection with Connectionist Temporal Classification: A Novel Hybrid CTC-DBN Decoder", in Proc. of ICASSP 2010, IEEE, pp. 5274-5277, Dallas, Texas, USA, 2010.
  • Dejan Arsic, Luis Roalter, Martin Wöllmer, Florian Eyben, Moritz Kaiser, Björn Schuller, Gerhard Rigoll, Matthias Kranz: "Automated 3D Gesture Recognition Applying Long Short-Term Memory and Contextual Knowledge in a CAVE", in Proc. of ACM Multimedia 2010 Workshop - Multimodal Pervasive Video Analysis, MPVA 2010, ACM, pp. 33-36, Firenze, Italy, 2010.
  • Björn Schuller, Felix Weninger, Martin Wöllmer, Yang Sun, Gerhard Rigoll: "Non-Negative Matrix Factorization as Noise-Robust Feature Extractor for Speech Recognition", in Proc. of ICASSP 2010, IEEE, pp. 4562-4565, Dallas, Texas, USA, 2010.
  • Marc Schröder, Sathish Pammi, Roddy Cowie, Gary McKeown, Hatice Gunes, Maja Pantic, Michel Valstar, Dirk Heylen, Mark ter Maat, Florian Eyben, Björn Schuller, Martin Wöllmer, Elisabetta Bevacqua, Catherine Pelachaud, Etienne de Sevin: "Demo: Have a Chat with Sensitive Artificial Listeners", AISB'2010 Symposium "Towards a Comprehensive Intelligence Test", 2010.
2009
  • Martin Wöllmer, Florian Eyben, Björn Schuller, Gerhard Rigoll: "Robust Vocabulary Independent Keyword Spotting with Graphical Models", in Proc. of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2009), pp. 349-353, Merano, Italy, 2009.
  • Florian Eyben, Martin Wöllmer, Björn Schuller, Alex Graves: "From Speech to Letters - Using a Novel Neural Network Architecture for Grapheme Based ASR", in Proc. of IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2009), pp. 376-380, Merano, Italy, 2009.
  • Martin Wöllmer, Florian Eyben, Björn Schuller, Ellen Douglas-Cowie, Roddy Cowie: "Data-driven Clustering in Emotional Space for Affect Recognition Using Discriminatively Trained LSTM Networks", in Proc. of Interspeech 2009, ISCA, pp. 1595-1598, Brighton, UK, 2009.
  • Martin Wöllmer, Florian Eyben, Björn Schuller, Yang Sun, Tobias Moosmayr, Nhu Nguyen-Thien: "Robust In-Car Spelling Recognition - A Tandem BLSTM-HMM Approach", in Proc. of Interspeech 2009, ISCA, pp. 2507-2510, Brighton, UK, 2009.
  • Florian Eyben, Martin Wöllmer, Björn Schuller: "openEAR - Introducing the Munich Open-Source Emotion and Affect Recognition Toolkit", in Proc. of 4th International HUMAINE Association Conference on Affective Computing and Intelligent Interaction 2009 (ACII 2009), IEEE, pp. 576-581, Amsterdam, The Netherlands, 2009.
  • Martin Wöllmer, Florian Eyben, Alex Graves, Björn Schuller, Gerhard Rigoll: "A Tandem BLSTM-DBN Architecture for Keyword Spotting with Enhanced Context Modeling", in Proc. of NOLISP 2009, ISCA Tutorial and Research Workshop on Non-Linear Speech Processing, EURASIP, Vic, Spain, 2009.
  • Martin Wöllmer, Florian Eyben, Joseph Keshet, Alex Graves, Björn Schuller, Gerhard Rigoll: "Robust Discriminative Keyword Spotting for Emotionally Colored Spontaneous Speech Using Bidirectional LSTM Networks" in Proc. of ICASSP 2009, IEEE, pp. 3949-3952, Taipei, Taiwan, 2009.
  • Björn Schuller, Salman Can, Hubertus Feussner, Martin Wöllmer, Dejan Arsic, Benedikt Hörnler: "Speech Control in Surgery: a Field Analysis and Strategies", in Proc. of International Conference on Multimedia and Expo (ICME 2009), IEEE, pp. 1214-1217, New York, NY, 2009.
  • Marc Schröder, Elisabetta Bevacqua, Florian Eyben, Hatice Gunes, Dirk Heylen, Mark ter Maat, Sathish Pammi, Maja Pantic, Catherine Pelachaud, Björn Schuller, Etienne de Sevin, Michel Valstar, Martin Wöllmer: "A Demonstration of Audiovisual Sensitive Artificial Listeners", in Proc. of 4th International HUMAINE Association Conference on Affective Computing and Intelligent Interaction 2009 (ACII 2009), IEEE, pp. 263-264, Amsterdam, The Netherlands, 2009.
2008
  • Martin Wöllmer, Florian Eyben, Stephan Reiter, Björn Schuller, Cate Cox, Ellen Douglas-Cowie, Roddy Cowie: "Abandoning Emotion Classes - Towards Continuous Emotion Recognition with Modelling of Long-Range Dependencies", in Proc. of Interspeech 2008, ISCA, pp. 597-600, Brisbane, Australia, 2008.
  • Björn Schuller, Martin Wöllmer, Tobias Moosmayr, Gerhard Rigoll: "Speech Recognition in Noisy Environments using a Switching Linear Dynamic Model for Feature Enhancement", in Proc. of Interspeech 2008, Special Session: Human-Machine Comparisons of Consonant Recognition in Noise (Consonant Challenge), ISCA, pp. 1789-1792, Brisbane, Australia, 2008.
  • Björn Schuller, Martin Wöllmer, Tobias Moosmayr, Günther Ruske, Gerhard Rigoll: "Switching Linear Dynamic Models for Noise Robust In-Car Speech Recognition", in Pattern Recognition, Proc. of 30th DAGM Symposium, Gerhard Rigoll (Ed.), DAGM, Springer LNCS 5096, ISBN 978-3-540-69320-8, Springer Berlin Heidelberg, pp. 244-253, Munich, Germany, 2008.
  • Björn Schuller, Martin Wöllmer, Tobias Moosmayr, Gerhard Rigoll: "Robust Spelling and Digit Recognition in the Car: Switching Models and Their Like", in Proc. of DAGA 2008, Invited Session "Sprachakustik im Kraftfahrzeug", DEGA, pp. 847-848, Dresden, Germany, 2008.

Book chapters:
  • Martin Wöllmer, Florian Eyben, Alex Graves, Björn Schuller, Gerhard Rigoll: "Improving Keyword Spotting with a Tandem BLSTM-DBN Architecture", in Non-Linear Speech Processing, J. Sole-Casals and V. Zaiats (Eds.), LNAI 5933, pp. 68-75, Springer Heidelberg, 2010.
  • Felix Weninger, Martin Wöllmer, Björn Schuller: "Emotion Recognition in Naturalistic Speech and Language - A Survey", to appear in "Advances in Emotion Recognition", Amit Konar, Aruna Chakraborty (eds.), Wiley-Blackwell, 2012.
  • Rudy Rotili, Emanuele Principi, Martin Wöllmer, Stefano Squartini, Björn Schuller: "Conversational Speech Recognition In Non-Stationary Reverberated Environments", in 4th COST International Training School on Cognitive Behavioural Systems, A. Esposito, A. Vinciarelli, R. Hoffmann, and V. Müller (Eds.), vol. 7403, pp. 50-59, LNCS, Springer, 2012.
  • Björn Schuller, Martin Wöllmer, Florian Eyben, Gerhard Rigoll: "Retrieval of Paralinguistic Information in Broadcasts”, in Multimedia Information Extraction, Mark Maybury (ed.), MIT Press, Cambridge, Massachusetts, 2009.
  • Björn Schuller, Martin Wöllmer, Florian Eyben, Gerhard Rigoll: "Spectral or Voice Quality? Feature Type Relevance for the Discrimination of Emotion Pairs", in Prosody and Affect, Linguistic Insights, Maurizio Gotti (ed.), vol. 97, pp. 285-307, Peter Lang Publishing Group, 2009.
Powered by Create your own unique website with customizable templates.