| Title | Author | Source | 
|---|---|---|
| Summarizing Speech: A Comprehensive Survey | Fabian Retkowski, Maike Züfle, Andreas Sudmann, Dinah Pfau, Shinji Watanabe, Jan Niehues, Alexander Waibel | arXiv:2504.08024 | 
| Cocktail-Party Audio-Visual Speech Recognition | Thai-Binh Nguyen, Ngoc-Quan Pham, Alexander Waibel | Interspeech 2025 | 
| The AI Co-Ethnographer: How Far Can Automation Take Qualitative Research? | Fabian Retkowski, Andreas Sudmann, Alexander Waibel | In Proceedings of the 5th International Conference on Natural Language Processing for Digital Humanities, pages 73–90, Albuquerque, New Mexico | 
| Zero-Shot Strategies for Length-Controllable Summarization | Fabian Retkowski, Alexander Waibel | Findings of the Association for Computational Linguistics: NAACL 2025, pages 551–572, Albuquerque, New Mexico | 
| PIER: A Novel Metric for Evaluating What Matters in Code-Switching | Enes Yavuz Ugan, Ngoc-Quan Pham, Leonard Bärmann, Alex Waibel | ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS | Tuan Nam Nguyen, Seymanur Akti, Ngoc Quan Pham, Alexander Waibel | ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| Continuously Learning New Words in Automatic Speech Recognition | Christian Huber, Alexander Waibel | ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models | Thai Binh Nguyen, Alexander Waibel | ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
| Title | Author | Source | 
|---|---|---|
| Findings of the IWSLT 2024 evaluation campaign | Ibrahim Said Ahmad, Antonios Anastasopoulos, Ondřej Bojar, Claudia Borg, Marine Carpuat, Roldano Cattoni, Mauro Cettolo, William Chen, Qianqian Dong, Marcello Federico, Barry Haddow, Dávid Javorský, Mateusz Krubiński, Tsz Kin Lam, Xutai Ma, Prashant Mathur, Evgeny Matusov, Chandresh Maurya, John McCrae, Kenton Murray, Satoshi Nakamura, Matteo Negri, Jan Niehues, Xing Niu, Atul Kr. Ojha, John Ortega, Sara Papi, Peter Polák, Adam Pospíšil, Pavel Pecina, Elizabeth Salesky, Nivedita Sethiya, Balaram Sarkar, Jiatong Shi, Claytone Sikasote, Matthias Sperber, Sebastian Stüker, Katsuhito Sudoh, Brian Thompson, Alex Waibel, Shinji Watanabe, Patrick Wilken, Petr Zemánek, Rodolfo Zevallos | Proceedings of the 21st IWSLT, Bangkok - Thailand | 
| Blending LLMs into Cascaded Speech Translation: KIT's Offline Speech Translation System for IWSLT 2024 | Koneru, Sai; Nguyen, Thai-Binh; Pham, Ngoc-Quan; Liu, Danni; Li, Zhaolin; Waibel, Alexander; Niehues, Jan | arXiv:2406.16777 | 
| Titanic Calling: Low Bandwidth Video Conference from the Titanic Wreck | Eyiokur, Fevziye Irem; Huber, Christian; Nguyen, Thai-Binh; Nguyen, Tuan-Nam; Retkowski, Fabian; Ugan, Enes Yavuz; Yaman, Dogucan; Waibel, Alexander | Frontiers in Robotics and AI | 
| Handling Numeric Expressions in Automatic Speech Recognition | Christian Huber, Alexander Waibel | arXiv preprint arXiv:2408.00004 | 
| Audio-driven Talking Face Generation with Stabilized Synchronization Loss | Dogucan Yaman, Fevziye Irem Eyiokur, Leonard Bärmann, Hazim Kemal Ekenel, Alexander Waibel | European Conference on Computer Vision (ECCV) 2024 | 
| Accent conversion using discrete units with parallel data synthesized from controllable accented TTS | Tuan-Nam Nguyen, Ngoc-Quan Pham, Alexander Waibel | Interspeech 2024 Workshop | 
| Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation | Dogucan Yaman, Fevziye Irem Eyiokur, Leonard Bärmann, Seymanur Akti, Hazim Kemal Ekenel, Alexander Waibel | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshop | 
| Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages | Carlos Mullov, Ngoc-Quan Pham, Alexander Waibel | Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics 2024 | 
| DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark | Enes Yavuz Ugan, Ngoc-Quan Pham, Alexander Waibel | LREC-COLING24 | 
| Convoifilter: A case study of doing cocktail party speech recognition | Thai Binh Nguyen, Alexander Waibel | International Conference on Acoustics, Speech, and Signal Processing Workshops 2024 – ICASSPW |
| Synthetic Conversations Improve Multi-Talker ASR | Thai Binh Nguyen, Alexander Waibel | International Conference on Acoustics, Speech, and Signal Processing 2024 - ICASSP |
| From Text Segmentation to Smart Chaptering: A Novel Benchmark for Structuring Video Transcriptions | Fabian Retkowski, Alexander Waibel | In Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers), pages 406–419, St. Julian’s, Malta. Association for Computational Linguistics. | 
| Title | Author | Source | 
|---|---|---|
| Findings of the IWSLT 2023 evaluation campaign | Agarwal, Milind; Agarwal, Sweta; Anastasopoulos, Antonios; Bentivogli, Luisa; Bojar, Ondřej; Borg, Claudia; Carpuat, Marine; Cattoni, Roldano; Cettolo, Mauro; Chen, Mingda | Association for Computational Linguistics | 
| End-to-End Evaluation for Low-Latency Simultaneous Speech Translation | Christian Huber, Tu Anh Dinh, Carlos Mullov, Ngoc-Quan Pham, Thai Binh Nguyen, Fabian Retkowski, Stefan Constantin, Enes Ugan, Danni Liu, Zhaolin Li, Sai Koneru, Jan Niehues, Alexander Waibel | Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations | 
| Face-Dubbing++: Lip-Synchronous, Voice Preserving Translation of Videos | Alexander Waibel, Moritz Behr, Irem Fevziye, Dogucan Yaman, Tuan-Nam Nguyen, Carlos Mullov, Mehmet Arif Demirtas, Alperen Kantarci, Stefan Constantin, Hazim Kemal Ekenel | 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) | 
| Improving Spoken Language Understanding by Enhancing Text Representation | Thai Binh Nguyen, Alexander Waibel | International Conference on Acoustics, Speech, and Signal Processing 2023 - ICASSP | 
| SYNTACC: Synthesizing Multi-Accent Speech By Weight Factorization | Tuan-Nam Nguyen, Ngoc-Quan Pham, Alexander Waibel | ICASSP 2023 (2023 International Conference on Acoustics, Speech, and Signal Processing) | 
| Modular Design of a Front-End and Back-End Speech-to-Speech Translation Application for Psychiatric Treatment of Refugees | Enes Yavuz Ugan, Mohammed Mediani, Omar Al Jawabra, Aya Khader, Yining Liu, Alexander Waibel | 2023 IEEE Global Humanitarian Technology Conference (GHTC) | 
| Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models | Leonard Bärmann, Rainer Kartmann, Fabian Peller-Konrad, Alex Waibel, Tamim Asfour | 2nd Workshop on Language and Robot Learning: Language as Grounding at CoRL 2023 |
| Multimodal Error Correction with Natural Language and Pointing Gestures | Stefan Constantin, Fevziye Irem Eyiokur, Dogucan Yaman, Leonard Bärmann, and Alex Waibel | ICCV 2023 Workshops, Paris, France, October 2–3, 2023 | 
| Towards continually learning new languages | Pham, Q., Niehues, J., Waibel, A. | Proc. INTERSPEECH 2023, 3262-3266, doi: 10.21437/Interspeech.2023-1867 | 
| Language-agnostic Code-Switching in Sequence-To-Sequence Speech Recognition | Enes Yavuz Ugan, Christian Huber, Juan Hussain, Alexander Waibel | IWSDS 2023, The 13th International Workshop on Spoken Dialogue Systems Technology | 
| Title | Author | Source | 
|---|---|---|
| Improving Spoken Language Understanding by Enhancing Text Representation | Thai Binh Nguyen | IEEE | 
| Interactive Multimodal Robot Dialog using Pointing Gesture Recognition | Stefan Constantin, Fevziye Irem Eyiokur, Dogucan Yaman, Leonard  | Computer Vision – ECCV 2022 Workshops, Tel Aviv, Israel, October 23–27, 2022, Proceedings, Part VI | 
| Error correction and extraction in request dialogs | Stefan Constantin, Alex Waibel | 5th International Conference on Natural Language and Speech Processing (ICNLSP 2022) | 
| Accent Conversion using Pre-trained Model and Synthesized Data from Voice Conversion | Tuan Nam Nguyen, Ngoc Quan Pham, Alexander Waibel | Interspeech 2022 | 
| Effective combination of pretrained models - KIT@IWSLT2022 | Ngoc-Quan Pham, Tuan Nam Nguyen, Thai-Binh Nguyen, Danni Liu, Carlos Mullov, Jan Niehues, and Alexander Waibel | In Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022), pages 190–197 | 
| Adaptive multilingual speech recognition with pretrained models | Ngoc-Quan Pham, Alexander Waibel, Jan Niehues | Proceedings Interspeech 2022, 3879-3883 | 
| Exposure Correction Model to Enhance Image Quality | F. Eyiokur, Dogucan Yaman, Hazim Kemal Ekenel, Alexander Waibel | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Alpha Matte Generation from Single Input for Portrait Matting | Dogucan Yaman, Hazim Kemal Ekenel, Alexander Waibel | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition |
| Where did I leave my keys? — Episodic-Memory-Based Question Answering on Egocentric Videos | Leonard Bärmann, Alex Waibel | CVPR 2022 |
| Title | Author | Source | 
|---|---|---|
| Findings of the IWSLT 2021 evaluation campaign | Antonios Anastasopoulos, Ondřej Bojar, Jacob Bremerman, Roldano Cattoni, Maha Elbayad, Marcello Federico, Xutai Ma, Satoshi Nakamura, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Sebastian Stüker, Katsuhito Sudoh, Marco Turchi, Alexander Waibel, Changhan Wang, Matthew Wiesner | Proceedings of the 18th IWSLT, Bangkok - Thailand | 
| Deep Episodic Memory for Verbalization of Robot Experience | Leonard Bärmann, Fabian Peller-Konrad, Stefan Constantin, Tamim Asfour, Alex Waibel | IEEE Robotics and Automation Letters |
| Value-Based Reinforcement Learning for Sequence-to-Sequence Models | Fabian Retkowski, Alex Waibel | Adaptive and Learning Agents Workshop | 
| Efficient Weight factorization for Multilingual Speech Recognition | Ngoc-Quan Pham, Tuan-Nam Nguyen, Sebastian Stüker, Alex Waibel | Proceedings of INTERSPEECH 2021 |
| Multilingual Speech Translation KIT@ IWSLT2021 | Ngoc-Quan Pham, Tuan Nam Nguyen, Thanh-Le Ha, Sebastian Stüker, Alexander Waibel, Dan He | Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021) | 
| Super-Human Performance in Online Low-latency Recognition of Conversational Speech | Thai-Son Nguyen, Sebastian Stüker, Alex Waibel | Interspeech 2021 |
| Text and Synthetic Data for Domain Adaptation in End-to-End Speech Recognition | Alexander Waibel, Juan Hussain, Christian Huber, Sebastian Stüker | SPECOM: International Conference on Speech and Computer | 
| KIT’s IWSLT 2021 Offline Speech Translation System | Alexander Waibel, Tuan-Nam Nguyen, Thai-Son Nguyen, Christian Huber, Maximilian Awiszus, Ngoc-Quan Pham, Thanh-Le Ha, Felix Schneider, Sebastian Stüker | Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021) |
| CAGAN: Text-To-Image Generation with Combined Attention Generative Adversarial Networks | Henning Schulze, Dogucan Yaman, and Alexander Waibel | DAGM German Conference on Pattern Recognition | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2020 evaluation campaign | Ebrahim Ansari, Amittai Axelrod, Nguyen Bach, Ondřej Bojar, Roldano Cattoni, Fahim Dalvi, Nadir Durrani, Marcello Federico, Christian Federmann, Jiatao Gu, Fei Huang, Kevin Knight, Xutai Ma, Ajay Nagesh, Matteo Negri, Jan Niehues, Juan Pino, Elizabeth Salesky, Xing Shi, Sebastian Stüker, Marco Turchi, Alexander Waibel, Changhan Wang | Proceedings of the 17th IWSLT - online | 
| Low Latency ASR for Simultaneous Speech Translation | Nguyen, Thai Son; Niehues, Jan; Cho, Eunah; Ha, Thanh-Le; Kilgour, Kevin; Müller, Markus; Sperber, Matthias; Stueker, Sebastian; Waibel, Alex | arXiv:2003.09891 |
| KIT’s IWSLT 2020 SLT Translation System | Ngoc-Quan Pham, Felix Schneider, Tuan-Nam Nguyen, Thanh-Le Ha, Thai-Son Nguyen, Maximilian Awiszus, Sebastian Stüker, Alex Waibel | Proceedings of the 17th International Conference on Spoken Language Translation (IWSLT 2020) |
| Relative Positional Encoding for Speech Recognition and Direct Translation | Ngoc-Quan Pham, Thanh-Le Ha, Tuan-Nam Nguyen, Thai-Son Nguyen, Elizabeth Salesky, Sebastian Stüker, Jan Niehues, Alex Waibel | INTERSPEECH 2020 |
| German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis | Juan Hussain, Mohammed Mediani, Moritz Behr, M Amin Cheragui, Sebastian Stüker, Alex Waibel | Proceedings of the Fifth Arabic Natural Language Processing Workshop. Online. |
| Supervised Adaptation of Sequence-to-Sequence Speech Recognition Systems using Batch-Weighting | Christian Huber, Juan Hussain, Tuan-Nam Nguyen, Kaihang Song, Sebastian Stüker, Alexander Waibel | The 1st Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 10th International Joint Conference on Natural Language Processing - AACL-IJCNLP 2020 | 
| High Performance Sequence-to-Sequence Model for Streaming Speech Recognition | Thai-Son Nguyen, Ngoc-Quan Pham, Sebastian Stüker, Alex Waibel | Interspeech 2020 | 
| Super-Human Performance in Online Low-latency Recognition of Conversational Speech | Thai-Son Nguyen, Sebastian Stueker, Alex Waibel | https://arxiv.org/abs/2010.03449 | 
| ELITR Non-Native Speech Translation at IWSLT 2020 | Dominik Macháček, Jonáš Kratochvíl, Sangeet Sagar, Matúš Žilinec, Ondřej Bojar, Thai-Son Nguyen, Felix Schneider, Philip Williams, Yuekun Yao | 17th International Conference on Spoken Language Translation – IWSLT, colocated with ACL 2020 |
| Towards Stream Translation: Adaptive Computation Time for Simultaneous Machine Translation | Felix Schneider and Alex Waibel | 17th International Conference on Spoken Language Translation – IWSLT, colocated with ACL 2020 | 
| DaCToR: A Data Collection Tool for the RELATER Project | Juan Hussain, Oussama Zenkri, Sebastian Stüker, Alex Waibel | In Proceedings of the 12th Language Resources and Evaluation Conference – LREC 2020 | 
| Improving Sequence-To-Sequence Speech Recognition Training with On-The-Fly Data Augmentation | Thai-Son Nguyen, Sebastian Stüker, Jan Niehues, Alex Waibel | 45th International Conference on Acoustics, Speech, and Signal Processing – ICASSP | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Modeling Confidence in Sequence-to-Sequence Models | Jan Niehues and Ngoc Quan Pham | 12th International Conference on Natural Language Generation - INLG | 
| KIT's Submission to the IWSLT 2019 Shared Task on Text Translation | Felix Schneider and Alex Waibel | 16th International Workshop on Spoken Language Translation 2019 - IWSLT | 
| The IWSLT 2019 Evaluation Campaign | J. Niehues, R. Cattoni, S. Stüker, M. Negri, M. Turchi, T. Ha, E. Salesky, R. Sanabria, L. Barrault, L. Specia, M. Federico | 16th International Workshop on Spoken Language Translation 2019 – IWSLT | 
| The IWSLT 2019 KIT Speech Translation System | Ngoc-Quan Pham, Thai-Son Nguyen, Thanh-Le Ha, Juan Hussain, Felix Schneider, Jan Niehues, Sebastian Stüker, Alexander Waibel | 16th International Workshop on Spoken Language Translation 2019 – IWSLT | 
| Improving Zero-shot Translation with Language-Independent Constraints | Ngoc-Quan Pham, Jan Niehues, Thanh-Le Ha, Alex Waibel | 4th Conference in Machine Translation (WMT), ACL 2019 | 
| An Interactive Indoor Drone Assistant | Tino Fuhrman, David Schneider, Felix Altenberg, Tung Nguyen, Simon Blasen, Stefan Constantin, Alex Waibel | IEEE/RSJ International Conference on Intelligent Robots and Systems - IROS 2019 | 
| Neural Codes to Factor Language in Multilingual Speech Recognition | Markus Müller, Sebastian Stüker, Alex Waibel | International Conference on Acoustics, Speech, and Signal Processing - ICASSP | 
| Multi-task learning to improve natural language understanding | Stefan Constantin, Jan Niehues, Alex Waibel | International Workshop on Spoken Dialogue Systems Technology – IWSDS | 
| Toward Cross-Domain Speech Recognition with End-to-End Models | Thai-Son Nguyen, Sebastian Stüker, Alex Waibel | Life Long Learning for Spoken Language Systems Workshop colocated with ASRU 2019 |
| Bimodal Speech Emotion Recognition Using Pre-Trained Language Models | Verena Heusser, Niklas Freymuth, Stefan Constantin, Alex Waibel | Life Long Learning for Spoken Language Systems Workshop / IEEE Automatic Speech Recognition and Understanding Workshop – ASRU 2019 | 
| Very Deep Self-Attention Networks for End-to-End Speech Recognition | Ngoc-Quan Pham, Thai-Son Nguyen, Jan Niehues, Markus Müller, Sebastian Stüker, Alex Waibel | The 20th Annual Conference of the International Speech Communication Association, INTERSPEECH 2019 | 
| Incremental processing of noisy user utterances in the spoken language understanding task | Stefan Constantin, Jan Niehues, Alex Waibel | The 5th Workshop on Noisy User-generated Text – W-NUT / 2019 Conference on Empirical Methods in Natural Language Processing – EMNLP | 
| Title of Paper | Author | Conference | 
|---|---|---|
| The IWSLT 2018 Evaluation Campaign | J. Niehues, R. Cattoni, S. Stüker, M. Cettolo, M. Turchi, M. Federico | 15th International Workshop on Spoken Language Translation (IWSLT 2018) | 
| KIT’s IWSLT 2018 SLT Translation System | Matthias Sperber, Ngoc Quan Pham, Thai Son Nguyen, Jan Niehues, Markus Müller, Thanh-Le Ha, Sebastian Stüker, Alex Waibel | 15th International Workshop on Spoken Language Translation (IWSLT 2018) |
| Self-Attentional Acoustic Models | Matthias Sperber, Jan Niehues, Graham Neubig, Sebastian Stüker, Alex Waibel | Interspeech 2018 | 
| Neural Language Codes for Multilingual Acoustic Models | Markus Müller, Sebastian Stüker, Alex Waibel | Interspeech 2018 | 
| Low-Latency Neural Speech Translation | Jan Niehues, Ngoc-Quan Pham, Thanh-Le Ha, Matthias Sperber, Alex Waibel | Interspeech 2018 | 
| The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2018 | Ngoc-Quan Pham, Jan Niehues, Alex Waibel | EMNLP 2018 Third Conference On | 
| Parameter Optimization for CTC Acoustic Models in a Less-resourced Scenario: An Empirical Study | Markus Müller, Sebastian Stüker, Alex Waibel | 13th ITG Conference on Speech Communication | 
| KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning | Florian Dessloch, Thanh-Le Ha, Markus Müller, Jan Niehues, Thai-Son Nguyen, Ngoc-Quan Pham, | COLING 2018 | 
| Building Real-time Speech Recognition without CMVN | Thai Son Nguyen, Matthias Sperber, Sebastian Stüker, Alex Waibel | 20th International Conference on Speech and Computer – SPECOM 2018 |
| Towards one-shot learning for rare-word translation with external experts | Ngoc-Quan Pham, Jan Niehues, Alex Waibel | 2nd Workshop on Neural Machine Translation and Generation – WNMT 2018 | 
| An End-to-End Goal-Oriented Dialog System with a Generative Natural Language Response Generation | Stefan Constantin, Jan Niehues, Alex Waibel | International Workshop on Spoken Dialogue Systems Technology – IWSDS 2018 | 
| Inspection of Multilingual Neural Machine Translation | Carlos Mullov, Jan Niehues, Alexander Waibel | International Conference on Language Resources and Evaluation 2018 - LREC | 
| Automated Evaluation of Out-of-Context Errors | Patrick Huber, Jan Niehues, Alex Waibel | International Conference on Language Resources and Evaluation 2018 - LREC | 
| KIT-Multi: A Translation-Oriented Multilingual Embedding Corpus | Thanh-Le Ha, Jan Niehues, Matthias Sperber, Ngoc Quan Pham, Alex Waibel | International Conference on Language Resources and Evaluation 2018 - LREC | 
| Enhancing Multilingual Graphemic RNN-Based ASR-Systems Using Phone Information | Markus Müller, Sebastian Stüker, Alex Waibel | 29. Konferenz Elektronische Sprachsignalverarbeitung ESSV - 2018 | 
| Exploring CTC-Network Derived Features With Conventional Hybrid System | Thai-Son Nguyen, Sebastian Stüker, Alex Waibel | International Conference on Acoustics, Speech, and Signal Processing 2018 - ICASSP | 
| Multilingual Adaptation Of RNN Based ASR Systems | Markus Müller, Sebastian Stüker, Alex Waibel | International Conference on Acoustics, Speech, and Signal Processing 2018 - ICASSP | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2017 evaluation campaign | M. Cettolo, M. Federico, L. Bentivogli, J. Niehues, S. Stüker, K. Sudoh, K. Yoshino, C. Federmann | Proceedings of the 14th IWSLT, Tokyo - Japan | 
| Effective Strategies in Zero-Shot Neural Machine Translation | Thanh-Le Ha, Jan Niehues, Alex Waibel | International Workshop on Spoken Language Translation 2017 | 
| Toward Robust Neural Machine Translation for Noisy Input Sequences | Matthias Sperber, Jan Niehues, Alex Waibel | International Workshop on Spoken Language Translation 2017 |
| KIT’s Multilingual Neural Machine Translation systems for IWSLT 2017 | Ngoc-Quan Pham, Matthias Sperber, Elizabeth Salesky, Thanh-Le Ha, Jan Niehues, Alex Waibel | International Workshop on Spoken Language Translation 2017 | 
| The 2017 KIT IWSLT Speech-to-Text Systems for English and German | Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Sebastian Stüker, Alex Waibel | International Workshop on Spoken Language Translation 2017 | 
| Domain-independent Punctuation and Segmentation Insertion | Eunah Cho, Jan Niehues, Alex Waibel | International Workshop on Spoken Language Translation 2017 | 
| Overview of the IWSLT 2017 Evaluation Campaign | M. Cettolo, M. Federico, L. Bentivogli, J. Niehues, S. Stüker, K. Sudoh, K. Yoshino, C. Federmann | International Workshop on Spoken Language Translation 2017 | 
| DBLSTM Based Multilingual Articulatory Feature Extraction For Language Documentation | Markus Müller, Sebastian Stüker, Alex Waibel | IEEE Automatic Speech Recognition and Understanding Workshop 2017 (ASRU 2017) | 
| Improved Speaker Adaptation by Combining I-Vector and fMLLR with Deep Bottleneck Networks | Thai Son Nguyen, Kevin Kilgour, Matthias Sperber, Alex Waibel | SPECOM 2017, Hatfield, England, Sep. 12-16, 2017 | 
| Language Adaptive Multilingual CTC Speech Recognition | Markus Müller, Sebastian Stüker, Alex Waibel | SPECOM 2017, Hatfield, England, Sep. 12-16, 2017 | 
| Neural Lattice-to-Sequence Models for Uncertain Inputs | Matthias Sperber, Graham Neubig, Jan Niehues, Alex Waibel | EMNLP 2017, Copenhagen, Denmark, Sep. 07-11, 2017 | 
| The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2017 | Ngoc-Quan Pham, Jan Niehues, Thanh-Le Ha, Eunah Cho, Matthias Sperber, Alex Waibel | WMT 2017, Copenhagen, Denmark, Sep. 7-8, 2017 | 
| The QT21 Combined Machine Translation System for English to Latvian | Jan-Thorsten Peter, Hermann Ney, Ondrej Bojar, Ngoc-Quan Pham, Jan Niehues, Alex Waibel, Franck Burlot, Francois Yvon, Marcis Pinnis, Valters Sics, Joost Bastings, Miguel Rios, Wilker Aziz, Philip Williams, Frederic Blain, Lucia Specia | WMT 2017, Copenhagen, Denmark, Sep. 7-8, 2017 |
| Exploiting Linguistic Resources for Neural Machine Translation Using Multi-task Learning | Jan Niehues, Eunah Cho | WMT 2017, Copenhagen, Denmark, Sep. 7-8, 2017 | 
| Comparison of Decoding Strategies for CTC Acoustic Models | Thomas Zenkel, Ramon Sanabria, Florian Metze, Jan Niehues, Matthias Sperber, Sebastian Stüker and Alex Waibel | Interspeech, Stockholm, Sweden, Aug. 20-24, 2017 | 
| Enhancing Backchannel Prediction Using Word Embeddings | Robin Ruede, Markus Müller, Sebastian Stüker, Alex Waibel | Interspeech, Stockholm, Sweden, Aug. 20-24, 2017 | 
| NMT-based Segmentation and Punctuation Insertion for Real-time Spoken Language Translation | Eunah Cho, Jan Niehues, Alex Waibel | Interspeech 2017 – Situated interaction, Stockholm, Sweden. 20th - 24th August 2017. | 
| Analyzing Neural MT Search and Model Performance | Jan Niehues, Eunah Cho, Thanh-Le Ha, Alex Waibel | ACL, Vancouver, Canada, July 30 - Aug. 04, 2017 | 
| Yeah, Right, Uh-Huh: A Deep Learning Backchannel Predictor | Robin Ruede, Markus Müller, Sebastian Stüker, Alex Waibel | International Workshop on Spoken Dialogue Systems Technology 2017, Farmington, Pennsylvania, USA. 6th - 9th June, 2017 | 
| Improving phoneme set discovery for documenting unwritten languages | Markus Müller, Jörg Franke, Sebastian Stüker, Alex Waibel | Proceedings of the 28th International Conference on Electronic Speech and Signal Processing (ESSV), Saarbrücken, Germany, March 15-17, 2017 |
| Towards phoneme inventory discovery for documentation of unwritten languages | Markus Müller, Jörg Franke, Alex Waibel, Sebastian Stüker | Proceedings of the 42nd International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, USA, March 5-9, 2017 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2016 evaluation campaign | M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, R. Cattoni, M. Federico | Proceedings of the 13th IWSLT, Seattle - USA | 
| Pre-Translation for Neural Machine Translation | Niehues, Jan; Cho, Eunah; Ha, Thanh-Le; Waibel, Alex | arXiv:1610.05243 | 
| An empirical exploration of CTC acoustic models | Yajie Miao, Mohammad Gowayyed, Xingyu Na, Tom Ko, Florian Metze, Alexander Waibel | 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) | 
| Lightly Supervised Quality Estimation | Matthias Sperber, Graham Neubig, Jan Niehues, Sebastian Stüker, Alex Waibel | COLING 2016, Osaka, Japan, Dec. 11-16, 2016 | 
| The 2016 KIT IWSLT Speech-to-Text Systems for English and German | Thai-Son Nguyen, Markus Müller, Matthias Sperber, Thomas Zenkel, Kevin Kilgour, Sebastian Stüker, Alex Waibel | Proceedings of the 13th International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 8-9, 2016 |
| Adaptation and Combination of NMT Systems: The KIT Translation Systems for IWSLT 2016 | Eunah Cho, Jan Niehues, Thanh-Le Ha, Matthias Sperber, Mohammed Mediani, Alex Waibel | Proceedings of the 13th International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 8-9, 2016 |
| Multilingual Disfluency Removal using NMT | Eunah Cho, Jan Niehues, Thanh-Le Ha, Alex Waibel | Proceedings of the 13th International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 8-9, 2016 |
| Towards Improving Low-Resource Speech Recognition Using Articulatory and Language Features | Markus Müller, Sebastian Stüker, Alex Waibel | Proceedings of the 13th International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 8-9, 2016 |
| Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder | Thanh-Le Ha, Jan Niehues, Alex Waibel | Proceedings of the 13th International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 8-9, 2016 |
| Audio Segmentation for Robust Real-Time Speech Recognition Based on Neural Networks | Micha Wetzel, Matthias Sperber, Alex Waibel | Proceedings of the 13th International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 8-9, 2016 |
| Integrating Encyclopedic Knowledge into Neural Language Models | Yang Zhang, Jan Niehues, Alex Waibel | Proceedings of the 13th International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 8-9, 2016 | 
| The IWSLT 2016 Evaluation Campaign | Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Marcello Federico | Proceedings of the 13th International Workshop on Spoken Language Translation (IWSLT), Seattle, USA, December 8-9, 2016 | 
| Using Tweets as "Ice-Breaking" Sentences in a Social Dialog System | Aleksandar Andonov, Maria Schmidt, Jan Niehues, Alex Waibel | Proceedings of the 12th ITG Conference on Speech Communication, Paderborn, Germany, October 5-7, 2016 | 
| Phoneme Boundary Detection using Deep Bidirectional LSTMs | Jörg Franke, Markus Müller, Fatima Hamlaoui, Sebastian Stüker, Alex Waibel | Proceedings of the 12th ITG Conference on Speech Communication, Paderborn, Germany, October 5-7, 2016 |
| Language Feature Vectors for Resource Constraint Speech Recognition | Markus Müller, Sebastian Stüker, Alex Waibel | Proceedings of the 12th ITG Conference on Speech Communication, Paderborn, Germany, October 5-7, 2016 | 
| Training Deep Neural Networks for Reverberation Robust Speech Recognition | Marvin Ritter, Markus Müller, Sebastian Stüker, Florian Metze, Alex Waibel | Proceedings of the 12th ITG Conference on Speech Communication, Paderborn, Germany, October 5-7, 2016 | 
| Dynamic Transcription for Low-latency Speech Translation | Jan Niehues, Thai Son Nguyen, Eunah Cho, Thanh-Le Ha, Kevin Kilgour, Markus Müller, Matthias Sperber, Sebastian Stüker, Alex Waibel | Proceedings of the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH), San Francisco, USA, September 8 - 12, 2016 | 
| Language Adaptive DNNs for Improved Low Resource Speech Recognition | Markus Müller, Sebastian Stüker, Alex Waibel | Proceedings of the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH), San Francisco, USA, September 8-12, 2016 | 
| Unsupervised Phoneme Segmentation of Previously Unseen Languages | Marco Vetter, Markus Müller, Fatima Hamlaoui, Graham Neubig, Satoshi Nakamura, Sebastian Stüker, Alex Waibel | Proceedings of the 17th Annual Conference of the International Speech Communication Association (INTERSPEECH), San Francisco, USA, September 8-12, 2016 | 
| Using Factored Word Representation in Neural Network Language Models | Jan Niehues, Thanh-Le Ha, Eunah Cho, Alex Waibel | Proceedings of the ACL 2016 1st Conference on Machine Translation (WMT), Berlin, Germany, August 11 - 12, 2016 | 
| The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2016 | Thanh-Le Ha, Eunah Cho, Jan Niehues, Mohammed Mediani, Matthias Sperber, Alexandre Allauzen, Alex Waibel | Proceedings of the ACL 2016 1st Conference on Machine Translation (WMT), Berlin, Germany, August 11 - 12, 2016 | 
| The QT21/HimL Combined Machine Translation System | Jan-Thorsten Peter, Tamer Alkhouli, Hermann Ney, Matthias Huck, Fabienne Braune, Alexander Fraser, Ales Tamchyna, Ondrej Bojar, Barry Haddow, Rico Sennrich, Frederic Blain, Lucia Specia, Jan Niehues, Alex Waibel, Alexandre Allauzen, Lauriane Aufrant, Franck Burlot, Elena Knyazeva, Thomas Lavergne, Francois Yvon, Stella Frank, Marcis Pinnis | Proceedings of the ACL 2016 1st Conference on Machine Translation (WMT), Berlin, Germany, August 11 - 12, 2016 | 
| Lecture Translator - Speech Translation Framework for Simultaneous Lecture Translation | Markus Müller, Thai Son Nguyen, Jan Niehues, Eunah Cho, Bastian Krüger, Thanh-Le Ha, Kevin Kilgour, Matthias Sperber, Mohammed Mediani, Sebastian Stüker, Alex Waibel | Proceedings of the 15th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL), San Diego, USA, June 12-17, 2016 | 
| Evaluation of the KIT Lecture Translation System | Markus Müller, Sarah Fünfer, Sebastian Stüker, Alex Waibel | Proceedings of the 10th Language Resources and Evaluation Conference (LREC), Portoroz, Slovenia, May 23-28, 2016 | 
| Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces | Matthias Sperber, Graham Neubig, Satoshi Nakamura, Alex Waibel | Proceedings of the 10th Language Resources and Evaluation Conference (LREC), Portoroz, Slovenia, May 23-28, 2016 | 
| Towards an Open-Domain Social Dialog System | Maria Schmidt, Jan Niehues, Alex Waibel | Proceedings of the 7th International Workshop Series on Spoken Dialogue System Technology (IWSDS), Saariselkä, Finland, January 13-16, 2016 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2015 evaluation campaign | Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Roldano Cattoni, Marcello Federico | Proceedings of the 12th IWSLT, Da Nang - Vietnam | 
| Preparing Children's Writing Database for Automated Processing | Rémi Lavalley, Kay Berkling, Sebastian Stüker | Proceedings of the L1 Teaching, Learning and Technology (L1TLT, Satellite of SLaTE), Leipzig, Germany, September 4-5, 2015 | 
| The IWSLT 2015 Evaluation Campaign | Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Roldano Cattoni, Marcello Federico | Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), Da Nang, Vietnam, December 3-4, 2015 | 
| Source Discriminative Word Lexicon for Translation Disambiguation | Teresa Herrmann, Jan Niehues, Alex Waibel | Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), Da Nang, Vietnam, December 3-4, 2015 | 
| The KIT Translation Systems for IWSLT 2015 | Thanh-Le Ha, Jan Niehues, Eunah Cho, Mohammed Mediani, Alex Waibel | Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), Da Nang, Vietnam, December 3-4, 2015 | 
| Multifeature Modular Deep Neural Network Acoustic Models | Kevin Kilgour, Alex Waibel | Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), Da Nang, Vietnam, December 3-4, 2015 | 
| Punctuation Insertion for Real-time Spoken Language Translation | Eunah Cho, Jan Niehues, Kevin Kilgour, Alex Waibel | Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), Da Nang, Vietnam, December 3-4, 2015 | 
| The 2015 KIT IWSLT Speech-to-Text Systems for English and German | Markus Müller, Thai-Son Nguyen, Matthias Sperber, Kevin Kilgour, Sebastian Stüker, Alex Waibel | Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), Da Nang, Vietnam, December 3-4, 2015 | 
| Using Language Adaptive Deep Neural Networks for Improved Multilingual Speech Recognition | Markus Müller, Alex Waibel | Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), Da Nang, Vietnam, December 3-4, 2015 | 
| The Karlsruhe Institute of Technology Translation Systems for the WMT 2015 | Eunah Cho, Thanh-Le Ha, Jan Niehues, Teresa Herrmann, Mohammed Mediani, Yuqi Zhang, Alex Waibel | Proceedings of the 10th Workshop on Statistical Machine Translation (WMT), Lisboa, Portugal, September 17-18, 2015 | 
| ListNet-based MT Rescoring | Jan Niehues, Quoc Khanh Do, Alexandre Allauzen, Alex Waibel | Proceedings of the 10th Workshop on Statistical Machine Translation (WMT), Lisboa, Portugal, September 17-18, 2015 | 
| The KIT-LIMSI Translation System for WMT 2015 | Thanh-Le Ha, Quoc Khanh Do, Eunah Cho, Jan Niehues, Alexandre Allauzen, Francois Yvon, Alex Waibel | Proceedings of the 10th Workshop on Statistical Machine Translation (WMT), Lisboa, Portugal, September 17-18, 2015 | 
| Combination of NN and CRF Models for Joint Detection of Punctuation and Disfluencies | Eunah Cho, Kevin Kilgour, Jan Niehues, Alex Waibel | Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH), Dresden, Germany, September 6-10, 2015 | 
| Gaussian Free Cluster Tree Construction using Deep Neural Network | Linchen Zhu, Kevin Kilgour, Sebastian Stüker, Alex Waibel | Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH), Dresden, Germany, September 6-10, 2015 | 
| Evaluation of Crowdsourced User Input Data for Spoken Dialog Systems | KIT: Maria Schmidt, Markus Müller, Martin Wagner, Sebastian Stüker and Alex Waibel. DAIMLER AG: Hansjörg Hofmann, Steffen Werner | 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), Prague, Czech Republic, September 2-4, 2015 | 
| Using Neural Networks for Data-Driven Backchannel Prediction: A Survey on Input Features and Training Techniques | Markus Müller, David Leuschner, Lars Briem, Maria Schmidt, Kevin Kilgour, Sebastian Stüker, Alex Waibel | 2nd International Conference on Learning and Collaboration Technologies in association with HCI International 2015, Los Angeles, California, USA, August 2-7, 2015 | 
| A Semi-Automatic Word-Level Annotation and Transcription Tool for Spelling Error Categories | Ludwig Linhuber, Sebastian Stüker, Rémi Lavalley, Kay Berkling | 2nd International Conference on Learning and Collaboration Technologies in association with HCI International 2015, Los Angeles, California, USA, August 2-7, 2015 | 
| Stripping Adjectives: Integration Techniques for Selective Stemming in SMT Systems | Isabel Slawik, Jan Niehues, Alex Waibel | Proceedings of the 18th Annual Conference of the European Association for Machine Translation (EAMT 2015), Antalya, Turkey, May 11-13, 2015 | 
| Effectiveness of Histogram Equalization and SyDOCC Features on Speech Recognition Performance on a Real-World Noisy Speech Task | Markus Müller, Martin Wagner, Juan Hussain, Sebastian Stüker, Alex Waibel | 41st DAGA-Conference, annual general meeting of the DEGA (German Acoustical Society) (DAGA 2015), Nuremberg, Germany, March 16-19, 2015 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2014 evaluation campaign | Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Marcello Federico | Proceedings of the 11th IWSLT, Lake Tahoe - USA | 
| Extracting Translation Pairs from Social Network Content | Matthias Eck, Yury Zemlyanskiy, Joy Zhang, Alexander Waibel | Proceedings of the Spoken Language Technology Workshop (SLT 2014), Lake Tahoe, USA, December 7-10, 2014 | 
| Sesla transcriber: A speech transcription tool that adapts to your skill and time budget | Matthias Sperber, Graham Neubig, Satoshi Nakamura, Alexander Waibel | Proceedings of the Spoken Language Technology Workshop (SLT 2014), Lake Tahoe, USA, December 7-10, 2014 | 
| On-the-fly user modeling for cost-sensitive correction of speech transcripts | Matthias Sperber, Graham Neubig, Satoshi Nakamura, Alexander Waibel | Proceedings of the Spoken Language Technology Workshop (SLT 2014), Lake Tahoe, USA, December 7-10, 2014 | 
| Combined Spoken Language Translation | Markus Freitag, Joern Wuebker, Stephan Peitz, Hermann Ney, Matthias Huck, Alexandra Birch, Nadir Durrani, Philipp Koehn, Mohammed Mediani, Isabel Slawik, Jan Niehues, Eunah Cho, Alex Waibel, Nicola Bertoldi, Mauro Cettolo, Marcello Federico | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| Report on the 11th IWSLT Evaluation Campaign | Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Marcello Federico | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| The 2014 KIT IWSLT Speech-to-Text Systems for English, German and Italian | Kevin Kilgour, Michael Heck, Markus Müller, Matthias Sperber, Sebastian Stüker, Alexander Waibel | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| The KIT Translation Systems for IWSLT 2014 | Isabel Slawik, Mohammed Mediani, Jan Niehues, Yuqi Zhang, Eunah Cho, Teresa Herrmann, Thanh-Le Ha, Alex Waibel | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| Improving In-Domain Data Selection for Small In-Domain Sets | Mohammed Mediani, Joshua Winebarger, Alexander Waibel | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| Rule-Based Preordering on Multiple Syntactic Levels in Statistical Machine Translation | Ge Wu, Yuqi Zhang, Alexander Waibel | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| Lexical Translation Model using A Deep Neural Network Architecture | Thanh-Le Ha, Jan Niehues, Alexander Waibel | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| Machine Translation of Multi-party Meetings: Segmentation and Disfluency Removal Strategies | Eunah Cho, Jan Niehues, Alex Waibel | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| Multilingual Deep Bottle Neck Features: A Study on Language Selection and Training Techniques | Markus Müller, Sebastian Stüker, Zaid Sheikh, Florian Metze, Alex Waibel | Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), Lake Tahoe, USA, December 4-5, 2014 | 
| A Neural Network Keyword Search System for Telephone Speech | Kevin Kilgour, Alex Waibel | 16th International Conference on Speech and Computer, SPECOM 2014, Novi Sad, Serbia, October 5-9, 2014 | 
| Adapting Automatic Speech Recognition for Foreign Language Learners in a Serious Game | Joshua Winebarger, Sebastian Stüker, Alexander Waibel | Proceedings of the 3rd Workshop on Games and NLP (GAMNLP-14), North Carolina State University, Raleigh, NC, USA, October 3rd 2014 | 
| The Speech Recognition Virtual Kitchen: Launch Party | Andrew Plummer, Eric Riebling, Anuj Kumar, Florian Metze, Eric Fosler-Lussier, Rebecca Bates | Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, September 14-18, 2014 | 
| Query-By-Example Spoken Term Detection on Multilingual Unconstrained Speech | Xavier Anguera, Luis Javier Rodriguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, Mikel Penagarikano | Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, September 14-18, 2014 | 
| Neural Network Language Models for Low Resource Languages | Ankur Gandhe, Florian Metze, Ian Lane | Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, September 14-18, 2014 | 
| Towards Speaker Adaptive Training of Deep Neural Network Acoustic Models | Yajie Miao, Hao Zhang, Florian Metze | Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, September 14-18, 2014 | 
| Improving Language-Universal Feature Extraction with Deep Maxout and Convolutional Neural Networks | Yajie Miao, Florian Metze | Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, September 14-18, 2014 | 
| Distributed Learning of Multilingual DNN Feature Extractors Using GPUs | Yajie Miao, Hao Zhang, Florian Metze | Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, September 14-18, 2014 | 
| Word-Based Probabilistic Phonetic Retrieval for Low-Resource Spoken Term Detection | Di Xu, Florian Metze | Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, September 14-18, 2014 | 
| An In-Depth Comparison of Keyword Specific Thresholding and Sum-to-One Score Normalization | Yun Wang, Florian Metze | Proceedings of the 15th Annual Conference of the International Speech Communication Association (INTERSPEECH 2014), Singapore, September 14-18, 2014 | 
| A Multisensory Non-Invasive System for Laughter Analysis | Sarah Cosentino, Susanne Burger, Lara Martin, Florian Metze, Tatsuhiro Kishi, Kenji Hashimoto, Salvatore Sessa, Massimiliano Zecca, Atsuo Takanishi | 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBS), Chicago, USA, August 26-30, 2014 | 
| Improved Audio Features for Large-Scale Multimedia Event Detection | Florian Metze, Shourabh Rawat, Yipei Wang | IEEE International Conference on Multimedia & Expo (ICME), Chengdu, China, July 14-18, 2014 | 
| Query-By-Example Spoken Term Detection Evaluation on Low-Resource Languages | Xavier Anguera Miro, Luis Javier Rodriguez-Fuentes, Igor Szöke, Andi Buzo, Florian Metze, Mikel Penagarikano | 4th International Workshop on Spoken Language Technologies for Under-resourced Language (SLTU), St. Petersburg, Russia, May 14-16, 2014 | 
| Exploring Audio Semantic Concept for Event-Based Video Retrieval | Yipei Wang, Shourabh Rawat, Florian Metze | IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Florence, Italy, May 4 - 9, 2014 | 
| Semi-Automatic Audio Semantic Concept Discovery for Multimedia Retrieval | Yipei Wang, Shourabh Rawat, Florian Metze | IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Florence, Italy, May 4 - 9, 2014 | 
| Optimization of Neural Network Language Models for Keyword Search | Ankur Gandhe, Florian Metze, Alex Waibel, Ian Lane | IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Florence, Italy, May 4 - 9, 2014 | 
| Augmenting Translation Models with Simulated Acoustic Confusions for improved Spoken Language Translation | Yulia Tsvetkov, Florian Metze, Chris Dyer | Conference of the European Chapter of the Association for Computational Linguistics (EACL), Gothenburg, Sweden, April 26 - 30, 2014 | 
| Applause: A learning Tool for Low-Resource Languages | Nikolas Wolfe, Vinay Vyas Vemuri, Lara J. Martin, Florian Metze, Alan W. Black | ACM Conference on Human Factors in Computing Systems (CHI), Toronto, Canada, April 26 - May 1, 2014 | 
| Demystifying Development of Speech Recognizers for Novices | Anuj Kumar, Florian Metze, Eric Riebling, Matthew Kam | ACM Conference on Human Factors in Computing Systems (CHI), Toronto, Canada, April 26 - May 1, 2014 | 
| The Karlsruhe Institute of Technology Translation Systems for the WMT 2014 | Teresa Herrmann, Mohammed Mediani, Eunah Cho, Thanh-Le Ha, Jan Niehues, Isabel Slawik, Yuqi Zhang, Alex Waibel | Association for Computational Linguistics (ACL), Baltimore, USA, June 22 - 27, 2014 | 
| EU-Bridge MT: Combined Machine Translation | Markus Freitag, Stephan Peitz, Joern Wuebker, Hermann Ney, Matthias Huck, Rico Sennrich, Nadir Durrani, Maria Nadejde, Philip Williams, Philipp Koehn, Teresa Herrmann, Eunah Cho, Alex Waibel | Association for Computational Linguistics (ACL), Baltimore, USA, June 22 - 27, 2014 | 
| The KIT-LIMSI Translation System for WMT 2014 | Quoc Khanh Do, Teresa Herrmann, Jan Niehues, Alexandre Allauzen, Francois Yvon, Alex Waibel | Association for Computational Linguistics (ACL), Baltimore, USA, June 22 - 27, 2014 | 
| Combining Techniques from different NN-based Language Models for Machine Translation | Jan Niehues, Alexandre Allauzen, Francois Yvon, Alex Waibel | The eleventh biennial Conference of the Association for Machine Translation in the Americas (AMTA), Vancouver, Canada, October 22 - 26, 2014 | 
| Manual Analysis of Structurally Informed Reordering in German-English Machine Translation | Teresa Herrmann, Jan Niehues, Alex Waibel | Language Resources and Evaluation Conference (LREC 2014), Reykjavik, Iceland, May 26-31, 2014 | 
| A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation | Eunah Cho, Sarah Fünfer, Sebastian Stüker, Alex Waibel | Language Resources and Evaluation Conference (LREC 2014), Reykjavik, Iceland, May 26-31, 2014 | 
| A Database of Freely Written Texts of German School Students for the Purpose of Automatic Spelling Error Classification | K. Berkling, J. Fay, M. Ghayoomi, K. Hein, R. Lavalley, L. Linhuber, S. Stüker | Language Resources and Evaluation Conference (LREC 2014), Reykjavik, Iceland, May 26-31, 2014 | 
| Training Time Reduction and Performance Improvements from Multilingual Techniques on the Babel ASR Task | Sebastian Stüker, Markus Müller, Quoc Bao Nguyen, and Alex Waibel | IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2014), Florence, Italy, May 4-9, 2014 | 
| Multilingual Shifting Deep Bottleneck Features For Low-Resource ASR | Quoc Bao Nguyen, Jonas Gehring, Markus Müller, Sebastian Stüker, Alex Waibel | IEEE International Conference on Acoustics, Speech and Signal Processing, (ICASSP 2014), Florence, Italy, May 4-9, 2014 | 
| Tight Integration of Speech Disfluency Removal into SMT | Eunah Cho, Jan Niehues, Alex Waibel | The 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014), Gothenburg, Sweden, April 26-30, 2014 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2013 evaluation campaign | Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Marcello Federico | Proceedings of the 10th IWSLT, Heidelberg - Germany | 
| Using Web Text to improve Keyword Spotting in Speech | Ankur Gandhe, Long Qin, Florian Metze, Alexander Rudnicky, Ian Lane, Matthias Eck | IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2013), Olomouc, Czech Republic, December 8 - 12, 2013 | 
| Neighbour Selection and Adaption for Rapid Speaker-Dependent ASR | Udhyakumar Nallasamy, Mark Fuhs, Monika Woszczyna, Florian Metze, Tanja Schultz | IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2013), Olomouc, Czech Republic, December 8 - 12, 2013 | 
| Deep Maxout Networks for Low-Resource Speech Recognition | Yajie Miao, Florian Metze, Shourabh Rawat | IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2013), Olomouc, Czech Republic, December 8 - 12, 2013 | 
| Models of tone for tonal and non-tonal languages | Florian Metze, Zaid A. W. Sheikh, Alex Waibel, Jonas Gehring, Kevin Kilgour, Quoc Bao Nguyen, Van Huy Nguyen | IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2013), Olomouc, Czech Republic, December 8-12, 2013 | 
| Prosody-Based unsupervised Speech Summarization with two-Layer mutually reinforced Random Walk | Sujay Kumar Jauhar, Yun-Nung Chen, Florian Metze | The 6th International Joint Conference on Natural Language Processing (IJCNLP 2013), Nagoya, Japan, October 14 - 18, 2013 | 
| The Speech Recognition Virtual Kitchen | Florian Metze, Eric Fosler-Lussier, Rebecca Bates | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France, August 25 - 29, 2013 | 
| Multi-Layer Mutual Reinforced Random Walk with Hidden Parameters for Improved Multi-Party Meeting Summarization | Yun-Nung Chen, Florian Metze | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France, August 25 - 29, 2013 | 
| Improving Low-Resource CD-DNN-HMM using Dropout and Multilingual DNN Training | Yajie Miao, Florian Metze | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France, August 25 - 29, 2013 | 
| Formalizing Expert Knowledge for Developing Accurate Speech Recognizers | Anuj Kumar, Florian Metze, Wenyi Wang, Matthew Kam | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France, August 25 - 29, 2013 | 
| Robust Audio Codebooks for Large Scale Event Detection in Consumer Videos | Shourabh Rawat, Peter Schulam, Susanne Burger, Duo Ding, Yipei Wang, Florian Metze | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France, August 25 - 29, 2013 | 
| DNN Acoustic Modeling with Modular Multi-Lingual Feature Extraction Networks | Jonas Gehring, Quoc Bao Nguyen, Florian Metze, Alex Waibel | Automatic Speech Recognition and Understanding Workshop (ASRU 2013), Olomouc, Czech Republic, December 08-12, 2013 | 
| The 2013 KIT IWSLT Speech-to-Text Systems for German and English | Kevin Kilgour, Christian Mohr, Michael Heck, Quoc Bao Nguyen, Van Huy Nguyen, Evgeniy Shin, Igor Tseyzer, Jonas Gehring, Markus Müller, Matthias Sperber, Sebastian Stüker, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013) Heidelberg, December 5-6, 2013 | 
| EU-BRIDGE MT: Text Translation of Talks in the EU-BRIDGE Project | Markus Freitag, Stephan Peitz, Joern Wuebker, Hermann Ney, Nadir Durrani, Matthias Huck, Philipp Koehn, Thanh-Le Ha, Jan Niehues, Mohammed Mediani, Teresa Herrmann, Alex Waibel, Nicola Bertoldi, Mauro Cettolo, Marcello Federico | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013). Heidelberg, December 5-6, 2013 | 
| Maximum Entropy Language Modeling for Russian ASR | Evgeniy Shin, Sebastian Stüker, Kevin Kilgour, Christian Fügen, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013). Heidelberg, December 5-6, 2013 | 
| The 2013 KIT Quaero Speech-to-Text System for French | Joshua Winebarger, Bao Nguyen, Jonas Gehring, Sebastian Stüker and Alexander Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013). Heidelberg, December 5-6, 2013 | 
| Report on the 10th IWSLT Evaluation Campaign | Mauro Cettolo, Jan Niehues, Sebastian Stüker, Luisa Bentivogli, Marcello Federico | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013). Heidelberg, December 5-6, 2013 | 
| Incremental Unsupervised Training for University Lecture Recognition | Michael Heck, Sebastian Stüker, Sakriani Sakti, Alex Waibel, Satoshi Nakamura | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013). Heidelberg, December 5-6, 2013 | 
| The KIT Translation Systems for IWSLT 2013 | Thanh-Le Ha, Teresa Herrmann, Jan Niehues, Mohammed Mediani, Eunah Cho, Yuqi Zhang, Isabel Slawik and Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013). Heidelberg, December 5-6, 2013 | 
| Analyzing the Potential of Source Sentence Reordering in Statistical Machine Translation | Teresa Herrmann, Jochen Weiner, Jan Niehues, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013). Heidelberg, December 5-6, 2013 | 
| CRF-based Disfluency Detection using Semantic Features for German to English Spoken Language Translation | Eunah Cho, Thanh-Le Ha, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2013). Heidelberg, December 5-6, 2013 | 
| Optimizing Deep Bottleneck Feature Extraction | Quoc Bao Nguyen, Jonas Gehring, Kevin Kilgour, Alex Waibel | The 10th IEEE RIVF International Conference on Computing and Communication Technologies, Hanoi, November 10-13, 2013 | 
| Segmentation of Telephone Speech Based on Speech and Non-Speech Models | Michael Heck, Christian Mohr, Sebastian Stüker, Markus Müller, Kevin Kilgour, Jonas Gehring, Quoc Bao Nguyen, Van Huy Nguyen, and Alex Waibel | SPECOM 2013 (International Conference on Speech and Computer), September 1-5, 2013, Pilsen | 
| A Real-World System for Simultaneous Translation of German Lectures | Eunah Cho, Christian Fügen, Teresa Herrmann, Kevin Kilgour, Mohammed Mediani, Christian Mohr, Jan Niehues, Kay Rottmann, Christian Saam, Sebastian Stüker, Alex Waibel | Interspeech 2013, August 25-29, 2013, Lyon | 
| Efficient Speech Transcription Through Respeaking | Matthias Sperber, Graham Neubig, Christian Fügen, Satoshi Nakamura, Alex Waibel | Interspeech 2013, August 25-29, 2013, Lyon | 
| Slightly Supervised Adaptation of Acoustic Models on Captioned BBC Weather Forecasts | Christian Mohr, Christian Saam, Kevin Kilgour, Jonas Gehring, Sebastian Stüker, Alex Waibel | SLAM 2013 (First Workshop on Speech, Language and Audio in Multimedia), August 22-23, 2013, Marseille | 
| Joint WMT 2013 Submission of the QUAERO Project | Stephan Peitz, Saab Mansour, Matthias Huck, Markus Freitag, Hermann Ney, Eunah Cho, Teresa Herrmann, Mohammed Mediani, Jan Niehues, Alex Waibel, Alexandre Allauzen, Quoc Khanh Do, Bianka Buschbeck, Tonio Wandmacher | WMT 2013 (Eighth Workshop on Statistical Machine Translation), August 8-9, 2013, Sofia | 
| The Karlsruhe Institute of Technology Translation Systems for the WMT 2013 | Eunah Cho, Thanh-Le Ha, Mohammed Mediani, Jan Niehues, Teresa Herrmann, Isabel Slawik, Alex Waibel | ACL 2013 (The Association for Computational Linguistics), Sofia, August 4-9, 2013 | 
| Letter N-Gram-based Input Encoding for Continuous Space Language Models | Henning Sperr, Jan Niehues, Alexander Waibel | ACL 2013 (The Association for Computational Linguistics), Sofia, August 4-9, 2013 | 
| An MT Error-driven Discriminative Word Lexicon using Sentence Structure Features | Jan Niehues and Alex Waibel | ACL 2013 (The Association for Computational Linguistics), Sofia, August 4-9, 2013 | 
| Sequence-Discriminative Training of Deep Neural Networks | K. Vesely, A. Ghoshal, L. Burget, D. Povey | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France | 
| Improved Feature Processing for Deep Neural Networks | S. Rath, D. Povey, K. Vesely, J. Cernocky | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France | 
| Finding Recurrent Out-of-Vocabulary Words | L. Qin, A. Rudnicky | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France | 
| Semi-Supervised Manifold Learning Approaches for Spoken Term Verification | A. Norouzian, R. Rose, A. Jansen | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France | 
| Modular Combination of Deep Neural Networks for Acoustic Modeling | J. Gehring, W. Lee, K. Kilgour, I. Lane, Y. Miao, A. Waibel | Proceedings of the 14th Annual Conference of the International Speech Communication Association (INTERSPEECH 2013), Lyon, France | 
| Combining Word Reordering Methods on different Linguistic Abstraction Levels for Statistical Machine Translation | Teresa Herrmann, Jan Niehues, Alex Waibel | The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2013), Atlanta, June 09-15, 2013 | 
| Measuring the Structural Importance through Rhetorical Structure Index | Narine Kokhlikyan, Alex Waibel, Yuqi Zhang, Joy Ying Zhang | The 2013 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT 2013), Atlanta, June 09-15, 2013 | 
| Identification and Modeling of Word Fragments | Yulia Tsvetkov, Zaid Sheikh, Florian Metze | Proceedings of the 2013 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), May 26-31, 2013 | 
| Comparing RNNS and Log-Linear Interpolation of Improved Skip-Model on Four babel Languages: Cantonese, Pashto, Tagalog, Turkish | Mittul Singh, Dietrich Klakow | Proceedings of the 2013 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), May 26-31, 2013 | 
| Subspace Mixture Model For Low-Resource Speech Recognition In Cross-Lingual Settings | Yajie Miao, Florian Metze, Alex Waibel | Proceedings of the 2013 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), May 26-31, 2013 | 
| Learning Discriminative Basis Coefficients for Eigenspace MLLR Unsupervised Adaptation | Yajie Miao, Florian Metze, Alex Waibel | Proceedings of the 2013 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), May 26-31, 2013 | 
| Quantifying The Value Of Pronunciation Lexicons For Keyword Search In Low Resource Languages | Guoguo Chen, Sanjeev Khudanpur, Daniel Povey, Jan Trmal, David Yarowsky, Oguz Yilmaz | Proceedings of the 2013 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), May 26-31, 2013 | 
| Warped Minimum Variance Distortionless Response Based Bottle Neck Features for LVCSR | Kevin Kilgour, Igor Tseyzer, Quoc Bao Nguyen, Alex Waibel | Proceedings of the 2013 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), May 26-31, 2013 | 
| Extracting Deep Bottleneck Features Using Stacked Auto-Encoders | Jonas Gehring, Yajie Miao, Florian Metze, Alex Waibel | Proceedings of the 2013 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), May 26-31, 2013 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2012 evaluation campaign | M. Federico, M. Cettolo, L. Bentivogli, M. Paul, S. Stüker | Proceedings of the 9th IWSLT, Hong Kong | 
| Continuous Space Language Models using Restricted Boltzmann Machines | Jan Niehues, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2012). Hong Kong, December 6-7, 2012 | 
| The KIT Translation systems for IWSLT 2012 | Mohammed Mediani, Yuqi Zhang, Thanh-Le Ha, Jan Niehues, Eunah Cho, Teresa Herrmann, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2012). Hong Kong, December 6-7, 2012 | 
| Segmentation and Punctuation Prediction in Speech Language Translation Using a Monolingual Translation System | Eunah Cho, Jan Niehues, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2012). Hong Kong, December 6-7, 2012 | 
| The 2012 KIT and KIT-NAIST English ASR Systems for the IWSLT Evaluation | Christian Saam, Christian Mohr, Kevin Kilgour, Michael Heck, Matthias Sperber, Keigo Kubo, Sebastian Stüker, Sakriani Sakti, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2012). Hong Kong, December 6-7, 2012 | 
| The KIT-NAIST (Contrastive) English ASR System for IWSLT 2012 | Michael Heck, Keigo Kubo, Matthias Sperber, Sakriani Sakti, Sebastian Stüker, Christian Saam, Kevin Kilgour, Christian Mohr, Graham Neubig, Tomoki Toda, Satoshi Nakamura, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2012). Hong Kong, December 6-7, 2012 | 
| Evaluation of Interactive User Corrections for Lecture Transcription | Henrich Kolkhorst, Kevin Kilgour, Sebastian Stüker, Alex Waibel | Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2012). Hong Kong, December 6-7, 2012 | 
| Detailed Analysis of different Strategies for Phrase Table Adaptation in SMT | Jan Niehues, Alex Waibel | Proceedings of the American Machine Translation Association (AMTA), San Diego, California, October 28 - November 1, 2012 | 
| Automatische Zeichensetzung in Spracherkennungssystemen: Entscheidungsbaum und Sprachmodell im Vergleich | Heike Adel, Kevin Kilgour, Sebastian Stüker, Alex Waibel | Proceedings of the ESSV 2012, Cottbus, Germany, August 29-31, 2012 | 
| The Karlsruhe Institute of Technology Translation Systems for the WMT 2012 | Jan Niehues, Yuqi Zhang, Mohammed Mediani, Teresa Herrmann, Eunah Cho, Alex Waibel | Proceedings of the 7th Workshop on Statistical Machine Translation, Canada, June 7-8, 2012 | 
| Joint WMT 2012 Submission of the QUAERO Project | Markus Freitag, Stephan Peitz, Matthias Huck, Hermann Ney, Jan Niehues, Teresa Herrmann, Alex Waibel, Le Hai-son, Thomas Lavergne, Alexandre Allauzen, Bianka Buschbeck, Josep Maria Crego, Jean Senellart | Proceedings of the 7th Workshop on Statistical Machine Translation, Canada, June 7-8, 2012 | 
| The IWSLT 2011 Evaluation Campaign on Automatic Talk Translation | Marcello Federico, Sebastian Stüker, Luisa Bentivogli, Michael Paul, Mauro Cettolo, Teresa Herrmann, Jan Niehues and Giovanni Moretti | Proceedings of the LREC 2012, May 23-24, 2012 | 
| The KIT Lecture Corpus for Speech Translation | Sebastian Stüker, Florian Kraft, Christian Mohr, Teresa Herrmann, Eunah Cho, Alex Waibel | Proceedings of LREC 2012, May 23-25, 2012 | 
| A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation | Michael Heck, Sebastian Stüker, Alex Waibel | Proceedings of the 2012 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), March 25-30, 2012 | 
| Blind Dereverberation of Sinusoid Signals using PLL-based combined Phase and Amplitude Analysis | Ralf Huber, Florian Kraft, Alex Waibel | ICASSP 2012, Kyoto, Japan, 25.-30.3.2012 | 
| Unsupervised Vocabulary Selection for Real-Time Speech Recognition of Lectures | Paul Märgner, Alex Waibel, Ian Lane | Proceedings of the 2012 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), March 25-30, 2012 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2011 evaluation campaign | Marcello Federico, Luisa Bentivogli, Michael Paul, Sebastian Stüker | Proceedings of the 8th IWSLT - San Francisco, USA | 
| TriS: A Statistical Sentence Simplifier with Log-linear Models and Margin-based Discriminative Training | Nguyen Bach, Qin Gao, Stephan Vogel, Alex Waibel | Proceedings of the 5th International Joint Conference on Natural Language Processing, IJCNLP 2011, Chiang Mai, Thailand, November 8-12, 2011 | 
| Unsupervised Vocabulary Selection for Domain-Independent Simultaneous Lecture Translation | Paul Märgner, Kevin Kilgour, Ian Lane, Alex Waibel | IWSLT 2011, San Francisco, USA | 
| Multi Domain Language Model Adaptation using Explicit Semantic Analysis | Kevin Kilgour, Florian Kraft, Sebastian Stüker, Alex Waibel | SPECOM'11, 14th International Conference on Speech and Computer, Kazan, Russia, September 2011 | 
| Unsupervised Vocabulary Selection for Domain-Independent Simultaneous Lecture Translation | Paul Märgner, Ian Lane, Alex Waibel | Proceedings of the 13th Machine Translation Summit, Xiamen, China, September 19-23, 2011 | 
| Using Wikipedia to Translate Domain-specific Terms in SMT | Jan Niehues, Alex Waibel | Proceedings of the Eighth International Workshop on Spoken Language Translation (IWSLT), San Francisco, CA, 2011 | 
| The KIT English-French Translation systems for IWSLT 2011 | Mohammed Mediani, Eunah Cho, Jan Niehues, Teresa Herrmann, Alex Waibel | Proceedings of the Eighth International Workshop on Spoken Language Translation (IWSLT), 2011 | 
| Wider Context by Using Bilingual Language Models in Machine Translation | Jan Niehues, Teresa Herrmann, Stephan Vogel, Alex Waibel | Sixth Workshop on Statistical Machine Translation (WMT 2011), Edinburgh, UK, 2011 | 
| The Karlsruhe Institute of Technology Translation Systems for the WMT 2011 | Teresa Herrmann, Mohammed Mediani, Jan Niehues, Alex Waibel | Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011 | 
| Advances on Spoken Language Translation in the Quaero Program | Karim Boudahmane, Bianka Buschbeck, Eunah Cho, Josep Maria Crego, Markus Freitag, Thomas Lavergne, Hermann Ney, Jan Niehues, Stephan Peitz, Jean Senellart, Artem Sokolov, Alex Waibel, Tonio Wandmacher, Joern Wuebker, François Yvon | IWSLT 2011, San Francisco, USA | 
| Joint WMT Submission of the QUAERO Project | Markus Freitag, Gregor Leusch, Joern Wuebker, Stephan Peitz, Hermann Ney, Teresa Herrmann, Jan Niehues, Alex Waibel, Alexandre Allauzen, Gilles Adda, Josep Maria Crego, Bianka Buschbeck, Tonio Wandmacher, Jean Senellart | Proceedings of the Sixth Workshop on Statistical Machine Translation, 2011 | 
| Overview of the IWSLT 2011 Evaluation Campaign | Marcello Federico, Luisa Bentivogli, Michael Paul, Sebastian Stüker | Proceedings of the International Workshop on Speech Translation (IWSLT 2011) | 
| The 2011 KIT QUAERO Speech-to-Text System for Spanish | Kevin Kilgour, Christian Saam, Christian Mohr, Sebastian Stüker, Alex Waibel | Proceedings of the International Workshop on Speech Translation (IWSLT 2011) | 
| The 2011 KIT English ASR System for the IWSLT Evaluation | Sebastian Stüker, Kevin Kilgour, Christian Saam, Alex Waibel | Proceedings of the International Workshop on Speech Translation (IWSLT 2011) | 
| Speech Recognition for Machine Translation in Quaero | Lori Lamel, Sandrine Courcinous, Julien Despres, Jean-Luc Gauvain, Yvan Josse, Kevin Kilgour, Florian Kraft, Le Viet Bac, Hermann Ney, Markus Nußbaum-Thom, Ilya Oparin, Tim Schlippe, Ralf Schlüter, Tanja Schultz, Thiago Fraga Da Silva, Sebastian Stüker, Martin Sundermeyer, Bianca Vieru, Ngoc Thang Vu, Alexander Waibel, Cécile Woehrling | Proceedings of the International Workshop on Speech Translation (IWSLT 2011) | 
| Quaero 2010 Speech-to-Text Evaluation Systems | Sebastian Stüker, Kevin Kilgour | High Performance Computing in Science & Engineering 2011, The 14th Results and Review Workshop of the HLRS | 
| The 2011 KIT QUAERO Speech-to-Text System for the Russian Language | Yury Titov, Kevin Kilgour, Sebastian Stüker, Alex Waibel | Proceedings of the 14th International Conference “Speech and Computer” (SPECOM’2011), Kazan, Russian Federation | 
| Towards Context-dependent Phonetic Spelling Error Correction in Children’s Freely Composed Text for Diagnostic and Pedagogical Purposes | Sebastian Stüker, Johanna Fay, Kay Berkling | Proceedings of the 12th Annual Conference of the International Speech Communication Association (INTERSPEECH 2011), Florence, Italy | 
| Speech Technology-based Framework for Quantitative Analysis of German Spelling Errors in Freely Composed Children’s Texts | Kay Berkling, Johanna Fay, Sebastian Stüker | The 2011 Workshop of the ISCA Special Interest Group on Speech and Language Technology in Education (SLaTE 2011) | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2010 evaluation campaign | Michael Paul, Marcello Federico, Sebastian Stüker | Proceedings of the IWSLT 2010, Paris - France | 
| Jibbigo: Speech-to-Speech Translation on Mobile Devices | Matthias Eck, Ian Lane, Ying Zhang, Alexander Waibel | Proceedings of the Spoken Language Technology Workshop (SLT), CA, USA, Dec 12-15, 2010 | 
| The KIT Translation system for IWSLT 2010 | Jan Niehues, Mohammed Mediani, Teresa Herrmann, Michael Heck, Christian Herff, Alex Waibel | Proceedings of IWSLT, 2010 | 
| Towards Social Integration of Humanoid Robots by Conversational Concept Learning | Florian Kraft, Kevin Kilgour, Rainer Saam, Sebastian Stüker, Matthias Wölfel, Tamim Asfour, Alex Waibel | 10th IEEE-RAS International Conference on Humanoid Robots (HUMANOIDS), December 2010 | 
| Overview of the IWSLT 2010 Evaluation Campaign | Michael Paul, Marcello Federico, Sebastian Stüker | International Workshop on Spoken Language Translation, December 2010 | 
| Quaero Speech-to-Text and Text Translation Evaluation Systems | Sebastian Stüker, Kevin Kilgour, Jan Niehues | High Performance Computing in Science & Engineering 2010, The 13th Results and Review Workshop of the HLRS, October 2010 | 
| Spoken News Queries over the World Wide Web | Sebastian Stüker, Michael Heck, Katja Renner, Alex Waibel | ACM Workshop on Searching Spontaneous Conversational Speech, October 2010 | 
| Rapid Development of Speech Translation Using Consecutive Interpretation | Matthias Paulik, Alex Waibel | 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, Chiba, Japan, September 26-30, 2010 | 
| Named-Entity Projection and Data-Driven Morphological Decomposition for Field Maintainable Speech-to-Speech Translation Systems | Ian R. Lane, Alex Waibel | 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, Chiba, Japan, September 26-30, 2010 | 
| Study of Human Gesture Recognition by Integration Face and Hand Motion Features | Luo Dan, Hazim Kemal Ekenel, Jun Ohya | 9th Forum on Information Technology 2010, FIT2010, Kyushu, Japan, 07. September 2010 | 
| EMDC: A Semi-supervised Approach for Word Alignment | Qin Gao, Stephan Vogel | International Conference on Computational Linguistics 2010, COLING, Beijing, China, August 2010 | 
| The Karlsruhe Institute for Technology Translation System for the ACL-WMT 2010 | Jan Niehues, Teresa Herrmann, Mohammed Mediani, Alex Waibel | Proceedings of the 5th Workshop on Statistical Machine Translation, 2010 | 
| A Semi-supervised Word Alignment Algorithm with Partial Manual Alignments | Qin Gao, Nguyen Bach, Stephan Vogel | ACL 2010, Uppsala, Sweden, July 2010 | 
| Consensus Versus Expertise: A Case Study of Word Alignment with Mechanical Turk | Qin Gao, Stephan Vogel | Creating Speech and Language Data With Amazon’s Mechanical Turk, NAACL 2010 - Workshop, Los Angeles, CA, US, June 2010 | 
| Tools for Collecting Speech Corpora via Mechanical-Turk | Ian Lane, Alex Waibel, Matthias Eck, Kay Rottmann | Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk, CSLDAMT 2010, Los Angeles, California, USA, June 6, 2010 | 
| Domain Adaptation in Statistical Machine Translation using Factored Translation Models | Jan Niehues, Alex Waibel | 14th Annual Conference of the European Association for Machine Translation, EAMT 2010, Saint-Raphaël, France, 27-28 May 2010 | 
| Speaker Identification with Distant Microphone Speech | Kornel Laskowski, Qin Jin, Tanja Schultz | International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2010, Dallas, Texas, 14. April 2010 | 
| Spoken Language Translation from parallel Speech Audio: Simultaneous Interpretation as SLT Training Data | M. Paulik, A. Waibel | IEEE International Conference on Acoustic Speech and Signal Processing, ICASSP 2010, Dallas, 14-19 March 2010 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2009 evaluation campaign | Michael Paul | Proceedings of IWSLT 2009, Tokyo - Japan | 
| Pronunciation Modeling for Dialectal Arabic Speech Recognition | H. Al-Haj, R. Hsiao, I. Lane, A.W. Black, A. Waibel | IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009, Merano, Italy, 13-17 December 2009 | 
| Automatic Translation from Parallel Speech: Simultaneous Interpretation as MT Training Data | M. Paulik, A. Waibel | IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009, Merano, Italy, 13-17 December 2009 | 
| Open-Set Face Recognition-based Visitor Interface System | Hazim K. Ekenel, Lorant Szasz-Toth, Rainer Stiefelhagen | 7th International Conference on Computer Vision Systems, LNCS, Vol. 5815, pp. 43-52, Liege, Belgium, October 13-15, 2009. | 
| Human Translations Guided Language Discovery for ASR Systems | Sebastian Stüker, Laurent Besacier, Alex Waibel | Interspeech, Interspeech 2009, Brighton, U.K., 10. September 2009 | 
| Speaker Identification using Warped MVDR Cepstral Features | Matthias Wölfel, Yang Qian, Qin Jin, Tanja Schultz | Interspeech, Interspeech 2009, Brighton, U.K., 06. September 2009 |
| Improving Speaker Segmentation via Speaker Identification and Text Segmentation | Qin Jin, Tanja Schultz, Runxin Li | Interspeech, Interspeech 2009, Brighton, U.K., 06. September 2009 |
| Source-side Dependency Tree Reordering Models with Subtree Movements and Constraints | Nguyen Bach, Qin Gao, Stephan Vogel | Machine Translation Summit, MT Summit XII 2009, Ottawa, Canada, 01. August 2009 |
| Virtual Babel: Towards Context-Aware Machine Translation in Virtual Worlds | Ying (Joy) Zhang, Nguyen Bach | Machine Translation Summit, MT Summit XII 2009, Ottawa, Canada, 01. August 2009 |
| Reliable evaluation of multimodal dialog systems | Florian Metze, Ina Wechsung, Stefan Schaffer, Julia Seebode, Sebastian Möller | Human Computer Interaction International, HCII 2009, San Diego, USA, 19. July 2009 |
| Usability evaluation of multimodal interfaces: Is the whole the sum of its parts? | Ina Wechsung, Klaus-Peter Engelbrecht, Stefan Schaffer, Julia Seebode, Florian Metze, Sebastian Möller | Human Computer Interaction International, HCII 2009, San Diego, USA, 19. July 2009 |
| Porting Speech Recognition Systems to New Languages Supported by Articulatory Feature Models | Sebastian Stüker, Alex Waibel | Speech and Computer, SPECOM 2009, St. Petersburg, Russia, 21. May 2009 |
| Cohesive Constraints in A Beam Search Phrase-based Decoder | Nguyen Bach, Stephan Vogel | North American Association for Computational Linguistics Human Language Technologies Conference, NAACL-HLT 2009, Boulder, USA, 01. May 2009 |
| Incremental Adaptation of Speech-to-Speech Translation | Nguyen Bach, Roger Hsiao, Matthias Eck, Paisarn Charoenpornsawat, Stephan Vogel, Tanja Schultz, Ian R. Lane, Alex Waibel | North American Association for Computational Linguistics Human Language Technologies Conference, NAACL-HLT 2009, Boulder, USA, 01. May 2009 |
| End-to-End Evaluation in Simultaneous Translation | O. Hamon, C. Fügen, D. Mostefa, V. Arranz, M. Kolss, K. Choukri, A. Waibel | 12th Conference of the European Chapter of the Association for Computational Linguistics, EACL 2009, Athens, Greece, 30 March - 3 April 2009 | 
| Detecting real life anger | Felix Burkhardt, Tim Polzehl, Joachim Stegmann, Florian Metze, Richard Huber | International Conference on Acoustics, Speech, and Signal Processing 2009, ICASSP 2009, Taipei, Taiwan, 19. April 2009 |
| The I4U System In NIST 2008 Speaker Recognition Evaluation | Li Haizhou, Ma Bin, K.-A. Lee, Hanwu Sun, Donglai Zhu, Khe Chai Sim, Changhuai You, Rong Tong, Ismo Karkkainen, Chien-Lin Huang, Vladimir Pervouchine, Wu Guo, Yijie Li, Lirong Dai, M. Nosratighods, T. Tharmarajah, Julien Epps, E. Ambikairajah, E.-S. Chng, Qin Jin, Tanja Schultz | International Conference on Acoustics, Speech, and Signal Processing 2009, ICASSP 2009, Taipei, Taiwan, 19. April 2009 |
| Voice Convergin: Speaker De-Identification by Voice Transformation | Qin Jin, Tanja Schultz, Alan Black | International Conference on Acoustics, Speech, and Signal Processing 2009, ICASSP 2009, Taipei, Taiwan, 19. April 2009 |
| Modeling Instantaneous Intonation for Speaker Identification Using the Fundamental Frequency Variation Spectrum | Kornel Laskowski, Qin Jin | International Conference on Acoustics, Speech, and Signal Processing 2009, ICASSP 2009, Taipei, Taiwan, 19. April 2009 |
| Detecting Bandlimited Audio in Broadcast Television Shows | Mark Fuhs, Qin Jin, Tanja Schultz | International Conference on Acoustics, Speech, and Signal Processing 2009, ICASSP 2009, Taipei, Taiwan, 19. April 2009 |
| The Universität Karlsruhe Translation System for the EACL-WMT 2009 | Jan Niehues, Teresa Herrmann, Muntsin Kolss, Alex Waibel | Proceedings of the 4th Workshop on Statistical Machine Translation, 2009 |
| A POS-Based Model for Long-Range Reorderings in SMT | Jan Niehues, Muntsin Kolss | Proceedings of the 4th Workshop on Statistical Machine Translation, 2009 |
| User Characteristics And Usage Of Gesture And Speech In A Smart Office Environment | Stefan Schaffer, Julia Seebode, Ina Wechsung, Florian Metze, Christine Kühnel | The 8th International Gesture Workshop 2009, GW 2009, Bielefeld, Germany, 27. February 2009 |
| Speaker De-identification via Voice Transformation | Qin Jin, Alan Black, Tanja Schultz | Automatic Speech Recognition and Understanding, ASRU 2009, Merano, Italy, 01. January 2009 |
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2008 evaluation campaign | Michael Paul | Proceedings of the IWSLT 2008, Hawaii - USA | 
| A joint particle filter and multi-step linear prediction framework to provide enhanced speech features prior to automatic recognition | Matthias Wölfel | Hands-Free Speech Communication and Microphone Arrays, 2008. HSCMA 2008 | 
| Integration of the predicted walk model estimate into the particle filter framework | Matthias Wölfel | Acoustics, Speech and Signal Processing, 2008. ICASSP 2008. | 
| Confidence Based Multimodal Fusion for Person Identification | P.W.L. Große, H. Holzapfel, A. Waibel | 16th ACM International Conference on Multimedia, Vancouver, Canada, 27-31 October 2008 | 
| Speech Processing in Support of Human-Human Communication | A. Waibel | Second International Symposium on Universal Communication, ISUC 2008, Osaka, Japan, December 2008 | 
| Simultaneous Machine Translation of German Lectures into English: Investigating Research Challenges for the Future | Matthias Wölfel, Muntsin Kolss, Florian Kraft, Jan Niehues, Alex Waibel | IEEE Workshop on Spoken Language Technology, SLT 2008, Goa, India, 15. December 2008 |
| Modelling Multimodal User ID in Dialogue | Hartwig Holzapfel, Alex Waibel | IEEE Workshop on Spoken Language Technology, SLT 2008, Goa, India, 01. March 2008 |
| Handwritten Chinese Character Recognition Using Local Discriminant Projection with Prior Information | H. Zhang, W. Deng, J. Guo, Jie Yang | International Conference on Pattern Recognition, ICPR 2008, Tampa, United States, 08. December 2008 |
| Layered Object Categorization | Jie Yang, Nanning Zheng, Hong Cheng, Lei Yang | International Conference on Pattern Recognition, ICPR 2008, Tampa, United States, 08. December 2008 |
| Correlated Bigram LSA for Unsupervised LM Adaptation | Tanja Schultz, Yik-Cheung Tam | Neural Information Processing Systems, NIPS 2008, Vancouver, Canada, 06. December 2008 |
| Universität Karlsruhe (TH) at TRECVID 2008 | Hazim Kemal Ekenel, Rainer Stiefelhagen | TRECVID Evaluation Workshop, TRECVID 2008, Gaithersburg, MD, USA, 01. November 2008 |
| Monitoring Visual Focus of Attention via Local Discriminant Projection | H. Zhang, L. Toth, W. Deng, J. Guo, Jie Yang | ACM International Conference on Multimedia Information Retrieval 2008, MIR 2008, Vancouver, Canada, 30. October 2008 |
| Object Fingerprints for Content Analysis with Applications to Street Landmark Localization | W. Wen, Jie Yang | ACM international conference on Multimedia 2008, ACM Multimedia 2008, Vancouver, Canada, 27. October 2008 |
| Probabilistic Integration of Sparse Audio-Visual Cues for Identity Tracking | Keni Bernardin, Rainer Stiefelhagen, Alex Waibel | ACM international conference on Multimedia 2008, ACM Multimedia 2008, Vancouver, Canada, 27. October 2008 |
| A Virtual Secretary In a Smart Office Environment. | Maria Danninger, Rainer Stiefelhagen | ACM international conference on Multimedia 2008, ACM Multimedia 2008, Vancouver, Canada, Oct. 2008 |
| Wider Pipelines: N-Best Alignments and Parses in MT Training | Ashish Venugopal, Andreas Zollmann, Noah Smith, Stephan Vogel | AMTA 2008, Waikiki, United States, 25. October 2008 |
| Combination of Machine Translation Systems via Hypothesis Selection from Combined n-best lists | Almut Silja Hildebrand, Stephan Vogel | AMTA 2008, Waikiki, United States, 21. October 2008 |
| Diacritization as a machine translation problem and as a sequence labeling problem | Tim Schlippe, ThuyLinh Nguyen, Stephan Vogel | AMTA 2008, Waikiki, United States, 21. October 2008 |
| The CMU Syntax-Augmented Machine Translation System: SAMT on Hadoop with N-best alignments | Andreas Zollmann, Ashish Venugopal, Stephan Vogel | International Workshop for Spoken Language Technologies, IWSLT 2008, Waikiki, United States, 22. October 2008 |
| Simultaneous German-English Lecture Translation | Muntsin Kolss, Matthias Wölfel, Florian Kraft, Jan Niehues, Alex Waibel | International Workshop for Spoken Language Technologies, IWSLT 2008, Waikiki, United States, 21. October 2008 |
| Automatic Calibration of Camera Networks Based on Local Motion Features (Best Paper Award) | Keni Bernardin, Rainer Stiefelhagen | Workshop on Multi-camera and Multi-modal Sensor Fusion Algorithms and Applications, in conjunction with ECCV'08, Marseille, France, 18. October 2008 |
| Dynamic Integration of Generalized Cues for Person Tracking | Kai Nickel, Rainer Stiefelhagen | European Conference on Computer Vision 2008, ECCV 2008, Marseille, France, 12. October 2008 |
| Deducing the Visual Focus of Attention from Head Pose Estimation in Dynamic Multi-view Meeting Scenarios | Michael Voit, Rainer Stiefelhagen | ACM & IEEE International Conference on Multimodal Interfaces, ICMI 2008, Chania, Crete, Greece, 01. October 2008 |
| Stream Decoding for Simultaneous Spoken Language Translation | Muntsin Kolss, Stephan Vogel, Alex Waibel | InterSpeech 2008, Brisbane, Australia, 26. September 2008 |
| Robust Far-Field Speaker Identification Under Mismatched Conditions | Qin Jin, Tanja Schultz | InterSpeech 2008, Brisbane, Australia, 22. September 2008 |
| The CMU-InterACT 2008 Mandarin Transcription System | Roger Hsiao, Mark Fuhs, Yik-Cheung Tam, Qin Jin, Tanja Schultz | InterSpeech 2008, Brisbane, Australia, 22. September 2008 |
| Class-Based Statistical Machine Translation for Field Maintainable Speech-to-Speech Translation | Alex Waibel, Ian R. Lane | InterSpeech 2008, Brisbane, Australia, 22. September 2008 |
| Lightly Supervised Acoustic Model Training on EPPS Recordings | Matthias Paulik, Alex Waibel | InterSpeech 2008, Brisbane, Australia, Sept. 2008 |
| Multilingual Acoustic Features for Porting Speech Recognition Systems to New Languages | Sebastian Stüker | Elektronische Sprach Signalverarbeitung, ESSV 2008, Frankfurt, Germany, 09. September 2008 |
| Verbesserung der automatischen Transkription von englischen Wörtern in deutschen Vorlesungen | Sebastian Ochs, Matthias Wölfel, Sebastian Stüker | Elektronische Sprach Signalverarbeitung, ESSV 2008, Frankfurt, Germany, 09. September 2008 |
| Tracking Identities and Attention in Smart Environments - Contributions and Progress in the CHIL Project | Rainer Stiefelhagen, Keni Bernardin, Hazim Kemal Ekenel, Michael Voit | 8th IEEE Int. Conference on Face and Gesture Recognition, IEEE 2008, Amsterdam, Netherlands, 01. September 2008 |
| Visual Focus of Attention in Dynamic Meeting Scenarios, Springer LNCS 5237 | Michael Voit, Rainer Stiefelhagen | Proceedings of the Fifth Workshop on Machine Learning and Multimodal Interaction, Utrecht, The Netherlands, 01. September 2008 |
| A Systematic Comparison of Phrase-Based, Hierarchical and Syntax-Augmented Statistical MT | Andreas Zollmann, Ashish Venugopal, Franz Och, Jay Ponte | International Conference on Computational Linguistics 2008, COLING, Manchester, United Kingdom, 18. August 2008 |
| Semi-Supervised Learning of Object Categories from Paired Local Features | W. Wu, Jie Yang | ACM International Conference on Image and Video Retrieval, CIVR 2008, Niagara Falls, Canada, 07. July 2008 |
| A Deformable Local Image Description | Hong Cheng, Zicheng Liu, Nanning Zheng, Jie Yang | International Conference on Computer Vision and Pattern Recognition 2008, CVPR 2008, Anchorage, United States, 24. June 2008 |
| Improving Word Alignment with Language Model Based Confidence Scores | Nguyen Bach, Qin Gao, Stephan Vogel | Third Workshop on Statistical Machine Translation, ACL08-SMT, Columbus, Ohio, USA, 19. June 2008 |
| Discriminative Word Alignment via Alignment Matrix Modeling | Stephan Vogel, Jan Niehues | Association for Computational Linguistics: Human Language Technologies, ACL-08: HLT, Columbus, United States, 19. June 2008 |
| Recent improvements in the CMU large-scale Chinese-English SMT system | Almut Silja Hildebrand, Kay Rottmann, Timothy Notari, Qin Gao, Sanjika Hewavitharana, Nguyen Bach, Stephan Vogel | Association for Computational Linguistics: Human Language Technologies, ACL-08: HLT, Columbus, United States, 16. June 2008 |
| Communicating unknown words in machine Translation | Alex Waibel, Stephan Vogel, Matthias Eck | Language Resources and Evaluation Conference, LREC 2008, Marrakech, Morocco, 26. May 2008 |
| Towards Human Translations Guided Language Discovery for ASR | Sebastian Stüker, Alex Waibel | International Workshop on Spoken Languages Technologies for Under-Resourced Languages, SLTU 2008, Hanoi, Vietnam, 05. May 2008 |
| Integrating Thai Grapheme Based Acoustic Models into the ML-MIX Framework - For Language Independent and Cross-Language ASR | Sebastian Stüker | International Workshop on Spoken Languages Technologies for Under-Resourced Languages, SLTU 2008, Hanoi, Vietnam, 05. May 2008 |
| Multilingual Spoken Language Processing | Tanja Schultz, Pascale Fung | International Workshop on Spoken Languages Technologies for Under-Resourced Languages, SLTU 2008, Hanoi, Vietnam, 01. May 2008 |
| Spoken Language Translation | Alex Waibel, Christian Fügen | International Workshop on Spoken Languages Technologies for Under-Resourced Languages, SLTU 2008, Hanoi, Vietnam, 01. May 2008 |
| The CLEAR 2007 Evaluation, Springer Lecture Notes in Computer Science, No. 4625, pp. 3-34 | Rainer Stiefelhagen, Keni Bernardin, John Garofolo | Proceedings of the International Evaluation Workshops, CLEAR 2007 and RT 2007, Baltimore, MD, USA, 01. May 2008 |
| Multi-level Particle Filter Fusion of Features and Cues for Audio-visual Person Tracking, Springer Lecture Notes in Computer Science, No. 4625, pp. 70-81 | Keni Bernardin, Tobias Gehrig, Rainer Stiefelhagen | Proceedings of the International Evaluation Workshops, CLEAR 2007 and RT 2007, Baltimore, MD, USA, 01. May 2008 |
| ISL Person Identification Systems in the CLEAR 2007 Evaluations, Springer Lecture Notes in Computer Science, No. 4625, pp. 256-265 | Hazim Kemal Ekenel, Qin Jin, Rainer Stiefelhagen | Proceedings of the International Evaluation Workshops, CLEAR 2007 and RT 2007, Baltimore, MD, USA, 01. May 2008 |
| Head Pose Estimation in Single- and Multi-view Environments - Results on the CLEAR '07 Benchmarks, Springer Lecture Notes in Computer Science, No. 4625, pp. 307-316 | Michael Voit, Kai Nickel, Rainer Stiefelhagen | Proceedings of the International Evaluation Workshops, CLEAR 2007 and RT 2007, Baltimore, MD, USA, 01. May 2008 |
| Automatic Dietary Assessment from Fast Food Categorization | Lei Yang, Nanning Zheng, Hong Cheng, J.D. Fernstrom, M. Sun, Jie Yang | IEEE 34th Annual Northeast Bioengineering Conference 2008, Providence, United States, 01. April 2008 |
| Privacy Protection in an Electronic Chronicle System | Jie Yang, Simon Greiner | IEEE 34th Annual Northeast Bioengineering Conference 2008, Providence, United States, 01. April 2008 |
| Local Binary Pattern Domain Local Appearance Face Recognition | Hazim Kemal Ekenel, Rainer Stiefelhagen, A Ercil | IEEE Signal Processing, Communication and Applications Conference, SIU 2008, Didim, Turkey, 01. April 2008 |
| Multi-stream Gaussian Mixture Model Based Facial Feature Localization | Kenichi Kumatani, Hazim Kemal Ekenel, Rainer Stiefelhagen, A Ercil | IEEE Signal Processing, Communication and Applications Conference, SIU 2008, Didim, Turkey, 01. April 2008 |
| Sentence Segmentation and Punctuation Recovery for SLT | Sharath Rao, Ian R. Lane, Stephan Vogel, Tanja Schultz | IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP 2008, Las Vegas, NV, USA, 31. March 2008 |
| Modified Polyphone Decision Tree Specialization for Porting Multilingual Grapheme Based ASR Systems to New Languages | Sebastian Stüker | IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP 2008, Las Vegas, NV, USA, 30. March 2008 |
| Is Voice Transformation a Threat to Speaker Identification? | Qin Jin, Tanja Schultz, Alan Black, Arthur Toth | IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP 2008, Las Vegas, NV, USA, 30. March 2008 |
| Extracting Clues from Human Interpreter Speech for Spoken Language Translation | Alex Waibel | IEEE International Conference on Acoustic, Speech, and Signal Processing, ICASSP 2008, Las Vegas, NV, USA, 01. January 2008 |
| Learning and Verification of Names with Multimodal User ID in Dialog | Hartwig Holzapfel, Alex Waibel | International Conference on Cognitive Systems, CogSys 2008, Karlsruhe, Germany, Apr. 2008 |
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2007 evaluation campaign | Cameron S. Fordyce | IWSLT 2007 | 
| Consolidation Based Speech Translation | Chiori Hori, Bing Zhao, Stephan Vogel, Alex Waibel | IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2007, Kyoto, Japan, December 9-13, 2007 | 
| Continuous Electromyographic Speech Recognition with a Multi-Stream Decoding Architecture | Szu-Chen Stan Jou, Tanja Schultz, Alex Waibel | IEEE International Conference on Acoustics, Speech and Signal Processing, 2007. ICASSP 2007 | 
| An Un-Awarely Collected Real World Face Database: The ISL-Door Face Database | Hazim Kemal Ekenel, Rainer Stiefelhagen | In Proc. of International Conference on Computer Vision Systems, ICVS 2007, Bielefeld, Germany, March 2007 | 
| Humanoid Robot noise suppression by particle filters for improved automatic Speech Recognition Accuracy | Florian Kraft, Matthias Wölfel | Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems. San Diego, CA, USA, Oct 29-Nov 2, 2007 | 
| The ISL RT-07 Speech-to-Text System | Matthias Wölfel, Sebastian Stüker, Florian Kraft | Proceedings of the Rich Transcription 2007 Meeting Recognition Evaluation Workshop (RT-07), Baltimore, USA, 2007 | 
| Fast Audio-Visual Multi-Person Tracking for a Humanoid Stereo Camera Head | Kai Nickel, Rainer Stiefelhagen | IEEE-RAS Intl. Conference on Humanoid Robots 2007, Humanoids 2007, Pittsburgh, USA, 29. November 2007 |
| Semantic Extensions of the Ephyra QA system in TREC 2007 | Nico Schläfer, Guido Sautter, Jeongwoo Ko, Justin Betteridge, Manas Pathak, Eric Nyberg | Text Retrieval Conference 2007, TREC 2007, Gaithersburg, United States, 06. November 2007 |
| The CMU-UKA Statistical Machine Translation Systems For IWSLT 2007 | Ian R. Lane, Andreas Zollmann, ThuyLinh Nguyen, Nguyen Bach, Ashish Venugopal, Stephan Vogel, Kay Rottmann, Ying (Joy) Zhang, Alex Waibel | International Workshop on Spoken Language Translation, IWSLT 2007, Trento, Italy, 15. October 2007 |
| The CMU TransTac 2007 Eyes-free and Hands-free Two-way Speech-to-Speech Translation System | Nguyen Bach, Matthias Eck, Paisarn Charoenpornsawat, Thilo Kohler, Sebastian Stüker, ThuyLinh Nguyen, Roger Hsiao, Alex Waibel, Stephan Vogel, Tanja Schultz, Alan Black | International Workshop on Spoken Language Translation, IWSLT 2007, Trento, Italy, 15. October 2007 |
| Improving Spoken Language Translation by Automatic Disfluency Removal: Evidence from Conversational Speech Transcripts | Sharath Rao, Ian R. Lane, Tanja Schultz | Machine Translation Summit XI, MT Summit XI, Copenhagen, Denmark, 09. October 2007 |
| Estimating Phrase Pair Relevance for Translation Model Pruning | Matthias Eck, Stephan Vogel, Alex Waibel | Machine Translation Summit XI, MT Summit XI, Copenhagen, Denmark, 10. September 2007 | 
| Word Reordering in Statistical Machine Translation with a POS-Based Distortion Model | Kay Rottmann, Stephan Vogel | The 11th International Conference on Theoretical and Methodological Issues in Machine Translation (TMI-07), Skövde, Sweden, Sept. 7-9, 2007 | 
| Channel Selection by Class Separability Measures for Automatic Transcriptions on Distant Microphones | Matthias Wölfel | Interspeech, Interspeech 2007, Antwerp, Belgium, August 27-31, 2007 | 
| Behavior Models for Learning and Receptionist Dialogs | Hartwig Holzapfel, Alex Waibel | Interspeech, Interspeech 2007, Antwerp, Belgium, August 27-31, 2007 | 
| Computer-Supported Human-Human Multilingual Communication | Alex Waibel, Keni Bernardin, Matthias Wölfel | Interspeech, Interspeech 2007, Antwerp, Belgium, 27. August 2007 | 
| The ISL 2007 English Speech Transcription System for European Parliament Speeches | Sebastian Stüker, Christian Fügen, Florian Kraft, Matthias Wölfel | Interspeech, Interspeech 2007, Antwerp, Belgium, 27. August 2007 | 
| The Influence of Utterance Chunking on Machine Translation Performance | Christian Fügen, Muntsin Kolss | Interspeech, Interspeech 2007, Antwerp, Belgium, 27. August 2007 | 
| Handling OOV Words in Arabic ASR via Flexible Morphological Constraints | Nguyen Bach, Mohamed Noamany, Ian R. Lane, Tanja Schultz | Interspeech, Interspeech 2007, Antwerp, Belgium, 27. August 2007 | 
| Optimizing Sentence Segmentation for Spoken Language Translation | Sharath Rao, Ian R. Lane, Tanja Schultz | Interspeech, Interspeech 2007, Antwerp, Belgium, 27. August 2007 | 
| Bilingual-LSA Based LM Adaptation for Spoken Language Translation | Yik-Cheung Tam, Ian R. Lane, Tanja Schultz | 45th Annual Meeting of the Association for Computational Linguistics, ACL 2007, Prague, Czech Republic, 23. July 2007 | 
| The Syntax Augmented MT (SAMT) System at the Shared Task for the 2007 ACL Workshop on Statistical Machine Translation | Ashish Venugopal, Stephan Vogel | ACL 2007 Second Workshop on Statistical Machine Translation, ACL SMT WS 2007, Prague, Czech Republic, 23. June 2007 | 
| The ISL Phrase-Based MT System for the 2007 ACL Workshop on Statistical Machine Translation | Kay Rottmann, Almut Silja Hildebrand, Stephan Vogel | ACL 2007 Second Workshop on Statistical Machine Translation, ACL SMT WS 2007, Prague, Czech Republic, 23. June 2007 | 
| Head Pose Estimation in Single- and Multi-view Environments - Results on the CLEAR'07 Benchmarks | Michael Voit, Kai Nickel, Rainer Stiefelhagen | CLEAR 2007 Evaluation Workshop, CLEAR 2007, Baltimore, USA, 08. May 2007 | 
| Translation Model Pruning via Usage Statistics for Statistical Machine Translation | Matthias Eck, Stephan Vogel, Alex Waibel | NAACL Human Language Technology Conference, HLT-NAACL 2007, Rochester, NY, USA, 27. April 2007 | 
| An Efficient Two-Pass Approach to Synchronous-CFG Driven Statistical MT | Ashish Venugopal, Stephan Vogel | NAACL Human Language Technology Conference, HLT-NAACL 2007, Rochester, NY, USA, 22. April 2007 | 
| A Log-Linear Block Transliteration Model Based on Bi-Stream HMMs | Bing Zhao, Ian R. Lane, Stephan Vogel, Nguyen Bach | NAACL Human Language Technology Conference, HLT-NAACL 2007, Rochester, NY, USA, 02. January 2007 | 
| Correlated Latent Semantic Model for Unsupervised LM Adaptation | Yik-Cheung Tam, Tanja Schultz | International Conferences on Acoustic Speech and Signal Processing, ICASSP 2007, Honolulu, HI, USA, 15. April 2007 | 
| Multi-Stream Articulatory Feature Classifiers for Surface Electromyographic Continuous Speech Recognition | Szu-Chen Jou, Alex Waibel, Tanja Schultz | International Conferences on Acoustic Speech and Signal Processing, ICASSP 2007, Honolulu, HI, USA, 15. April 2007 | 
| Speech Translation Enhanced ASR for European Parliament Speeches - on the Influence of ASR Performance on Speech Translation | Sebastian Stüker, Matthias Paulik, Muntsin Kolss, Christian Fügen, Alex Waibel | International Conferences on Acoustic Speech and Signal Processing, ICASSP 2007, Honolulu, HI, USA, 31. March 2007 | 
| Multimodal Technologies for Perception of Humans | Rainer Stiefelhagen, John Garofolo | CLEAR Evaluation Workshop, CLEAR Evaluation Workshop 2006, Southampton, UK, 01. January 2007 | 
| Can you Talk or only Touch-Talk? A VoIP-based phone feature for quick, quiet, and private communication | Maria Danninger, L. Takayama, Q. Wang, C. Schultz, J. Beringer, P. Hofmann, F. James, C. Nass | ACM & IEEE International Conference on Multimodal Interfaces, ICMI 2008, Chania, Crete, Greece, 01. January 2007 | 
| Integrating Face-ID into an Interactive Person-ID Learning System | Stephan Könn, Hartwig Holzapfel, Hazim Kemal Ekenel, Alex Waibel | Proceedings of the 5th International Conference on Computer Vision Systems, ICVS 2007, Bielefeld University, Germany, March 21-24, 2007 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2006 evaluation campaign | Michael Paul | IWSLT 2006 | 
| The UKA/CMU Statistical Machine Translation System for IWSLT 2006 | Matthias Eck, Ian R. Lane, Nguyen Bach, Sanjika Hewavitharana, Muntsin Kolss, Bing Zhao, Almut Silja Hildebrand, Stephan Vogel, Alex Waibel | International Workshop on Spoken Language Translation, IWSLT 2006, Kyoto, Japan, 28. November 2006 | 
| The CMU-UKA Syntax Augmented Machine Translation System for IWSLT-06 | Ashish Venugopal, Stephan Vogel | International Workshop on Spoken Language Translation, IWSLT 2006, Kyoto, Japan, 27. November 2006 | 
| The Ephyra QA system at TREC 2006 | Nico Schläfer, Petra Gieselmann, Guido Sautter | TREC2006, TREC2006, Gaithersburg, United States, 14. November 2006 | 
| Multimodal Estimation of User Interruptibility for Smart Mobile Telephones | Robert Malkin, Datong Chen, Jie Yang, Alex Waibel | 8th International Conference on Multimodal Interfaces, ICMI 2006, Banff, Canada, November 2-4, 2006 | 
| Comparing Error-Handling Strategies in Human-Human and Human-Robot Dialogues | Petra Gieselmann | Proceedings of the 8th Conference on Natural, KONVENS 2006, Konstanz, Germany, October 2006 | 
| Natural Human Robot Communication | Christian Fügen, Petra Gieselmann, Hartwig Holzapfel, Florian Kraft | Human Centered Robotic Systems, HCRS, München, Germany, 01. October 2006 | 
| Multi-Source Far-Distance Microphone Selection and Combination for Automatic Transcription of Lectures | Matthias Wölfel, Christian Fügen, Shajith Ikbal, John W. McDonough | Interspeech 2006 - Ninth International Conference on Spoken Language Processing (ICSLP), Pittsburgh, PA, USA, September 17-21, 2006 | 
| Optimizing Components for Handheld Two-way Speech Translation English-Iraqi Arabic System | Roger Hsiao, Ashish Venugopal, Thilo Köhler, Ying Zhang, Paisarn Charoenpornsawat, Andreas Zollmann, Stephan Vogel, Alan W. Black, Tanja Schultz, Alex Waibel | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, September 2006 | 
| Rapid Simulation-Driven Reinforcement Learning of Multimodal Strategies in Human-Robot Interaction | Thomas Prommer, Hartwig Holzapfel, Alex Waibel | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, September 2006 | 
| Unsupervised Language Model Adaptation Using Latent Semantic Marginals | Yik-Cheung Tam, Tanja Schultz | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, 01. September 2006 | 
| A Pattern Learning Approach To Question Answering Within The Ephyra Framework | Nico Schläfer, Petra Gieselmann, Thomas Schaaf, Alex Waibel | Proceedings of the Ninth International Conference on TEXT, SPEECH and DIALOGUE, TSD 2006, Brno, Czech Republic, September 2006 | 
| A Multilingual Expectations Model for Contextual Utterances in Mixed-Initiative Spoken Dialogue | Hartwig Holzapfel, Alex Waibel | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, September 2006 | 
| Sub-Word Unit based Non-Audible Speech Recognition using Surface Electromyography | Matthias Walliczek, Florian Kraft, Szu-Chen Jou, Tanja Schultz, Alex Waibel | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, 01. September 2006 | 
| Towards Continuous Speech Recognition Using Surface Electromyography | Szu-Chen Jou, Tanja Schultz, Matthias Walliczek, Florian Kraft, Alex Waibel | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, 01. September 2006 | 
| Dynamic Extension of a Grammar-based Dialogue System: Constructing an All-Recipes Knowing Robot | Petra Gieselmann, Alex Waibel | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, Sept. 2006 | 
| Advances in Lecture Recognition: The ISL RT-06S Evaluation System | Christian Fügen, Matthias Wölfel, John McDonough, Shajith Ikbal Mohamed, Florian Kraft, Kornel Laskowski, Mari Ostendorf, Sebastian Stüker, Kenichi Kumatani | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, 31. August 2006 | 
| Cross-System Adaptation and Combination for Continuous Speech Recognition: The Influence of Phoneme Set and Acoustic Front-End | Sebastian Stüker, Christian Fügen, Susanne Burger, Matthias Wölfel | International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh, PA, USA, 31. August 2006 | 
| Detection-Assisted Initialization, Adaptation and Fusion of Body Region Trackers for Robust Multiperson Tracking | Keni Bernardin, A. Elbs, Rainer Stiefelhagen | International Conference on Pattern Recognition, ICPR 2006, Hong Kong, China PR, 01. August 2006 | 
| Exploiting High Dimensional Video Features Using Layered Gaussian Mixture Models | D. Chen, Jie Yang | International Conference on Pattern Recognition, ICPR 2006, Hong Kong, China PR, Aug. 2006 | 
| Towards the Application of a Handwriting Interface for Mathematics Learning | L. Anthony, Jie Yang, K.R. Koedinger | IEEE International Conference on Multimedia & Expo 2006, ICME 2006, Toronto, Canada, 01. July 2006 | 
| Directing Attention in Online Aggregate Sensor Streams via Auditory Blind Value Assignment | D. Chen, Jie Yang, Alex Waibel, Rob Malkin | IEEE International Conference on Multimedia & Expo 2006, ICME 2006, Toronto, Canada, 01. July 2006 | 
| A Multimedia System for Route Sharing and Video-based Navigation | W. Wu, Jie Yang, Jing Zhang | IEEE International Conference on Multimedia & Expo 2006, ICME 2006, Toronto, Canada, 01. July 2006 | 
| WEBDOVE: A Web-Based Collaboration System for Physical Tasks | Weiyi Yang, Jiazhi Ou, Y. Rui, Jie Yang | IEEE International Conference on Multimedia & Expo 2006, ICME 2006, Toronto, Canada, 01. July 2006 | 
| People Identification with Limited Labels in Privacy-Protected Video | Y. Chang, R. Yan, D. Chen, Jie Yang | IEEE International Conference on Multimedia & Expo 2006, ICME 2006, Toronto, Canada, 01. July 2006 | 
| Open Domain Speech Translation: From Seminars and Speeches to Lectures | Christian Fügen, Muntsin Kolss, Alex Waibel | TC-Star Workshop on Speech-to-Speech Translation, TC-STAR-WS 2006, Barcelona, Spain, 19. June 2006 | 
| A Robot Learns to Know People - First Contacts of a Robot | Hartwig Holzapfel, Thomas Schaaf, Hazim Kemal Ekenel, Christoph Schaa, Alex Waibel | Proceedings of the 29th Annual German Conference on Artificial Intelligence, KI 2006, Bremen, Germany, June 2006 | 
| The ISL TC-STAR Spring 2006 ASR Evaluation Systems | Sebastian Stüker, Christian Fügen, Shajith Ikbal Mohamed, Qin Jin, Florian Kraft, Martin Raab, Yik-Cheung Tam, Matthias Wölfel | TC-Star Workshop on Speech-to-Speech Translation, TC-STAR-WS 2006, Barcelona, Spain, 19. June 2006 | 
| The ISL Statistical Machine Translation System for the TC-STAR Spring 2006 Evaluation | Muntsin Kolss, Bing Zhao, Stephan Vogel, Ashish Venugopal, Ying (Joy) Zhang | TC-Star Workshop on Speech-to-Speech Translation, TC-STAR-WS 2006, Barcelona, Spain, June 2006 | 
| Bridging the Inflection Morphology Gap for Arabic Statistical Machine Translation | Ashish Venugopal, Stephan Vogel | HLT-NAACL, HLT-NAACL 2006, New York City, NY, USA, June 2006 | 
| Multimodal Identity Tracking in a Smartroom | Keni Bernardin, Hazim Kemal Ekenel, Rainer Stiefelhagen | 3rd IFIP Conference on Artificial Intelligence Applications & Innovations, AIAI 2006, Athens, Greece, 07. June 2006 | 
| Thai Grapheme-Based Speech Recognition | Sanjika Hewavitharana, Paisarn Charoenpornsawat, Tanja Schultz | HLT-NAACL 2006, HLT-NAACL 2006, New York, United States, 05. June 2006 | 
| Tracking of the Articulated Upper Body on Multi-View Stereo Image Sequences | Julius Ziegler, Kai Nickel, Rainer Stiefelhagen | IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006, New York, USA, 01. June 2006 | 
| Robust AAM Fitting by Fusion of Images and Disparity Data | J. Liebelt, Jing Xiao, Jie Yang | IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2006, New York, USA, June 2006 | 
| Block Selection in the Local Appearance-based Face Recognition Scheme | Hazim Kemal Ekenel, Rainer Stiefelhagen | CVPR Biometrics Workshop, CVPR Biometrics Workshop 2006, New York, USA, 01. June 2006 | 
| Analysis of Local Appearance-based Face Recognition: Effects of Feature Selection and Feature Normalization | Hazim Kemal Ekenel, Rainer Stiefelhagen | CVPR Biometrics Workshop, CVPR Biometrics Workshop 2006, New York, USA, 01. June 2006 | 
| Syntax Augmented Machine Translation via Chart Parsing | Andreas Zollmann, Ashish Venugopal | Workshop on Statistical Machine Translation, WSMT 2006, New York City, NY, USA, 31. May 2006 | 
| A Flexible Online Server for Machine Translation Evaluation | Matthias Eck, Stephan Vogel, Alex Waibel | 11th EAMT Conference, EAMT 2006, Oslo, Norway, 31. May 2006 | 
| Annotation and Analysis of Emotionally Relevant Behavior in the ISL Meeting Corpus | Susanne Burger, Kornel Laskowski | 5th International Conference on Language Resources and Evaluation, LREC 2006, Genoa, Italy, 24. May 2006 | 
| Articulatory Feature Classification using Surface Electromyography | Szu-Chen Jou, Lena Maier-Hein, Tanja Schultz, Alex Waibel | International Conference on Acoustics, Speech, and Signal Processing 2006, ICASSP 2006, Toulouse, France, 15. May 2006 | 
| Challenges with Rapid Adaptation of Speech Translation Systems to New Language Pairs | Alan Black, Tanja Schultz | International Conference on Acoustics, Speech, and Signal Processing 2006, ICASSP 2006, Toulouse, France, 15. May 2006 | 
| Unsupervised Learning of Overlapped Speech Model Parameters for Multichannel Speech Activity Detection in Meetings | Kornel Laskowski, Tanja Schultz | International Conference on Acoustics, Speech, and Signal Processing 2006, ICASSP 2006, Toulouse, France, 14. May 2006 | 
| Open Domain Speech Recognition & Translation: Lectures and Speeches | Christian Fügen, Muntsin Kolss, Dietmar Bernreuther, Sebastian Stüker, Stephan Vogel, Alex Waibel | International Conference on Acoustics, Speech, and Signal Processing 2006, ICASSP 2006, Toulouse, France, 01. May 2006 | 
| Competitive Evaluation of Commercially Available Speech Recognizers in Multiple Languages | Zachary Sloane, Susanne Burger, Jie Yang | 5th International Conference on Language Resources and Evaluation, LREC 2006, Genoa, Italy, 01. May 2006 | 
| Multiple Object Tracking Performance Metrics and Evaluation in a Smart Room Environment | Keni Bernardin, A. Elbs, Rainer Stiefelhagen | The Sixth IEEE International Workshop on Visual Surveillance, VS 2006, Graz, Austria, 01. May 2006 | 
| The Connector Service - Predicting Availability in Mobile Contexts | Maria Danninger, Tobias Kluge, E. Robles, L. Takayama, Q. Wang, Rainer Stiefelhagen, C. Nass, Alex Waibel | 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, MLMI 2006, Washington D.C., USA, 01. May 2006 | 
| Speech Recognition in Human Mediated Translation Scenarios | Sebastian Stüker, Christian Fügen | The 13th IEEE Mediterranean Electrotechnical Conference 2006, MELECON 2006, Málaga, Spain, 01. May 2006 | 
| Speech-to-Speech Translation Services for the Olympic Games 2008 | Sebastian Stüker, Chengqing Zong, Jürgen Reichert, Wenjie Cao, Muntsin Kolss, Guodong Xie, Kay Peterson, Peng Ding, Victoria Arranz, Jian Yu, Alex Waibel | 3rd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, MLMI 2006, Washington D.C., USA, 30. April 2006 | 
| Illumination Subspaces based Robust Face Recognition | Daniella Kern, Hazim Kemal Ekenel, Rainer Stiefelhagen | 14th Signal Processing and Communication Applications, Antalya, Turkey, April 17. - 19. 2006 | 
| Speaker Clustering for Multilingual Synthesis | Alan Black, Tanja Schultz | ISCA, ISCA, Stellenbosch, South Africa, 09. April 2006 | 
| Video-Based Face Recognition Evaluation in the CHIL Project – Run 1 | Hazim Kemal Ekenel, A Pnevmatikakis | 7th International Conference Automatic Face and Gesture Recognition, FG2006, Southampton, UK, 01. April 2006 | 
| An Audio-visual Particle Filter for Speaker Tracking on the CLEAR06 Evaluation Dataset | Kai Nickel, Tobias Gehrig, Hazim Kemal Ekenel, John McDonough, Rainer Stiefelhagen | CLEAR Evaluation Workshop, CLEAR Evaluation Workshop 2006, Southampton, UK, 01. April 2006 | 
| Multi- and Single View Multiperson Tracking for Smart Room Environments | Keni Bernardin, Tobias Gehrig, Rainer Stiefelhagen | CLEAR Evaluation Workshop, CLEAR Evaluation Workshop 2006, Southampton, UK, 01. April 2006 | 
| Neural Network-based Head Pose Estimation and Multi-View Fusion | Michael Voit, Kai Nickel, Rainer Stiefelhagen | CLEAR Evaluation Workshop, CLEAR Evaluation Workshop 2006, Southampton, UK, 01. April 2006 | 
| ISL Person Identification Systems in the CLEAR Evaluations | Hazim Kemal Ekenel, Qin Jin | CLEAR Evaluation Workshop, CLEAR Evaluation Workshop 2006, Southampton, UK, 01. April 2006 | 
| Analysis of Local Appearance-based Face Recognition on FRGC 2.0 Database | Hazim Kemal Ekenel, Rainer Stiefelhagen | Face Recognition Grand Challenge Workshop, FRGC 2006, Arlington, USA, 01. March 2006 | 
| MyConnector – Analysis of Context Cues to Predict Human Availability for Communication | Maria Danninger, Tobias Kluge, Rainer Stiefelhagen | International Conference on Multimodal Interfaces, ICMI 2006, Banff, Canada, 01. January 2006 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2005 evaluation campaign | Matthias Eck, Chiori Hori | IWSLT 2005 | 
| Bilingual Word Spectral Clustering for Statistical Machine Translation | Bing Zhao, Eric P. Xing, Alex Waibel | Proceedings of the ACL Workshop on Building and Using Parallel Texts, ParaText 2005, Ann Arbor, Michigan, USA, 2005 | 
| Clarification Questions to Improve Dialogue Flow and Speech Recognition in Spoken Dialogue Systems | U. Krumm, H. Holzapfel, A. Waibel | Proceedings of Interspeech 2005, Lisbon, 2005 | 
| Session Independent Non-Audible Speech Recognition Using Surface Electromyography | L. Maier-Hein, F. Metze, T. Schultz, A. Waibel | Proceedings of the 2005 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2005, Cancun, Mexico, November 27 - December 1, 2005 | 
| Automatically Transcribing Meetings Using Distant Microphones | F. Metze, C. Fügen, Y. Pan, A. Waibel | Proceedings ICASSP, Philadelphia, USA, March 2005 | 
| Speech Translation Enhanced Automatic Speech Recognition | Sebastian Stüker, Christian Fügen, Tanja Schultz, Thomas Schaaf, Alex Waibel | Proceedings of the 2005 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2005, Cancun, Mexico, November 27 - December 1, 2005 | 
| Dynamic Language Model Adaptation using Variational Bayes Inference | Yik-Cheung Tam, Tanja Schultz | Proceedings of the 2005 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2005, Cancun, Mexico, November 27 - December 1, 2005 | 
| A Cognitive Architecture for a Humanoid Robot: A First Approach | C. Burghart, R. Mikut, Rainer Stiefelhagen, T. Asfour, Hartwig Holzapfel, P. Steinhaus, R. Dillmann | IEEE-RAS International Conference on Humanoid Robots 2005, Humanoid 2005, Tsukuba, Japan, 01. December 2005 | 
| Multimodal Context Management within Intelligent Rooms | Petra Gieselmann, Hartwig Holzapfel | Proceedings of the 10th International Conference on Speech and Computer 2005, SPECOM 2005, Patras, Greece, 01. October 2005 | 
| Kalman Filters for Audio-Video Source Localization | Tobias Gehrig, Kai Nickel, Hazim Kemal Ekenel, Ulrich Klee, John McDonough | IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, WASPAA 2005, New York, USA, 01. October 2005 | 
| The Connector - Facilitating Context-aware Communication | Maria Danninger, Gopi Flaherty, Keni Bernardin, Hazim Kemal Ekenel, Thilo Köhler, Rainer Stiefelhagen, Alex Waibel, Rob Malkin | 7th International Conference on Multimodal Interfaces, ICMI 2005, Trento, Italy, 01. October 2005 | 
| Analyzing and Predicting Focus of Attention in Remote Collaborative Tasks | Jiazhi Ou, L. M. Oh, S. R. Fussell, T. Blum, Jie Yang | 7th International Conference on Multimodal Interfaces, ICMI 2005, Trento, Italy, 01. October 2005 | 
| A Joint Particle Filter for Audio-visual Speaker Tracking | Kai Nickel, Tobias Gehrig, Rainer Stiefelhagen, John McDonough | 7th International Conference on Multimodal Interfaces, ICMI 2005, Trento, Italy, 01. October 2005 | 
| Integrating Co-Training and Recognition for Text Detection | W. Wu, D. Chen, Jie Yang | 7th International Conference on Multimodal Interfaces, ICMI 2005, Trento, Italy, Oct. 2005 | 
| Automatic Speech Activity Detection, Source Localization, and Speech Recognition on the Chil Seminar Corpus | D. Macho, J. Padrell, A. Abad, C. Nadeu, J. Hernando, John McDonough, Matthias Wölfel, Ulrich Klee, M. Omologo, A. Brutti, P. Svaizer, G. Potamianos, S.M. Chu | 7th International Conference on Multimodal Interfaces, ICMI 2005, Trento, Italy, Oct. 2005 | 
| A Study of Detecting Social Interaction in a Nursing Home Environment | D. Chen, Jie Yang, Howard Wactlar | IEEE International Workshop on Human-Computer Interaction, HCI 2005, Beijing, China P.R., 01. October 2005 | 
| Online Learning Region Confidences for Object Tracking | D. Chen, Jie Yang | The Second Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, VS-PETS 2005, Beijing, China P.R., 01. October 2005 | 
| Modeling Background from Compressed Video | W. Wang, D. Chen, W. Gao, Jie Yang | The Second Joint IEEE International Workshop on Visual Surveillance and Performance Evaluation of Tracking and Surveillance, VS-PETS 2005, Beijing, China P.R., 01. October 2005 | 
| Low Cost Portability for Statistical Machine Translation based on N-gram Frequency and TF-IDF | Matthias Eck, Stephan Vogel, Alex Waibel | International Workshop on Spoken Language Translation, IWSLT 2005, Pittsburgh, PA, USA, 13. September 2005 | 
| The CMU Statistical Machine Translation System for IWSLT 2005 | Sanjika Hewavitharana, Bing Zhao, Matthias Eck, Chiori Hori, Stephan Vogel, Alex Waibel | International Workshop on Spoken Language Translation, IWSLT 2005, Pittsburgh, PA, USA, Oct. 2005 | 
| Learning a Log-Linear Model with Bilingual Phrase-Pair Features for Statistical Machine Translation | Bing Zhao, Alex Waibel | 4th SIGHAN Workshop on Chinese Language Processing, IJCNLP 2005, Jeju Island, Korea, October 14-15, 2005 | 
| Low Cost Portability for Statistical Machine Translation based on N-Gram Coverage | Matthias Eck, Stephan Vogel, Alex Waibel | 10th Machine Translation Summit, MT-Summit 2005, Phuket, Thailand, 13. September 2005 | 
| Local Appearance based Face Recognition Using Discrete Cosine Transform | Hazim Kemal Ekenel, Rainer Stiefelhagen | 13th European Signal Processing Conference, EUSIPCO 2005, Antalya, Turkey, 01. September 2005 | 
| Feature Weighted Mahalanobis Distance: Improved Robustness for Gaussian Classifiers | Matthias Wölfel, Hazim Kemal Ekenel | 13th European Signal Processing Conference, EUSIPCO 2005, Antalya, Turkey, 01. September 2005 | 
| Spontaneous Speech Consolidation for Language Applications | Chiori Hori, Alex Waibel | 9th European Conference on Speech Communication and Technology 2005, Interspeech 2005, Lisboa, Portugal, September 2005 | 
| Document Driven Machine Translation Enhanced ASR | Matthias Paulik, Christian Fügen, Sebastian Stüker, Tanja Schultz, Thomas Schaaf, Alex Waibel | 9th European Conference on Speech Communication and Technology 2005, Interspeech 2005, Lisboa, Portugal, 01. September 2005 | 
| Rapid Porting of ASR-Systems to Mobile Devices | Thilo Köhler, Christian Fügen, Sebastian Stüker, Alex Waibel | 9th European Conference on Speech Communication and Technology 2005, Interspeech 2005, Lisboa, Portugal, 01. September 2005 | 
| Temporal ICA for Classification of Acoustic Events in a Kitchen Environment | Florian Kraft, Thomas Schaaf, Alex Waibel, Rob Malkin | 9th European Conference on Speech Communication and Technology 2005, Interspeech 2005, Lisboa, Portugal, 01. September 2005 | 
| Frame Based Model Order Selection of Spectral Envelopes | Matthias Wölfel | 9th European Conference on Speech Communication and Technology 2005, Interspeech 2005, Lisboa, Portugal, Sept. 2005 | 
| Combining Multi-Source Far Distance Speech Recognition Strategies: Beamforming, Blind Channel and Confusion Network Combination | Matthias Wölfel, John McDonough | 9th European Conference on Speech Communication and Technology 2005, Interspeech 2005, Lisboa, Portugal, Sept. 2005 | 
| Clustering and Classifying Person Names by Origin | Fei Huang, Stephan Vogel, Alex Waibel | 20th National Conference on Artificial Intelligence, AAAI 2005, Pittsburgh, Pennsylvania, USA, July 9-13, 2005 | 
| Microphone Array Driven Speech Recognition: Influence of Localization on the Word Error Rate | Matthias Wölfel, Kai Nickel, John McDonough | 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, MLMI 2005, Edinburgh, UK, 01. July 2005 | 
| The "FAME" Interactive Space | Florian Metze, Petra Gieselmann, Hartwig Holzapfel, Tobias Kluge, Ivica Rogina, Alex Waibel, J. Crowley, P. Reignier, D Vaufreydaz, F. Berard, B. Cohen, J. Coutaz, S. Rouillard, Victoria Arranz, M. Bertran, H. Rodriguez | 2nd Joint Workshop on Multimodal Interaction and Related Machine Learning Algorithms, MLMI 2005, Edinburgh, UK, 01. July 2005 | 
| Training and Evaluating Error Minimization Decision Rules for Statistical Machine Translation | Ashish Venugopal, Alex Waibel | ACL Workshop on Building and Using Parallel Texts, ACL 2005, New York City, NY, USA, 31. May 2005 | 
| What makes Human-Robot Dialogues struggle? | Petra Gieselmann, Alex Waibel | Proceedings of the Ninth Workshop on the Semantics and Pragmatics of Dialogue 2005, DIALOR 2005, Nancy, France, 01. June 2005 | 
| Multi-modal Person Recognition for Vehicular Applications | H Erdogan, A Ercil, Hazim Kemal Ekenel, S. Y Bilgin, I Eden, M Kirisci | 6th International Workshop on Multiple Classifier Systems, MCS 2005, California, USA, 01. June 2005 | 
| A Generic Face Representation Approach for Local Appearance based Face Verification | Hazim Kemal Ekenel, Rainer Stiefelhagen | 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, San Diego, USA, 01. June 2005 | 
| Adaptation of the Translation Model for Statistical Machine Translation Based on Information Retrieval | Almut Silja Hildebrand, Matthias Eck, Stephan Vogel, Alex Waibel | 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2005, San Diego, USA, 30. May 2005 | 
| Adaptation of the Translation Model for Statistical Machine Translation | Almut Silja Hildebrand, Matthias Eck, Stephan Vogel, Alex Waibel | 10th EAMT Conference, EAMT 2005, Budapest, Hungary, 30. May 2005 | 
| Augmenting a Statistical Translation System with a Translation Memory | Sanjika Hewavitharana, Stephan Vogel, Alex Waibel | 10th Annual Conference of the European Association for Machine Translation, EAMT 2005, Budapest, Hungary, May 30-31, 2005 | 
| First Evaluation of Acoustic Event Classification Systems in the CHIL project | Rob Malkin | Hands-Free Speech Communication and Microphone Arrays 2005, HSCMA 2005, Piscataway, New Jersey, USA, 17. May 2005 | 
| Multi-view Head Pose Estimation using Neural Networks | Michael Voit, Kai Nickel, Rainer Stiefelhagen | Face Processing in Video 2005, FPiV 2005, Victoria, British Columbia, Canada, 01. May 2005 | 
| Lecturer Localization and Identification In A Smart Room | Hazim Kemal Ekenel, Kai Nickel, Rainer Stiefelhagen | Face Processing in Video 2005, FPiV 2005, Victoria, British Columbia, Canada, 01. May 2005 | 
| Towards Development of Multilingual Spoken Dialogue Systems | Hartwig Holzapfel | 2nd Language and Technology Conference, L&T 2005, Poznań, Poland, April 21-23, 2005 | 
| Effects of Task Properties, Partner Actions, and Message Content on Eye Gaze Patterns in a Collaborative Task | Jiazhi Ou, Lui Min Oh, Jie Yang, Susan R. Fussell | Proceedings of the ACM Conference on Human Factors in Computing Systems, CHI 2005, Portland, Oregon, USA, April 2-7, 2005 | 
| Evaluation of Multimodal Input for Entering Mathematical Equations on the Computer | Lisa Anthony, Jie Yang, Kenneth R. Koedinger | Proceedings of the ACM Conference on Human Factors in Computing Systems, CHI 2005, Portland, Oregon, USA, April 2-7, 2005 | 
| Automatic Disfluency Removal on Recognized Spontaneous Speech - Rapid Adaptation to Speaker-dependent Disfluencies | Matthias Honal, Tanja Schultz | Proceedings of the 30th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2005, Philadelphia, Pennsylvania, USA, March 19-23, 2005 | 
| Classifying User Environment for Mobile Applications Using Linear Autoencoding of Ambient Audio | Rob Malkin, Alex Waibel | Proceedings of the 30th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2005, Philadelphia, Pennsylvania, USA, March 19-23, 2005 | 
| Thai Automatic Speech Recognition | Sinaporn Suebvisai, Paisarn Charoenpornsawat, Alan Black, Monika Woszczyna, Tanja Schultz | Proceedings of the 30th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2005, Philadelphia, Pennsylvania, USA, March 19-23, 2005 | 
| Whispery Speech Recognition using Adapted Articulatory Features | Szu-Chen Jou, Tanja Schultz, Alex Waibel | Proceedings of the 30th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2005, Philadelphia, Pennsylvania, USA, March 19-23, 2005 | 
| Image Registration in Uncalibrated Omnidirectional Camera Network | Datong Chen, Jie Yang | Proceedings of the IEEE Workshop on Applications of Computer Vision, WACV 2005, Breckenridge, Colorado, USA, January 5-7, 2005 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Findings of the IWSLT 2004 evaluation campaign | Yasuhiro Akiba, Marcello Federico, Noriko Kando, Hiromi Nakaiwa, Michael Paul, Jun’ichi Tsujii | ISCA Archive | 
| Towards Rapid Language Portability of Speech Processing Systems | Tanja Schultz | Invited Paper, Conference on Speech and Language Systems for Human Communication, SPLASH 2004, New Delhi, India, November 17-19, 2004 | 
| The ISL RT04 Mandarin Broadcast News Evaluation System | Hua Yu, Yik-Cheung Tam, Thomas Schaaf, Sebastian Stüker, Qin Jin, Mohamed Noamany, Tanja Schultz | Proceedings of the EARS Fall Rich Transcription Workshop, EARS RT-04f, Palisades, New York, USA, November 7-10, 2004 | 
| A Way Out of Dead End Situations in Dialogue Systems for Human-Robot Interaction | Hartwig Holzapfel, Petra Gieselmann | Proceedings of the International Conference on Humanoid Robots, Humanoids 2004, Los Angeles, California, USA, November 3-5, 2004 | 
| Multimodal Detection of Human Interaction Events in a Nursing Home Environment | Datong Chen, Robert Malkin, Jie Yang | Proceedings of the 6th International Conference on Multimodal Interfaces, ICMI 2004, State College, Pennsylvania, USA, October 13-15, 2004 | 
| Identifying the Addressee in Human-Human-Robot Interactions based on Head Pose and Speech | Michael Katzenmaier, Rainer Stiefelhagen, Tanja Schultz | Proceedings of the 6th International Conference on Multimodal Interfaces, ICMI 2004, State College, Pennsylvania, USA, October 13-15, 2004 | 
| Implementation and Evaluation of a Constraint Based Multimodal Fusion System for Speech and 3D Pointing Gestures | Hartwig Holzapfel, Kai Nickel, Rainer Stiefelhagen | Proceedings of the 6th International Conference on Multimodal Interfaces, ICMI 2004, State College, Pennsylvania, USA, October 13-15, 2004 | 
| Towards Automatic Analysis of Social Interaction Patterns in a Nursing Home Environment from Video | Datong Chen, Jie Yang, Howard Wactlar | Proceedings of the 6th ACM SIGMM International Workshop on Multimedia Information Retrieval, ACM MIR 2004, New York, New York, USA, October 15-16, 2004 | 
| Incremental Detection of Text on Road Signs from Video with Application to a Driving Assistant System | Wen Wu, Xilin Chen, Jie Yang | Proceedings of the 12th Annual ACM International Conference on Multimedia, MM 2004, New York, New York, USA, October 10-16, 2004 | 
| Speech Translation: Past, Present and Future | Alex Waibel | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Worldwide Ongoing Activities On Multilingual Speech to Speech Translation | Gianni Lazzari, Alex Waibel, Chengqing Zong | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Automatic Detection And Recognition Of Signs From Natural Scenes | Xilin Chen, Jie Yang, Jing Zhang, Alex Waibel | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Adaptation for Soft Whisper Recognition Using a Throat Microphone | Szu-Chen Jou, Tanja Schultz, Alex Waibel | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Crosscorrelation-based Multispeaker Speech Activity Detection | Kornel Laskowski, Qin Jin, Tanja Schultz | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Issues in Meeting Transcription - The ISL Meeting Transcription System | Florian Metze, Qin Jin, Christian Fügen, Kornel Laskowski, Yue Pan, Tanja Schultz | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Speaker Segmentation and Clustering in Meetings | Qin Jin, Tanja Schultz | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Tight Coupling of Speech Recognition and Dialog Management - Dialog-Context Dependent Grammar Weighting for Speech Recognition | Christian Fügen, Hartwig Holzapfel, Alex Waibel | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Using Word Lattice Information for a Tighter Coupling in Speech Translation Systems | Shirin Saleem, Szu-Chen Jou, Stephan Vogel, Tanja Schultz | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Speaker Dependent Model Order Selection of Spectral Envelopes | Matthias Wölfel | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| A Cepstral Domain Maximum Likelihood Beamformer for Speech Recognition | Dominik Raub, John McDonough, Matthias Wölfel | Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, October 4-8, 2004 | 
| Towards Named Entity Extraction and Translation in Spoken Language Translation | Fei Huang, Stephan Vogel, Alex Waibel | Proceedings of the International Workshop on Spoken Language Translation, IWSLT 2004, Kyoto, Japan, September 30 - October 1, 2004 | 
| The ISL EDTRL System | Jürgen Reichert, Alex Waibel | Proceedings of the International Workshop on Spoken Language Translation, IWSLT 2004, Kyoto, Japan, September 30 - October 1, 2004 | 
| The ISL Statistical Machine Translation System for Spoken Language Translation | Stephan Vogel, Sanjika Hewavitharana, Muntsin Kolss, Alex Waibel | Proceedings of the International Workshop on Spoken Language Translation, IWSLT 2004, Kyoto, Japan, September 30 - October 1, 2004 | 
| Natural Human-Robot Interaction Using Speech, Head Pose and Gestures | Rainer Stiefelhagen, Christian Fügen, Petra Gieselmann, Hartwig Holzapfel, Kai Nickel, Alex Waibel | Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2004, Sendai, Japan, September 28 - October 2, 2004 | 
| Robuste Spracherkennung im Cockpit von Luftfahrzeugen | Michael Dambier, Matthias Wölfel, Christian Fügen | Proceedings of the Conference Elektronische Sprachsignalverarbeitung, ESSV 2004, Cottbus, Germany, September 20-22, 2004 | 
| Multiquellentraining: Chancen für kleine Trainingsmengen in der automatischen Spracherkennung | Matthias Wölfel | Proceedings of the Conference Elektronische Sprachsignalverarbeitung, ESSV 2004, Cottbus, Germany, September 20-22, 2004 | 
| Flexible Decision Trees for Grapheme Based Speech Recognition | Borislava Mimer, Sebastian Stüker, Tanja Schultz | Proceedings of the Conference Elektronische Sprachsignalverarbeitung, ESSV 2004, Cottbus, Germany, September 20-22, 2004 | 
| A Grapheme based Speech Recognition System for Russian | Sebastian Stüker, Tanja Schultz | Proceedings of the 9th International Conference Speech and Computer, SPECOM 2004, St. Petersburg, Russia, September 20-22, 2004 | 
| Large Vocabulary Audio-Visual Speech Recognition Using the Janus Speech Recognition Toolkit | Jan Kratt, Florian Metze, Rainer Stiefelhagen, Alex Waibel | Proceedings of the 26th DAGM Symposium for Pattern Recognition, Tübingen, Germany, August 30 - September 1, 2004 | 
| Interpreting BLEU/NIST Scores: How Much Improvement Do We Need to Have a Better System | Ying Zhang, Stephan Vogel, Alex Waibel | Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004, Lisbon, Portugal, May 26-28, 2004 | 
| Language Model Adaptation for Statistical Machine Translation with Structured Query Models | Bing Zhao, Matthias Eck, Stephan Vogel | Proceedings of the 20th International Conference on Computational Linguistics, COLING 2004, Geneva, Switzerland, August 23-27, 2004 | 
| Improving Statistical Machine Translation in the Medical Domain | Matthias Eck, Stephan Vogel, Alex Waibel | Proceedings of the 20th International Conference on Computational Linguistics, COLING 2004, Geneva, Switzerland, August 23-27, 2004 | 
| Estimating Head Pose with Neural Networks - Results on the Pointing04 ICPR Workshop Evaluation Data | Rainer Stiefelhagen | Pointing '04 ICPR Workshop, Satellite Workshop to the IEEE International Conference on Pattern Recognition, ICPR 2004, Cambridge, United Kingdom, August 22, 2004 | 
| Phrase Pair Rescoring with Term Weighting for Statistical Machine Translation | Bing Zhao, Stephan Vogel, Alex Waibel | Proceedings of the Conference on Empirical Methods in Natural Language Processing, EMNLP 2004, Barcelona, Spain, July 25-26, 2004 | 
| Reference Resolution Mechanisms in Dialogue Management | Petra Gieselmann | Proceedings of the 8th Workshop on the Semantics and Pragmatics of Dialogue (SemDial), CATALOG 2004, Barcelona, Spain, July 19-21, 2004 | 
| A Discriminative Learning Framework with Pairwise Constraints for Video Object Classification | Rong Yan, Jing Zhang, Jie Yang, Alexander Hauptmann | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2004, Washington DC, USA, June 27 - July 2, 2004 | 
| Natürliche Mensch-Roboter Interaktion mittels Sprache, Blickrichtung und Gestik | Rainer Stiefelhagen, Christian Fügen, Petra Gieselmann, Hartwig Holzapfel, Kai Nickel, Alex Waibel | Tagungsband VDI Robotik 2004, Robotik 2004, Munich, Germany, June 17-18, 2004 | 
| Language Model Adaptation For Statistical Machine Translation Based On Information Retrieval | Matthias Eck, Stephan Vogel, Alex Waibel | Proceedings of the 4th International Conference on Language Resources and Evaluation, LREC 2004, Lisbon, Portugal, May 26-28, 2004 | 
| 3D-Tracking of Heads and Hands for Pointing Gesture Recognition in a Human-Robot Interaction Scenario | Kai Nickel, Edgar Seemann, Rainer Stiefelhagen | Proceedings of the 6th International Conference on Face and Gesture Recognition, FGR 2004, Seoul, Korea, May 17-19, 2004 | 
| Head Pose Estimation Using Stereo Vision for Human-Robot Interaction | Edgar Seemann, Kai Nickel, Rainer Stiefelhagen | Proceedings of the 6th International Conference on Face and Gesture Recognition, FGR 2004, Seoul, Korea, May 17-19, 2004 | 
| Integrating Thumbnail Features For Speech Recognition Using Conditional Exponential Models | Hua Yu, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2004, Montreal, Quebec, Canada, May 17-21, 2004 | 
| Minimum Kullback-Leibler Distance Based Multivariate Gaussian Feature Adaptation for Distant-Talking Speech Recognition | Yue Pan, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2004, Montreal, Quebec, Canada, May 17-21, 2004 | 
| Performance Comparisons of All-pass Transform Adaptation with Maximum Likelihood Linear Regression | John McDonough, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2004, Montreal, Quebec, Canada, May 17-21, 2004 | 
| Towards Language Portability in Statistical Speech Translation | Alex Waibel, Tanja Schultz, Stephan Vogel, Christian Fügen, Matthias Honal, Muntsin Kolss, Jürgen Reichert, Sebastian Stüker | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2004, Montreal, Quebec, Canada, May 17-21, 2004 | 
| Issues in Meeting Transcription - The ISL Meeting Transcription System | Florian Metze, Qin Jin, Christian Fügen, Kornel Laskowski, Yue Pan, Tanja Schultz | Proceedings of the Rich Transcription Meeting Recognition Workshop, sponsored by the National Institute of Standards and Technology (NIST), held at ICASSP 2004, Montreal, Canada, May 17, 2004 | 
| Speaker Segmentation and Clustering in Meetings | Qin Jin, Kornel Laskowski, Tanja Schultz, Alex Waibel | Proceedings of the Rich Transcription Meeting Recognition Workshop, sponsored by the National Institute of Standards and Technology (NIST), held at ICASSP 2004, Montreal, Canada, May 17, 2004 | 
| The ISL Meeting Corpus: Categorical Features of Communicative Group Interactions | Susanne Burger, Zachary Sloane | Proceedings of the Rich Transcription Meeting Recognition Workshop, sponsored by the National Institute of Standards and Technology (NIST), held at ICASSP 2004, Montreal, Canada, May 17, 2004 | 
| Real-time Person Tracking and Pointing Gesture Recognition for Human-Robot Interaction | Kai Nickel, Rainer Stiefelhagen | Proceedings of the ECCV Workshop on Human-Computer Interaction, HCI 2004, held at ECCV 2004, Prague, Czech Republic, May 16, 2004 | 
| A Thai Speech Translation System For Medical Dialogs | Tanja Schultz, Dorcas Alexander, Alan W Black, Kay Peterson, Sinaporn Suebvisai, Alex Waibel | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, HLT-NAACL 2004, Boston, Massachusetts, USA, May 2-7, 2004 | 
| Improving Named Entity Translation Combining Phonetic and Semantic Similarities | Fei Huang, Stephan Vogel, Alex Waibel | Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, HLT-NAACL 2004, Boston, Massachusetts, USA, May 2-7, 2004 | 
| CHIL - Computers in the Human Interaction Loop | Alex Waibel, Hartwig Steusloff, Rainer Stiefelhagen and the CHIL Project Consortium | Proceedings of the 5th International Workshop on Image Analysis for Multimedia Interactive Services, WIAMIS 2004, Lisboa, Portugal, April 21-24, 2004 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Using Articulatory Information for Speaker Adaptation | Florian Metze, Alex Waibel | Proceedings of the Automatic Speech Recognition and Understanding Workshop, ASRU 2003, St. Thomas, Virgin Islands, USA, November 30 - December 4, 2003 | 
| Efficient Handling Of Multilingual Language Models | Christian Fügen, Sebastian Stüker, Hagen Soltau, Florian Metze, Tanja Schultz | Proceedings of the Automatic Speech Recognition and Understanding Workshop, ASRU 2003, St. Thomas, Virgin Islands, USA, November 30 - December 4, 2003 | 
| Warping And Scaling Of The Minimum Variance Distortionless Response | Matthias Wölfel, John McDonough, Alex Waibel | Proceedings of the Automatic Speech Recognition and Understanding Workshop, ASRU 2003, St. Thomas, Virgin Islands, USA, November 30 - December 4, 2003 | 
| Pointing Gesture Recognition Based On 3D-Tracking Of Face, Hands And Head Orientation | Kai Nickel, Rainer Stiefelhagen | Proceedings of the 5th International Conference on Multimodal Interfaces, ICMI 2003, Vancouver, Canada, November 5-7, 2003 | 
| Overlapping Phrase-Level Translation Rules in an SMT Engine | Alicia Tribble, Stephan Vogel, Alex Waibel | Proceedings of the International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2003, Beijing, China PR, October 26-29, 2003 | 
| Integrated Phrase Segmentation and Alignment Algorithm for Statistical Machine Translation | Ying Zhang, Stephan Vogel, Alex Waibel | Proceedings of the International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2003, Beijing, China PR, October 26-29, 2003 | 
| Gesture Recognition For Remote Collaborative Physical Tasks Using Tablet PCs | Jiazhi Ou, Xilin Chen, Jie Yang | Proceedings of the 9th IEEE International Conference on Computer Vision, ICCV 2003, Nice, France, October 14-17, 2003 | 
| Calibration Of A Hybrid Camera Network | Xilin Chen, Jie Yang, Alex Waibel | Proceedings of the 9th IEEE International Conference on Computer Vision, ICCV 2003, Nice, France, October 14-17, 2003 | 
| Towards Multimodal Communication with a Household Robot | Petra Gieselmann, Christian Fügen, Hartwig Holzapfel, Thomas Schaaf, Alex Waibel | Proceedings of the International Conference on Humanoid Robots, Humanoids 2003, Karlsruhe, Munich, Germany, October 1-3, 2003 | 
| Recognition of 3D-Pointing Gestures for Human-Robot-Interaction | Kai Nickel, Rainer Stiefelhagen | Proceedings of the International Conference on Humanoid Robots, Humanoids 2003, Karlsruhe, Munich, Germany, October 1-3, 2003 | 
| Capturing Interactions In Meetings With Omnidirectional Cameras | Rainer Stiefelhagen, Xilin Chen, Jie Yang | Proceedings of the IEEE International Workshop on Multimedia Technologies in E-Learning and Collaboration, WOMTEC 2003, Nice, France, September 30, 2003 | 
| Communicative Strategies And Patterns Of Multimodal Integration In A Speech-to-Speech Translation System | Susanne Burger, Erica Costantini, Fabio Pianesi | Proceedings of the Machine Translation Summit IX, MT Summit 2003, New Orleans, Louisiana, USA, September 23-27, 2003 | 
| The CMU Statistical Machine Translation System | Stephan Vogel, Ying Zhang, Fei Huang, Alicia Tribble, Ashish Venugopal, Bing Zhao, Alex Waibel | Proceedings of the Machine Translation Summit IX, MT Summit 2003, New Orleans, Louisiana, USA, September 23-27, 2003 | 
| MEL-Frequenzanpassung der Minimum Varianz Distortionless Response Einhüllenden | Matthias Wölfel | 14th Conference Elektronische Sprachsignalverarbeitung, ESSV 2003, Karlsruhe, Germany, September 24-26, 2003 | 
| Real-time Recognition Of 3D-Pointing Gestures For Human-Machine-Interaction | Kai Nickel, Rainer Stiefelhagen | Proceedings of the 25th DAGM Symposium for Pattern Recognition, DAGM 2003, Magdeburg, Germany, September 10-12, 2003 | 
| Communicative Effectiveness In Multimodal And Multilingual Dialogues | Erica Costantini, Susanne Burger, Fabio Pianesi | Proceedings of the 7th Workshop on the Semantics and Pragmatics of Dialogue, DiaBruck 2003, Saarbrücken, Germany, September 4-6, 2003 | 
| Speechalator: two-way speech-to-speech translation on a consumer PDA | Alex Waibel, Ahmed Badran, Alan W Black, Robert Frederking, Donna Gates, Alon Lavie, Lori Levin, Kevin Lenzo, Laura Mayfield Tomokiyo, Jürgen Reichert, Tanja Schultz, Dorcas Wallace, Monika Woszczyna, Jing Zhang | Proceedings of the 8th European Conference on Speech Communication and Technology, 4th INTERSPEECH Event, EUROSPEECH 2003, Geneva, Switzerland, September 1-4, 2003 | 
| Minimum Variance Distortionless Response on a Warped Frequency Scale | Matthias Wölfel, John McDonough, Alex Waibel | Proceedings of the 8th European Conference on Speech Communication and Technology, 4th INTERSPEECH Event, EUROSPEECH 2003, Geneva, Switzerland, September 1-4, 2003 | 
| Towards Multimodal Interaction With An Intelligent Room | Petra Gieselmann, Matthias Denecke | Proceedings of the 8th European Conference on Speech Communication and Technology, 4th INTERSPEECH Event, EUROSPEECH 2003, Geneva, Switzerland, September 1-4, 2003 | 
| The Nespole! VoIP Corpora In Tourism And Medical Domains | Nadia Mana, Susanne Burger, Roldano Cattoni, Laurent Besacier, Victoria MacLaren, John McDonough, Florian Metze | Proceedings of the 8th European Conference on Speech Communication and Technology, 4th INTERSPEECH Event, EUROSPEECH 2003, Geneva, Switzerland, September 1-4, 2003 | 
| Integrating Multilingual Articulatory Features Into Speech Recognition | Sebastian Stüker, Florian Metze, Tanja Schultz, Alex Waibel | Proceedings of the 8th European Conference on Speech Communication and Technology, 4th INTERSPEECH Event, EUROSPEECH 2003, Geneva, Switzerland, September 1-4, 2003 | 
| Grapheme Based Speech Recognition | Mirijam Killer, Sebastian Stüker, Tanja Schultz | Proceedings of the 8th European Conference on Speech Communication and Technology, 4th INTERSPEECH Event, EUROSPEECH 2003, Geneva, Switzerland, September 1-4, 2003 | 
| Enhanced Tree Clustering With Single Pronunciation | Hua Yu, Tanja Schultz | Proceedings of the 8th European Conference on Speech Communication and Technology, 4th INTERSPEECH Event, EUROSPEECH 2003, Geneva, Switzerland, September 1-4, 2003 | 
| Correction of Disfluencies in Spontaneous Speech using a Noisy-Channel | Matthias Honal, Tanja Schultz | Proceedings of the 8th European Conference on Speech Communication and Technology, 4th INTERSPEECH Event, EUROSPEECH 2003, Geneva, Switzerland, September 1-4, 2003 | 
| Effective Phrase Translation Extraction from Alignment Models | Ashish Venugopal, Stephan Vogel, Alex Waibel | Proceedings of the 41st Annual Conference of the Association for Computational Linguistics, ACL 2003, Sapporo, Japan, July 7-12, 2003 | 
| Automatic Extraction Of Named Entity Translingual Equivalence Based On Multi-feature Cost Minimization | Fei Huang, Stephan Vogel, Alex Waibel | Proceedings of the Workshop on Multilingual and Mixed-language Named Entity Recognition, 41st Annual Meeting of the Association for Computational Linguistics 2003, ACL 2003, Sapporo, Japan, July 11, 2003 | 
| Extracting Named Entity Translingual Equivalence With Limited Resources | Fei Huang, Stephan Vogel, Alex Waibel | 41st Annual Meeting of the Association for Computational Linguistics 2003, ACL 2003, Sapporo, Japan, July 7-12, 2003 | 
| The NESPOLE! Multimodal Speech-to-Speech Translation System: User Based System Improvements | Susanne Burger, Erica Costantini, Fabio Pianesi | Proceedings of the 8th International Conference on Human Aspects of Advanced Manufacturing: Agility & Hybrid Automation, HAAMAHA 2003, Rome, Italy, May 26-30, 2003 | 
| Efficient Optimization For Bilingual Sentence Alignment Based On Linear Regression | Bing Zhao, Klaus Zechner, Stephan Vogel, Alex Waibel | Proceedings of the HLT-NAACL Workshop: Building and Using Parallel Texts Data Driven Machine Translation and Beyond, HLT-NAACL 2003, Edmonton, Canada, May 27 - June 1, 2003 | 
| Speechalator: Two-way Speech-to-Speech Translation on a Consumer PDA | Alex Waibel, Ahmed Badran, Alan W Black, Robert Frederking, Donna Gates, Alon Lavie, Lori Levin, Kevin Lenzo, Laura Mayfield Tomokiyo, Jürgen Reichert, Tanja Schultz, Dorcas Wallace, Monika Woszczyna, Jing Zhang | Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Demonstrations, HLT-NAACL 2003, Edmonton, Canada, May 27 - June 1, 2003 | 
| Implicit Trajectory Modeling Through Gaussian Transition Models For Speech Recognition | Hua Yu, Tanja Schultz | Short Proceedings of the Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Companion Volume of the Proceedings 2003, HLT-NAACL 2003, Edmonton, Canada, May 27 - June 1, 2003 | 
| Word Alignment Based On Bilingual Bracketing | Bing Zhao, Stephan Vogel | Proceedings of the HLT-NAACL Workshop: Building and Using Parallel Texts Data Driven Machine Translation and Beyond, HLT-NAACL 2003, Edmonton, Canada, May 27 - June 1, 2003 | 
| Recent Advances in Lingwear: A Wearable Linguistic Assistant for Tourists | Christian Fügen, Tanja Schultz, Jia-Cheng Hu, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China PR, April 6-10, 2003 | 
| Comparison of Acoustic Model Adaptation Techniques on Non-native Speech | Zhirong Wang, Tanja Schultz, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China PR, April 6-10, 2003 | 
| Maximum Mutual Information Speaker Adapted Training with Semi-Tied Covariance Matrices | John McDonough, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China PR, April 6-10, 2003 | 
| Combining Cross-Stream And Time Dimensions In Phonetic Speaker Recognition | Qin Jin, Jiri Navratil, Douglas Reynolds, Joseph Campbell, Walter Andrews, Joy Abramson | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China PR, April 6-10, 2003 | 
| Multilingual Articulatory Features | Sebastian Stüker, Tanja Schultz, Florian Metze, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China PR, April 6-10, 2003 | 
| SMaRT: The Smart Meeting Room Task At ISL | Alex Waibel, Tanja Schultz, Michael Bett, Matthias Denecke, Robert Malkin, Ivica Rogina, Rainer Stiefelhagen, Jie Yang | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China PR, April 6-10, 2003 | 
| Flexible Parameter Tying For Conversational Speech Recognition | Hua Yu, Alex Waibel | Proceedings of the ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition, SSPR 2003, Tokyo, Japan, April 13-16, 2003 | 
| Advances in ISL’s Lecture and Meeting Trackers | Alex Waibel, Ivica Rogina | Proceedings of the ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition, SSPR 2003, Tokyo, Japan, April 13-16, 2003 | 
| Semantische Suche zur Unterstützung des Internet-basierten Wissenstransfers | Karsten Krutz, Matthias Eck, Christian Mayerl, Matthias Riechmann, Sebastian Abeck | Fachtagung Kommunikation in Verteilten Systemen, KiVS 2003, Leipzig, Germany, February 25-28, 2003 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| A robust approach for recognition of text embedded in natural scenes | Jing Zhang, Xilin Chen, Andreas Hanneman, Jie Yang, Alex Waibel | International Conference on Pattern Recognition | 
| Eyes and Ears for a Humanoid Robot | Dirk Bechler, Markus Schlosser, Kristian Kroschel, Rainer Stiefelhagen, Kai Nickel, Alex Waibel | Human Centered Robotic Systems - HCRS, Karlsruhe, Germany, December 2002 | 
| A PDA-based Face Recognition System | Jie Yang, Xilin Chen, William Kunz | Proceedings of the 6th IEEE Workshop on Applications of Computer Vision, WACV 2002, Orlando, Florida, USA, December 3-4, 2002 | 
| Automatic Detection of Signs with Affine Transformation | Xilin Chen, Jie Yang, Jing Zhang, Alex Waibel | Proceedings of the 6th IEEE Workshop on Applications of Computer Vision, WACV 2002, Orlando, Florida, USA, December 3-4, 2002 | 
| A PDA-based Sign Translator | Jing Zhang, Xilin Chen, Jie Yang, Alex Waibel | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Flexi-modal And Multi-Machine User Interfaces | Brad Myers, Robert Malkin, Michael Bett, Alex Waibel, Ben Bostwick, Robert C. Miller, Jie Yang, Matthias Denecke, Edgar Seemann, Jie Zhu, Choon Hong Peck, Dave Kong, Jeffrey Nichols, Bill Scherlis | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Improved Named Entity Translation And Bilingual Named Entity Extraction | Fei Huang, Stephan Vogel | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Integrating Emotional Cues into a Framework for Dialogue Management | Hartwig Holzapfel, Christian Fügen, Matthias Denecke, Alex Waibel | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Lecture And Presentation Tracking In An Intelligent Meeting Room | Ivica Rogina, Thomas Schaaf | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| The Added Value Of Multimodality In NESPOLE! Speech-to-Speech Translation System: An Experimental Study | Erica Costantini, Fabio Pianesi, Susanne Burger | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Towards Monitoring Human Activities Using An Omnidirectional Camera | Xilin Chen, Jie Yang | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Towards Universal Speech Recognition | Zhirong Wang, Umut Topkara, Tanja Schultz, Alex Waibel | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Towards Vision-based 3-D People Tracking In A Smart Room | Dirk Focken, Rainer Stiefelhagen | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Tracking Focus Of Attention In Meetings | Rainer Stiefelhagen | Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, Pennsylvania, USA, October 14-16, 2002 | 
| Interlingua based Statistical Machine Translation | Manuel Kauers, Stephan Vogel, Christian Fügen, Alex Waibel | Proceedings of the 7th International Conference on Spoken Language Processing, 2nd INTERSPEECH Event, ICSLP 2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002 | 
| Phonetic Speaker Identification | Qin Jin, Tanja Schultz, Alex Waibel | Proceedings of the 7th International Conference on Spoken Language Processing, 2nd INTERSPEECH Event, ICSLP 2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002 | 
| A Flexible Stream Architecture For ASR Using Articulatory Features | Florian Metze, Alex Waibel | Proceedings of the 7th International Conference on Spoken Language Processing, 2nd INTERSPEECH Event, ICSLP 2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002 | 
| Automatic SIGN Translation | Ying Zhang, Bing Zhao, Jie Yang, Alex Waibel | Proceedings of the 7th International Conference on Spoken Language Processing, 2nd INTERSPEECH Event, ICSLP 2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002 | 
| Compensating For Hyperarticulation By Modeling Articulatory Properties | Hagen Soltau, Florian Metze, Alex Waibel | Proceedings of the 7th International Conference on Spoken Language Processing, 2nd INTERSPEECH Event, ICSLP 2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002 | 
| Full-text Story Alignment Models For Chinese-English Bilingual News Corpora | Bing Zhao, Stephan Vogel | Proceedings of the 7th International Conference on Spoken Language Processing, 2nd INTERSPEECH Event, ICSLP 2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002 | 
| The ISL Meeting Corpus: The Impact Of Meeting Type On Speech Style | Susanne Burger, Victoria MacLaren, Hua Yu | Proceedings of the 7th International Conference on Spoken Language Processing, 2nd INTERSPEECH Event, ICSLP 2002 - INTERSPEECH 2002, Denver, Colorado, USA, September 16-20, 2002 | 
| Rapid Prototyping For Spoken Dialogue Systems | Matthias Denecke | Proceedings of the 19th International Conference on Computational Linguistics, COLING 2002, Taipei, China, August 24 - September 1, 2002 | 
| A Robust Approach for Recognition of Text Embedded in Natural Scenes | Jing Zhang, Xilin Chen, Andreas Hannemann, Jie Yang, Alex Waibel | Proceedings of the 16th International Conference on Pattern Recognition, ICPR 2002, Québec, Canada, August 11-15, 2002 | 
| A Multi-Perspective Evaluation of the NESPOLE! Speech-to-Speech Translation System | Alon Lavie, Florian Metze, Roldano Cattoni, Erica Costantini, Susanne Burger, Donna Gates, Chad Langley, Kornel Laskowski, Lori Levin, Kay Peterson, Tanja Schultz, Alex Waibel, Dorcas Wallace, John McDonough, Hagen Soltau, Gianni Lazzari, Nadia Mana, Fabio Pianesi, Emanuele Pianta, Laurent Besacier, Hervé Blanchon, Dominique Vaufreydaz, Loredana Taddei | Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems at the 40th Annual Meeting of the Association for Computational Linguistics, ACL 2002, Philadelphia, Pennsylvania, USA, July 6-12, 2002 | 
| Improvements in Non-Verbal Cue Identification Using Multilingual Phone Strings | Tanja Schultz, Qin Jin, Kornel Laskowski, Alicia Tribble, Alex Waibel | Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems at the 40th Annual Meeting of the Association for Computational Linguistics, ACL 2002, Philadelphia, Pennsylvania, USA, July 6-12, 2002 | 
| Spoken Language Parsing Using Phrase-Level Grammars And Trainable Classifiers | Chad Langley, Alon Lavie, Lori Levin, Dorcas Wallace, Donna Gates, Kay Peterson | Proceedings of the Workshop on Speech-to-Speech Translation: Algorithms and Systems at the 40th Annual Meeting of the Association for Computational Linguistics, ACL 2002, Philadelphia, Pennsylvania, USA, July 6-12, 2002 | 
| Analysis For Speech Translation Using Grammar-Based Parsing And Automatic Classification | Chad Langley | Proceedings of Student Research Workshop at the 40th Annual Meeting of the Association for Computational Linguistics, ACL 2002, Philadelphia, Pennsylvania, USA, July 6-12, 2002 | 
| NESPOLE!'s Multilingual And Multimodal Corpus | Erica Costantini, Susanne Burger, Fabio Pianesi | Proceedings of the 3rd International Conference on Language Resources and Evaluation, LREC 2002, Las Palmas, Spain, May 29-31, 2002 | 
| Subpixel Eye Gaze Tracking | Jie Zhu, Jie Yang | Proceedings of the 5th IEEE International Conference on Automatic Face and Gesture Recognition, FGR 2002, Washington DC, USA, May 20-21, 2002 | 
| On Maximum Mutual Information Speaker-Adapted Training | John McDonough, Thomas Schaaf, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2002, Orlando, Florida, USA, May 13-17, 2002 | 
| Automatic Speech Summarization Applied to English Broadcast News Speech | Chiori Hori, Sadaoki Furui, Rob Malkin, Hua Yu, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2002, Orlando, Florida, USA, May 13-17, 2002 | 
| Automatic Detection and Translation of Text from Natural Scenes | Jie Yang, Xilin Chen, Jing Zhang, Ying Zhang, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2002, Orlando, Florida, USA, May 13-17, 2002 | 
| Efficient Language Model Lookahead Through Polymorphic Linguistic Context Assignment | Hagen Soltau, Florian Metze, Christian Fügen, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2002, Orlando, Florida, USA, May 13-17, 2002 | 
| Speaker Identification Using Multilingual Phone Strings | Qin Jin, Tanja Schultz, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2002, Orlando, Florida, USA, May 13-17, 2002 | 
| Toward Robust Parametric Trajectory Segmental Model For Vowel Recognition | Bing Zhao, Tanja Schultz | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2002, Orlando, Florida, USA, May 13-17, 2002 | 
| Head Orientation And Gaze Direction In Meetings | Rainer Stiefelhagen, Jie Zhu | Proceedings of the Conference on Human Factors in Computing Systems, CHI 2002, Minneapolis, Minnesota, USA, April 20-25, 2002 | 
| Automatic Speech Summarization Applied To English Broadcast News Speech | Chiori Hori, Sadaoki Furui, Hua Yu, Alex Waibel, Rob Malkin | Proceedings of the 2nd Human Language Technology Conference, HLT 2002, San Diego, California, USA, March 24-27, 2002 | 
| Enhancing The Usability And Performance Of NESPOLE! - A Real-World Speech-to-Speech Translation System | Alon Lavie, Florian Metze, Fabio Pianesi | Proceedings of the 2nd Human Language Technology Conference, HLT 2002, San Diego, California, USA, March 24-27, 2002 | 
| Speaker, Accent, And Language Identification Using Multilingual Phone Strings | Tanja Schultz, Qin Jin, Kornel Laskowski, Alicia Tribble, Alex Waibel | Proceedings of the 2nd Human Language Technology Conference, HLT 2002, San Diego, California, USA, March 24-27, 2002 | 
| The NESPOLE! Speech-to-Speech Translation System | Florian Metze, John McDonough, Hagen Soltau, Alex Waibel, Alon Lavie, Susanne Burger, Chad Langley, Lori Levin, Tanja Schultz, Fabio Pianesi, Roldano Cattoni, Gianni Lazzari, Nadia Mana, Emanuele Pianta | Proceedings of the 2nd Human Language Technology Conference, HLT 2002, San Diego, California, USA, March 24-27, 2002 | 
| Automatic Summarization of English Broadcast News Speech | Chiori Hori, Sadaoki Furui, Rob Malkin, Hua Yu, Alex Waibel | Proceedings of the 2nd Human Language Technology Conference, HLT 2002, San Diego, California, USA, March 24-27, 2002 | 
| An Adaptive Approach to Named Entity Extraction for Meeting Applications | Fei Huang, Alex Waibel | Proceedings of the 2nd Human Language Technology Conference, HLT 2002, San Diego, California, USA, March 24-27, 2002 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| A One-Pass Decoder Based On Polymorphic Linguistic Context Assignment | Hagen Soltau, Florian Metze, Christian Fügen, Alex Waibel | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001, Trento, Italy, December 9-13, 2001 | 
| Multilinguality Issues And Speech/Text Resources | Jimmy Kunzmann, Khalid Choukri, Eric Janke, Andreas Kießling, Kate Knill, Lori Lamel, Tanja Schultz, Seiichi Yamamoto | Proceedings of the IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001, Trento, Italy, December 9-13, 2001 | 
| An Adaptive Algorithm For Text Detection From Natural Scenes | Jiang Gao, Jie Yang | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR 2001, Kauai Marriott, Hawaii, USA, December 8-14, 2001 | 
| Tracking Focus Of Attention For Human-Robot Communication | Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the IEEE-RAS International Conference on Humanoid Robots, Humanoids 2001, Tokyo, Japan, November 22-24, 2001 | 
| Estimating Focus Of Attention Based On Gaze And Sound | Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the Workshop on Perceptual User Interfaces, PUI 2001, Orlando, Florida, USA, November 15-16, 2001 | 
| An Automatic Sign Recognition And Translation System | Jie Yang, Jiang Gao, Ying Zhang, Xilin Chen, Alex Waibel | Proceedings of the Workshop on Perceptual User Interfaces, PUI 2001, Orlando, Florida, USA, November 15-16, 2001 | 
| Language Independent And Language Adaptive Acoustic Modeling For Speech Recognition | Tanja Schultz, Alex Waibel | Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2001, New Orleans, Louisiana, USA, September 9-13, 2001 | 
| Automatic Generation Of Concise Summaries Of Spoken Dialogues In Unrestricted Domains | Klaus Zechner | Proceedings of the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2001, New Orleans, Louisiana, USA, September 9-13, 2001 | 
| Segmenting Conversations By Topic, Initiative And Style | Klaus Ries | Proceedings of the Workshop on Information Retrieval Techniques for Speech Applications at the 24th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2001, New Orleans, Louisiana, USA, September 9-13, 2001 | 
| Adaptation Methods For Non-Native Speech | Laura Mayfield Tomokiyo, Alex Waibel | Proceedings of the 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, EUROSPEECH 2001, Aalborg, Denmark, September 3-7, 2001 | 
| Detection Of OOV Words Using Generalized Word Models And A Semantic Class Language Model | Thomas Schaaf | Proceedings of the 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, EUROSPEECH 2001, Aalborg, Denmark, September 3-7, 2001 | 
| Experiments On Cross-Language Acoustic Modeling | Tanja Schultz, Alex Waibel | Proceedings of the 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, EUROSPEECH 2001, Aalborg, Denmark, September 3-7, 2001 | 
| Hypothesis-Driven Accent Discrimination | Laura Mayfield Tomokiyo | Proceedings of the 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, EUROSPEECH 2001, Aalborg, Denmark, September 3-7, 2001 | 
| Speech Recognition Over NetMeeting Connections | Florian Metze, John McDonough, Hagen Soltau | Proceedings of the 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, EUROSPEECH 2001, Aalborg, Denmark, September 3-7, 2001 | 
| The NESPOLE! VoIP Dialogue Database | Susanne Burger, Laurent Besacier, Paolo Coletti, Florian Metze, Céline Morel | Proceedings of the 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, EUROSPEECH 2001, Aalborg, Denmark, September 3-7, 2001 | 
| Increasing The Coherence Of Spoken Dialogue Summaries By Cross-Speaker Information Linking | Klaus Zechner, Alon Lavie | Proceedings of the 2nd Meeting of the North American Chapter of the Association for Computational Linguistics, NAACL 2001, Pittsburgh, Pennsylvania, USA, June 2-7, 2001 | 
| Advances In Automatic Meeting Record Creation And Access | Alex Waibel, Michael Bett, Florian Metze, Klaus Ries, Thomas Schaaf, Tanja Schultz, Hagen Soltau, Hua Yu, Klaus Zechner | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2001, Salt Lake City, Utah, USA, May 7-11, 2001 | 
| Model-Combination-Based Acoustic Mapping | Martin Westphal, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2001, Salt Lake City, Utah, USA, May 7-11, 2001 | 
| Speaker Compensation With Sine-Log All-Pass Transforms | John McDonough, Florian Metze, Hagen Soltau, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2001, Salt Lake City, Utah, USA, May 7-11, 2001 | 
| The ISL Evaluation System for Verbmobil-II | Hagen Soltau, Thomas Schaaf, Florian Metze, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2001, Salt Lake City, Utah, USA, May 7-11, 2001 | 
| Visual Speech Synthesis Using Quadtree Splines | Xue-Wen Chen, Jie Yang | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2001, Salt Lake City, Utah, USA, May 7-11, 2001 | 
| The ISL Meeting Room System | Tanja Schultz, Alex Waibel, Michael Bett, Florian Metze, Yue Pan, Klaus Ries, Thomas Schaaf, Hagen Soltau, Martin Westphal, Hua Yu, Klaus Zechner | Proceedings of the International Workshop on Hands-Free Speech Communication, HSC 2001, Kyoto, Japan, April 9-11, 2001 | 
| Towards Automatic Sign Translation | Jie Yang, Jiang Gao, Ying Zhang, Alex Waibel | Proceedings of the 1st Human Language Technology Conference, HLT 2001, San Diego, California, USA, March 18-21, 2001 | 
| LingWear: A Mobile Tourist Information System | Christian Fügen, Martin Westphal, Mike Schneider, Tanja Schultz, Alex Waibel | Proceedings of the 1st Human Language Technology Conference, HLT 2001, San Diego, California, USA, March 18-21, 2001 | 
| Domain Portability in Speech-to-Speech Translation | Alon Lavie, Lori Levin, Tanja Schultz, Chad Langley, Benjamin Han, Alicia Tribble, Donna Gates, Dorcas Wallace, Kay Peterson | Proceedings of the 1st Human Language Technology Conference, HLT 2001, San Diego, California, USA, March 18-21, 2001 | 
| Advances in Meeting Recognition | Alex Waibel, Hua Yu, Martin Westphal, Hagen Soltau, Tanja Schultz, Thomas Schaaf, Yue Pan, Florian Metze, Michael Bett | Proceedings of the 1st Human Language Technology Conference, HLT 2001, San Diego, California, USA, March 18-21, 2001 | 
| Activity Detection For Information Access To Oral Communication | Klaus Ries, Alex Waibel | Proceedings of the 1st Human Language Technology Conference, HLT 2001, San Diego, California, USA, March 18-21, 2001 | 
| Architecture and Design Considerations in NESPOLE!: A Speech Translation System For E-commerce Applications | Alon Lavie, Chad Langley, Alex Waibel, Fabio Pianesi, Gianni Lazzari, Paolo Coletti, Loredana Taddei, Franco Balducci | Proceedings of the 1st Human Language Technology Conference, HLT 2001, San Diego, California, USA, March 18-21, 2001 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Emotion-Sensitive Human-Computer Interfaces | Thomas S Polzin, Alexander Waibel | ISCA Tutorial and Research Workshop (ITRW) on Speech and Emotion | 
| An Efficient LDA Algorithm For Face Recognition | Jie Yang, Hua Yu, William Kunz | Proceedings of the 6th International Conference on Control, Automation, Robotics and Vision, ICARCV 2000, Singapore, Singapore, December 5-8, 2000 | 
| User Registration Using Your Face And Mouth | Jie Yang, Fei Huang, William Kunz | Proceedings of the 6th International Conference on Control, Automation, Robotics and Vision, ICARCV 2000, Singapore, Singapore, December 5-8, 2000 | 
| Phone Dependent Modeling of Hyperarticulated Effects | Hagen Soltau, Alex Waibel | Proceedings of the 6th International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| A Naive De-Lambing Method For Speaker Identification | Jie Yang, Alex Waibel | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| Acoustic Models For Hyperarticulated Speech | Hagen Soltau, Alex Waibel | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| Application of LDA to Speaker Recognition | Qin Jin, Alex Waibel | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| Dialogue Management For Multimodal User Registration | Fei Huang, Jie Yang, Alex Waibel | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| Informational Characterization Of Dialogue States | Matthias Denecke | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| Lexical And Acoustic Modeling Of Non-Native Speech In LVCSR | Laura Mayfield Tomokiyo | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| New Developments In Automatic Meeting Transcription | Hua Yu, Takashi Tomokiyo, Zhirong Wang, Alex Waibel | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| Streamlining the Front End of a Speech Recognizer | Hua Yu, Alex Waibel | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| The Effects Of The Room Acoustics On MFCC Speech Parameter | Yue Pan, Alex Waibel | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| Verbmobil Dialogues: Multifaced Analysis | Akira Kurematsu, Youichi Akegami, Susanne Burger, Susanne Jekat, Brigitte Lause, Victoria L Maclaren, Daniela Oppermann, Tanja Schultz | Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Beijing, China, October 16-20, 2000 | 
| Language Portability In Acoustic Modeling | Tanja Schultz, Alex Waibel | Proceedings of the Workshop on Multilingual Speech Communication, MSC 2000, Kyoto, Japan, October 11-13, 2000 | 
| Object-Oriented Techniques In Grammar And Ontology Specification | Matthias Denecke | Proceedings of the Workshop on Multilingual Speech Communication, MSC 2000, Kyoto, Japan, October 11-13, 2000 | 
| Das View4You-System: End-to-End Evaluation | Florian Metze, Thomas Kemp | Proceedings of the 5th KONVENS Conference on Natural Language Processing (Konferenz zur Verarbeitung natürlicher Sprache), KONVENS 2000, Ilmenau, Germany, October 9-12, 2000 | 
| NPen++: An On-line Handwriting Recognition System | Stefan Jaeger, Stefan Manke, Alex Waibel | Proceedings of the 7th International Workshop on Frontiers in Handwriting Recognition, IWFHR 2000, Amsterdam, The Netherlands, September 11-13, 2000 | 
| On the Complexity of Cognition | Stefan Jaeger | Proceedings of the 7th International Workshop on Frontiers in Handwriting Recognition, IWFHR 2000, Amsterdam, The Netherlands, September 11-13, 2000 | 
| Growing Gaussian Mixture Models for Pose Invariant Face Recognition | Ralph Gross, Jie Yang, Alex Waibel | Proceedings of the 15th International Conference on Pattern Recognition, ICPR 2000, Barcelona, Spain, September 3-8, 2000 | 
| Simultaneous Tracking of Head Poses in a Panoramic View | Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the 15th International Conference on Pattern Recognition, ICPR 2000, Barcelona, Spain, September 3-8, 2000 | 
| An Integrated Development Environment For Spoken Dialogue Systems | Matthias Denecke | Proceedings of the 18th International Conference on Computational Linguistics, COLING 2000, Saarbrücken, Germany, July 31 - August 4, 2000 | 
| DIASUMM: Flexible Summarization of Spontaneous Dialogues in Unrestricted Domains | Klaus Zechner, Alex Waibel | Proceedings of the 18th International Conference on Computational Linguistics, COLING 2000, Saarbrücken, Germany, July 31 - August 4, 2000 | 
| Dialogue Act Modeling For Automatic Tagging And Recognition Of Conversational Speech | Andreas Stolcke, Klaus Ries, Noah Coccaro, Elizabeth Shriberg, Rebecca Bates, Daniel Jurafsky, Paul Taylor, Rachel Martin, Carol Van Ess-Dykema, Marie Meteer | Proceedings of the 18th International Conference on Computational Linguistics, COLING 2000, Saarbrücken, Germany, July 31 - August 4, 2000 | 
| Partial Information In Multimodal Dialogue | Matthias Denecke, Jie Yang | Proceedings of the International Conference on Multimedia and Expo, ICME 2000, New York City, New York, USA, July 30 - August 2, 2000 | 
| Automatic Selection Of Visemes For Image-based Visual Speech Synthesis | Jie Yang, Jing Xiao, Max Ritter | Proceedings of the International Conference on Multimedia and Expo, ICME 2000, New York City, New York, USA, July 30 - August 2, 2000 | 
| Towards A Multimodal Meeting Record | Ralph Gross, Michael Bett, Hua Yu, Xiaojin Zhu, Yue Pan, Jie Yang, Alex Waibel | Proceedings of the International Conference on Multimedia and Expo, ICME 2000, New York City, New York, USA, July 30 - August 2, 2000 | 
| Confidence Measure Based Language Identification | Florian Metze, Thomas Kemp, Thomas Schaaf, Tanja Schultz, Hagen Soltau | Proceedings of the 25th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 5-9, 2000 | 
| Integrating Dynamic Speech Modalities Into Context Decision Trees | Christian Fügen, Ivica Rogina | Proceedings of the 25th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 5-9, 2000 | 
| Linguistic Properties Of Non-Native Speech | Laura Mayfield Tomokiyo | Proceedings of the 25th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 5-9, 2000 | 
| Polyphone Decision Tree Specialization For Language Adaptation | Tanja Schultz, Alex Waibel | Proceedings of the 25th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 5-9, 2000 | 
| Specialized Acoustic Models For Hyperarticulated Speech | Hagen Soltau, Alex Waibel | Proceedings of the 25th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 5-9, 2000 | 
| Strategies For Automatic Segmentation Of Audio Data | Thomas Kemp, Michael Schmidt, Martin Westphal, Alex Waibel | Proceedings of the 25th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 5-9, 2000 | 
| Turkish LVCSR: Toward Better Speech Recognition For Agglutinative Languages | Kenan Çarki, Petra Geutner, Tanja Schultz | Proceedings of the 25th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, Istanbul, Turkey, June 5-9, 2000 | 
| Lessons Learned From A Task-Based Evaluation Of Speech-to-Speech Machine Translation | Lori Levin, Boris Bartlog, Ariadna Llitjos, Donna Gates, Alon Lavie, Dorcas Wallace, Taro Watanabe, Monika Woszczyna | Proceedings of the 2nd International Conference on Language Resources and Evaluation, LREC 2000, Athens, Greece, May 31 - June 1, 2000 | 
| Shallow Discourse Annotation In CallHome Spanish | Klaus Ries, Lori Levin, Liza Valle, Alon Lavie, Alex Waibel | Proceedings of the 2nd International Conference on Language Resources and Evaluation, LREC 2000, Athens, Greece, May 31 - June 1, 2000 | 
| Minimizing Word Error Rate In Textual Summaries Of Spoken Language | Klaus Zechner, Alex Waibel | Proceedings of the ANLP/NAACL Workshop on Automatic Summarization, Seattle, Washington, USA, April 30, 2000 | 
| End To End Evaluation Of The ISL VIEW4YOU Broadcast News Transcription System | Thomas Kemp, Manfred Weber, Alex Waibel | Proceedings of the 6th Conference on Content-based Multimedia Information Access, RIAO 2000, Paris, France, April 12-14, 2000 | 
| Multimodal Meeting Tracker | Michael Bett, Ralph Gross, Hua Yu, Xiaojin Zhu, Yue Pan, Jie Yang, Alex Waibel | Proceedings of the 6th Conference on Content-based Multimedia Information Access, RIAO 2000, Paris, France, April 12-14, 2000 | 
| Face Recognition In A Meeting Room | Ralph Gross, Jie Yang, Alex Waibel | Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2000, Grenoble, France, March 26-30, 2000 | 
| Segmenting Hands Of Arbitrary Color | Xiaojin Zhu, Jie Yang, Alex Waibel | Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2000, Grenoble, France, March 26-30, 2000 | 
| Multilinguality In Speech And Spoken Language Systems | Alex Waibel, Petra Geutner, Laura Mayfield Tomokiyo, Tanja Schultz, Monika Woszczyna | Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2000, Grenoble, France, March 26-30, 2000 | 
| On-Line Handwriting Recognition: The NPen++ Recognizer | Stefan Jäger, Stefan Manke, Jürgen Reichert, Alex Waibel | Proceedings of the 4th IEEE International Conference on Automatic Face and Gesture Recognition, FG 2000, Grenoble, France, March 26-30, 2000 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Multimodal People ID For A Multimedia Meeting Browser | Jie Yang, Xiaojin Zhu, Ralph Gross, John Kominek, Yue Pan, Alex Waibel | Proceedings of the 7th ACM International Conference on Multimedia, MULTIMEDIA 1999, Orlando, Florida, USA, October 30 - November 5, 1999 | 
| Modeling Focus Of Attention For Meeting Indexing | Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the 7th ACM International Conference on Multimedia, MULTIMEDIA 1999, Orlando, Florida, USA, October 30 - November 5, 1999 | 
| Smart Sight: A Tourist Assistant System | Jie Yang, Weiyi Yang, Matthias Denecke, Alex Waibel | Proceedings of the 3rd International Symposium on Wearable Computers, ISWC 1999, San Francisco, California, USA, October 18-19, 1999 | 
| Modeling People's Focus Of Attention | Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the IEEE International Workshop on Modelling People, MPeople 1999, Kerkyra, Corfu, Greece, September 20, 1999 | 
| Language Adaptive LVCSR Through Polyphone Decision Tree Specialization | Tanja Schultz, Alex Waibel | Proceedings of the Workshop on Multi-lingual Interoperability in Speech Technology, MIST 1999, Leusden, The Netherlands, September 13-14, 1999 | 
| Mandarin Large Vocabulary Speech Recognition Using The GLOBALPHONE Database | Jürgen Reichert, Tanja Schultz, Alex Waibel | Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999 | 
| Towards Spontaneous Speech Recognition For On-Board Car Navigation And Information Systems | Martin Westphal, Alex Waibel | Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999 | 
| Towards The Detection And Description Of Textual Meaning Indicators In Spontaneous Conversations | Klaus Ries | Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999 | 
| Unsupervised Training Of A Speech Recognizer: Recent Experiments | Thomas Kemp, Alex Waibel | Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999 | 
| Mixture Trees - Hierarchically Tied Mixture Densities For Modeling HMM Emission Probabilities | Jürgen Fritsch | Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999 | 
| Progress In Automatic Meeting Transcription | Hua Yu, Michael Finke, Alex Waibel | Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999 | 
| Modeling And Efficient Decoding Of Large Vocabulary Conversational Speech | Michael Finke, Jürgen Fritsch, Detlef Koll, Alex Waibel | Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, September 5-9, 1999 | 
| Data-Driven Determination Of Appropriate Dictionary Units For Korean LVCSR | Daniel Kiecza, Tanja Schultz, Alex Waibel | Proceedings of the International Conference on Speech Processing, ICSP 1999, Seoul, South Korea, August 18-20, 1999 | 
| Face Translation: A Multimodal Translation Agent | Max Ritter, Uwe Meier, Jie Yang, Alex Waibel | Proceedings of the 4th annual Auditory-Visual Speech Processing Conference, AVSP 1999, Santa Cruz, California, USA, August 7-10, 1999 | 
| Regional Variants Of German: Categories Of Pronunciation Deviation from Standard German | Susanne Burger, Daniela Oppermann | Proceedings of the 14th International Congress of Phonetic Sciences, ICPhS 1999, San Francisco, California, USA, August 1-7, 1999 | 
| What Makes Speech Data Spontaneous? | Daniela Oppermann, Susanne Burger | Proceedings of the 14th International Congress of Phonetic Sciences, ICPhS 1999, San Francisco, California, USA, August 1-7, 1999 | 
| Integrating Knowledge Sources For The Specification Of A Task-Orientated Dialogue System | Matthias Denecke, Alex Waibel | Proceedings of the 16th International Joint Conference on Artificial Intelligence, IJCAI 1999, Stockholm, Sweden, July 31 - August 6, 1999 | 
| Multimodal People ID For A Multimedia Meeting Browser | Jie Yang, Xiaojin Zhu, Ralph Gross, John Kominek, Yue Pan, Alex Waibel | Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, ACL 1999, College Park, Maryland, USA, June 20-26, 1999 | 
| Modeling Focus Of Attention For Meeting Indexing | Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, ACL 1999, College Park, Maryland, USA, June 20-26, 1999 | 
| Eliciting Natural Speech From Non-Native Users: Collecting Speech Data For LVCSR | Laura Mayfield Tomokiyo, Susanne Burger | Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, ACL 1999, College Park, Maryland, USA, June 20-26, 1999 | 
| Tagging Of Speech Acts And Dialogue Games In Spanish Call Home | Lori Levin, Klaus Ries, Ann Thymé-Gobbel, Alon Lavie | Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, ACL 1999, College Park, Maryland, USA, June 20-26, 1999 | 
| Hidden Understanding Models for Machine Translation | Wolfgang Minker, Marsal Gavalda, Alex Waibel | Proceedings of the ESCA Tutorial and Research Workshop (ETRW) on Interactive Dialogue in Multi-Modal Systems, Kloster Irsee, Germany, June 22-25, 1999 | 
| From Gaze to Focus of Attention | Rainer Stiefelhagen, Michael Finke, Jie Yang, Alex Waibel | Proceedings of the 3rd International Conference On Visual Information Systems, VISUAL 1999, Amsterdam, The Netherlands, June 2-4, 1999 | 
| Model-based And Empirical Evaluation Of Multimodal Interactive Error Correction | Bernhard Suhm, Brad Myers, Alex Waibel | Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems, CHI 1999, Pittsburgh, Pennsylvania, USA, May 15-20, 1999 | 
| Development Of Data Collection And Transliteration Of Japanese Spontaneous Database In The Travel Arrangement Task Domain | Akira Kurematsu, Youichi Akegami, Tanja Schultz, Susanne Burger | Proceedings of the International Workshop on East-Asian Language Resource and Evaluation, Oriental COCOSDA 1999, Taipei, China, May 12-14, 1999 | 
| HMM And Neural Network Based Speech Act Detection | Klaus Ries | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1999, Phoenix, Arizona, USA, March 15-19, 1999 | 
| Selection Criteria For Hypothesis Driven Lexical Adaptation | Petra Geutner, Michael Finke, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1999, Phoenix, Arizona, USA, March 15-19, 1999 | 
| Towards Unrestricted Lipreading | Uwe Meier, Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the 2nd International Conference on Multimodal Interface, ICMI 1999, Hong Kong, China PR, January 5-7, 1999 | 
| Experiments towards a Multi-language LVCSR Interface | Tanja Schultz, Alex Waibel | Proceedings of the 2nd International Conference on Multimodal Interface, ICMI 1999, Hong Kong, China PR, January 5-7, 1999 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Real-time Face And Facial Feature Tracking And Applications | Jie Yang, Rainer Stiefelhagen, Uwe Meier, Alex Waibel | Proceedings of the International Conferences on Auditory-Visual Speech Processing, AVSP 1998, New-South-Wales, Australia, December 4-7, 1998 | 
| An Interlingua Based on Domain Actions for Machine Translation of Task-Oriented Dialogues | Lori Levin, Donna Gates, Alon Lavie, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| A Discourse Coding Scheme For Conversational Spanish | Lori Levin, Ann Thymé-Gobbel, Alon Lavie, Klaus Ries, Klaus Zechner | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Conversational Speech Systems For On-board Car Navigation And Assistance | Petra Geutner, Matthias Denecke, Uwe Meier, Martin Westphal, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Effective Structural Adaptation Of LVCSR Systems To Unseen Domains Using Hierarchical Connectionist Acoustic Models | Jürgen Fritsch, Michael Finke, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Fast Decoding For Statistical Machine Translation | Ye-Yi Wang, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Language Independent And Language Adaptive Large Vocabulary Speech Recognition | Tanja Schultz, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Linear Discriminant - A New Method For Speaker Normalization | Martin Westphal, Tanja Schultz, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Linguistically Engineered Tools For Speech Recognition Error Analysis | Carol Van Ess-Dykema, Klaus Ries | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| On The Influence Of Hyperarticulated Speech On Recognition Performance | Hagen Soltau, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Phonetic-Distance-Based Hypothesis Driven Lexical Adaptation for Transcribing Multilingual Broadcast News | Petra Geutner, Michael Finke, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Probabilistic Dialogue Act Extraction For Concept Based Multilingual Translation Systems | Toshiaki Fukada, Detlef Koll, Alex Waibel, Kouichi Tanigaki | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Reducing The OOV Rate In Broadcast News Speech Recognition | Thomas Kemp, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| The Interactive Systems Labs View4you Video Indexing System | Thomas Kemp, Petra Geutner, Michael Schmidt, Borislaw Tomaz, Manfred Weber, Martin Westphal, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| Unsupervised Training Of A Speech Recognizer Using TV Broadcasts | Thomas Kemp, Alex Waibel | Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998, Sydney, Australia, November 30 - December 4, 1998 | 
| From Gaze to Focus of Attention | Rainer Stiefelhagen, Michael Finke, Jie Yang, Alex Waibel | Proceedings of the Workshop on Perceptual User Interfaces, PUI 1998, San Francisco, California, USA, November 5-6, 1998 | 
| A Visual Timing Tool for Language Training of Hearing Impaired Children | Uwe Meier, Jie Yang, Alex Waibel | Proceedings of the Workshop on Perceptual User Interfaces, PUI 1998, San Francisco, California, USA, November 5-6, 1998 | 
| A Modular Approach To Spoken Language Translation For Large Domains | Monika Woszczyna, Matthew Broadhead, Donna Gates, Marsal Gavalda, Alon Lavie, Lori Levin, Alex Waibel | Proceedings of the 3rd Conference of the Association for Machine Translation in the Americas, AMTA 1998, Langhorne, Pennsylvania, USA, October 28-31, 1998 | 
| Adaptation Of Pronunciation Dictionaries For Recognition Of Unseen Languages | Tanja Schultz, Alex Waibel | Proceedings of the International Workshop on Speech and Computer, SPECOM 1998, St. Petersburg, Russia, October 26-29, 1998 | 
| An Adaptive Multimodal Interface For Wireless Applications | Jie Yang, William Holtz, Weiyi Yang, Minh Tue Vo | Proceedings of International Symposium on Wearable Computers, ISWC 1998, Pittsburgh, Pennsylvania, USA, October 19-20, 1998 | 
| Das Projekt Globalphone: Multilinguale Spracherkennung | Tanja Schultz, Alex Waibel | Proceedings of the 4th KONVENS Conference on Natural Language Processing (Konferenz zur Verarbeitung natürlicher Sprache), KONVENS 1998, Bonn, Germany, October 5-7, 1998 | 
| Can Prosody Aid The Automatic Classification Of Dialog Acts In Conversational Speech? | Elizabeth Shriberg, Rebecca Bates, Andreas Stolcke, Paul Taylor, Daniel Jurafsky, Klaus Ries, Noah Coccaro, Rachel Martin, Marie Meteer, Carol Van Ess-Dykema | Proceedings of the 4th KONVENS Conference on Natural Language Processing (Konferenz zur Verarbeitung natürlicher Sprache), KONVENS 1998, Bonn, Germany, October 5-7, 1998 | 
| Development Of Multilingual Acoustic Models In The GlobalPhone Project | Tanja Schultz, Alex Waibel | Proceedings of the 1st Workshop on Text, Speech and Dialogue, TSD 1998, Brno, Czech Republic, September 23-26, 1998 | 
| Automatic Construction Of Frame Representations For Spontaneous Speech In Unrestricted Domains | Klaus Zechner | Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and the 17th International Conference on Computational Linguistics, COLING-ACL 1998, Montréal, Quebec, Canada, August 10-14, 1998 | 
| Growing Semantic Grammars | Marsal Gavalda, Alex Waibel | Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and the 17th International Conference on Computational Linguistics, COLING-ACL 1998, Montréal, Quebec, Canada, August 10-14, 1998 | 
| Modeling With Structures In Statistical Machine Translation | Ye-Yi Wang, Alex Waibel | Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and the 17th International Conference on Computational Linguistics, COLING-ACL 1998, Montréal, Quebec, Canada, August 10-14, 1998 | 
| Using Chunk Based Partial Parsing Of Spontaneous Speech In Unrestricted Domains For Reducing Word Error Rate In Speech Recognition | Klaus Zechner, Alex Waibel | Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and the 17th International Conference on Computational Linguistics, COLING-ACL 1998, Montréal, Quebec, Canada, August 10-14, 1998 | 
| Pronunciation Variations In Emotional Speech | Thomas Polzin, Alex Waibel | Proceedings of the ESCA Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition 1998, ESCA 1998, Kerkrade, The Netherlands, May 1998 | 
| Error Repair in Human Handwriting - An Intelligent User Interface For Automatic On-Line Handwriting Recognition | Wolfgang Huerst, Jie Yang, Alex Waibel | Proceedings of the IEEE International Joint Symposia on Intelligence and Systems, IJSIS 1998, Rockville, Maryland, USA, May 21-23, 1998 | 
| ACID/HNN: Clustering Hierarchies Of Neural Networks For Context-Dependent Connectionist Acoustic Modeling | Jürgen Fritsch, Michael Finke | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1998, Seattle, Washington, USA, May 12-15, 1998 | 
| Adaptive Vocabularies For Transcribing Multilingual Broadcast News | Petra Geutner, Michael Finke, Peter Scheytt | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1998, Seattle, Washington, USA, May 12-15, 1998 | 
| An Automatic Method For Learning A Japanese Lexicon For Recognition Of Spontaneous Speech | Laura Mayfield Tomokiyo, Klaus Ries | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1998, Seattle, Washington, USA, May 12-15, 1998 | 
| Experiments In Automatic Meeting Transcription Using JRTK | Hua Yu, Cortis Clark, Robert Malkin, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1998, Seattle, Washington, USA, May 12-15, 1998 | 
| Recognition Of Music Types | Hagen Soltau, Tanja Schultz, Martin Westphal, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1998, Seattle, Washington, USA, May 12-15, 1998 | 
| Serbo-Croatian LVCSR On The Dictation And Broadcast News Domain | Peter Scheytt, Petra Geutner, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1998, Seattle, Washington, USA, May 12-15, 1998 | 
| Hierarchies Of Neural Networks For Connectionist Speech Recognition | Jürgen Fritsch, Alex Waibel | Proceedings of the 6th European Symposium on Artificial Neural Networks, ESANN 1998, Bruges, Belgium, April 22-24, 1998 | 
| Interactive Error Repair for an Online Handwriting Interface | Wolfgang Huerst, Jie Yang, Alex Waibel | Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems, CHI 1998, Los Angeles, California, USA, April 18-21, 1998 | 
| Visual Tracking For Multimodal Human Computer Interaction | Jie Yang, Rainer Stiefelhagen, Uwe Meier, Alex Waibel | Proceedings of the ACM SIGCHI Conference on Human Factors in Computing Systems, CHI 1998, Los Angeles, California, USA, April 18-21, 1998 | 
| CLARITY: Inferring Discourse Structure From Speech | Michael Finke, Maria Lapata, Alon Lavie, Lori Levin, Laura Mayfield Tomokiyo, Thomas Polzin, Klaus Ries, Alex Waibel, Klaus Zechner | AAAI Spring Symposium Series, Stanford University, Palo Alto, California, USA, March 23-25, 1998 | 
| CLARITY: Automatic Discourse And Dialogue Analysis For A Speech And Natural Language Processing System | Michael Finke, Maria Lapata, Alon Lavie, Lori Levin, Laura Mayfield Tomokiyo, Thomas Polzin, Klaus Ries, Alex Waibel, Klaus Zechner | AAAI Spring Symposium Series, Stanford University, Palo Alto, California, USA, March 23-25, 1998 | 
| Dialog Act Modeling For Conversational Speech | Andreas Stolcke, Elizabeth Shriberg, Rebecca Bates, Noah Coccaro, Daniel Jurafsky, Rachel Martin, Marie Meteer, Klaus Ries, Paul Taylor, Carol Van Ess-Dykema | AAAI Spring Symposium Series, Stanford University, Palo Alto, California, USA, March 23-25, 1998 | 
| Towards Tracking Interaction Between People | Rainer Stiefelhagen, Jie Yang, Alex Waibel | AAAI Spring Symposium Series, Stanford University, Palo Alto, California, USA, March 23-25, 1998 | 
| Meeting Browser: Tracking And Summarizing Meetings | Alex Waibel, Michael Bett, Michael Finke, Rainer Stiefelhagen | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, Virginia, USA, February 8-11, 1998 | 
| Multilingual And Crosslingual Speech Recognition | Tanja Schultz, Alex Waibel | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, Virginia, USA, February 8-11, 1998 | 
| Transcribing Multilingual Broadcast News Using Hypothesis Driven Lexical Adaptation | Petra Geutner, Michael Finke, Peter Scheytt, Alex Waibel, Howard Wactlar | Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, Virginia, USA, February 8-11, 1998 | 
| Detecting Emotions In Speech | Thomas Polzin, Alex Waibel | Proceedings of the 2nd International Conference on Cooperative Multimodal Communication, CMC 1998, Tilburg, The Netherlands, January 28-30, 1998 | 
| Skin-Color Modeling And Adaptation | Jie Yang, Weier Lu, Alex Waibel | Proceedings of the 3rd Asian Conference on Computer Vision, ACCV 1998, Hong Kong, China PR, January 8-10, 1998 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| ACID/HNN: A Framework For Hierarchical Connectionist Acoustic Modeling | Jürgen Fritsch | Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 1997, Santa Barbara, California, USA, December 17, 1997 | 
| Automatic Detection Of Discourse Structure For Speech Recognition And Understanding | Daniel Jurafsky, Rebecca Bates, Noah Coccaro, Rachel Martin, Marie Meteer, Klaus Ries, Elizabeth Shriberg, Andreas Stolcke, Paul Taylor, Carol Van Ess-Dykema | Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 1997, Santa Barbara, California, USA, December 17, 1997 | 
| Empirical Evaluation Of Interactive Multimodal Error Recovery | Bernhard Suhm | Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 1997, Santa Barbara, California, USA, December 17, 1997 | 
| Flexible Transcription Alignment | Michael Finke, Alex Waibel | Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 1997, Santa Barbara, California, USA, December 17, 1997 | 
| Pronunciation Modelling For Conversational Speech Recognition: A Status Report From WS97 | B. Byrne, M. Finke, S. Khudanpur, J. McDonough, H. Nock, M. Riley, M. Saraclar, C. Wooters, G. Zavaliagkos | Proceedings of the IEEE Automatic Speech Recognition and Understanding Workshop, ASRU 1997, Santa Barbara, California, USA, December 17, 1997 | 
| Java Front-End For Web-Based Multimodal Human-Computer Interaction | Xing Jing, Jie Yang, Minh Tue Vo, Alex Waibel | Proceedings of the Workshop on Perceptual User Interfaces, PUI 1997, Banff, Canada, October 20-21, 1997 | 
| Tracking Eyes And Monitoring Eye Gaze | Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the Workshop on Perceptual User Interfaces, PUI 1997, Banff, Canada, October 20-21, 1997 | 
| Preprocessing Of Visual Speech Under Real World Conditions | Uwe Meier, Rainer Stiefelhagen, Jie Yang | Proceedings of the ESCA Workshop on Audio-Visual Speech Processing, AVSP 1997, Rhodes, Greece, September 26-27, 1997 | 
| A Class Based Approach To Domain Adaptation And Constraint Integration For Empirical M-Gram Models | Klaus Ries | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Dialogue Strategies Guiding Users To Their Communicative Goals | Matthias Denecke, Alex Waibel | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Estimating Confidence Using Word Lattices | Thomas Kemp, Thomas Schaaf | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Fast Bootstrapping Of LVCSR Systems With Multilingual Phoneme Sets | Tanja Schultz, Alex Waibel | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Fuzzy Class Rescoring: A Part-Of-Speech Language Model | Petra Geutner | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Improving Performance On Switchboard By Combining Hybrid HME/HMM And Mixture Of Gaussians Acoustic Models | Jürgen Fritsch, Michael Finke | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Japanese LVCSR On The Spontaneous Scheduling Task With Janus-3 | Tanja Schultz, Detlef Koll, Alex Waibel | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Real-Time Lip-Tracking For Lipreading | Rainer Stiefelhagen, Uwe Meier, Jie Yang | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Recognition Of Spoken And Spelled Proper Names | Michael Meyer, Hermann Hild | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Speaker Normalization And Speaker Adaptation - A Combination For Conversational Speech Recognition | Puming Zhan, Martin Westphal, Michael Finke, Alex Waibel | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Speaking Mode Dependent Pronunciation Modeling In Large Vocabulary Conversational Speech Recognition | Michael Finke, Alex Waibel | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Statistical Analysis Of Dialogue Structure | Ye-Yi Wang, Alex Waibel | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Stochastically-Based Natural Language Understanding Across Tasks And Languages | Wolfgang Minker | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| The Use Of Cepstral Means In Conversational Speech Recognition | Martin Westphal | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Exploiting Repair Context In Interactive Error Recovery | Bernhard Suhm, Alex Waibel | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| Automatic Architecture Design By Likelihood-Based Context Clustering With Crossvalidation | Ivica Rogina | Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997 | 
| An Information-Based Approach For Guiding Multi-Modal Human-Computer Interaction | Matthias Denecke | Proceedings of the 15th International Joint Conference on Artificial Intelligence, IJCAI 1997, Nagoya, Aichi, Japan, August 23-29, 1997 | 
| A Programmable Multi-Blackboard Architecture For Dialogue Processing Systems | Matthias Denecke | Proceedings of the 35th Annual Meeting of the ACL joint with the 8th Meeting of the European Chapter of the ACL 1997, ACL/EACL 1997, Madrid, Spain, July 1997 | 
| Decoding Algorithm In Statistical Machine Translation | Ye-Yi Wang, Alex Waibel | Proceedings of the 35th Annual Meeting of the ACL joint with the 8th Meeting of the European Chapter of the ACL 1997, ACL/EACL 1997, Madrid, Spain, July 1997 | 
| Expanding the Domain of a Multi-lingual Speech-to-Speech Translation System | Alon Lavie, Lori Levin, Puming Zhan, Maite Taboada, Donna Gates, Mirella Lapata, Cortis Clark, Matthew Broadhead, Alex Waibel | Proceedings of the 35th Annual Meeting of the ACL joint with the 8th Meeting of the European Chapter of the ACL 1997, ACL/EACL 1997, Madrid, Spain, July 1997 | 
| Improving Translation Through Contextual Information | Maite Taboada | Proceedings of the 35th Annual Meeting of the ACL joint with the 8th Meeting of the European Chapter of the ACL 1997, ACL/EACL 1997, Madrid, Spain, July 1997 | 
| What Makes A Word: Learning Base Units In Japanese For Speech Recognition | Laura Mayfield Tomokiyo, Klaus Ries | Proceedings of the 35th Annual Meeting of the ACL joint with the 8th Meeting of the European Chapter of the ACL 1997, ACL/EACL 1997, Madrid, Spain, July 1997 | 
| Contextual Information For Disambiguation In A Speech-To-Speech Translation System | Maite Taboada | 24th International Systemic Functional Congress, ISFC 1997, Toronto, Canada, July 1997 | 
| Planning Transition Relevance Points In Speech-Based Information Systems | Yan Qu | Proceedings of the 14th National Conference on Artificial Intelligence, AAAI 1997, Providence, Rhode Island, USA, July 27-31, 1997 | 
| The JanusRTk Switchboard/CallHome Evaluation System | Michael Finke, Jürgen Fritsch, Petra Geutner, Klaus Ries, Torsten Zeppenfeld, Alex Waibel | Proceedings of the LVCSR Hub 5-E Workshop, Linthicum, Maryland, USA, May 13-15, 1997 | 
| The Global Phone Project: Multilingual LVCSR With Janus-3 | Tanja Schultz, Martin Westphal, Alex Waibel | 2nd SQEL Workshop 1997, SQEL 1997, Plzen, Czech Republic, April 1997 | 
| Verbmobil: The Combination of Deep and Shallow Processing for Spontaneous Speech Translation | Thomas Bub, Wolfgang Wahlster, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| Wide Context Acoustic Modeling In Read Vs. Spontaneous Speech | Michael Finke, Ivica Rogina | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| The Karlsruhe-Verbmobil Speech Recognition Engine | Michael Finke, Petra Geutner, Hermann Hild, Thomas Kemp, Klaus Ries, Martin Westphal | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| Speaker Normalization Based On Frequency Warping | Puming Zhan, Martin Westphal | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| Recognition Of Conversational Telephone Speech Using The Janus Speech Engine | Torsten Zeppenfeld, Michael Finke, Klaus Ries, Martin Westphal, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| Multimodal Interfaces For Multimedia Information Agents | Alex Waibel, Bernhard Suhm, Minh Tue Vo, Jie Yang | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| JANUS-III: Speech-To-Speech Translation In Multiple Languages | Alon Lavie, Alex Waibel, Lori Levin, Michael Finke, Donna Gates, Marsal Gavalda, Torsten Zeppenfeld, Puming Zhan | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| Gaze Tracking For Multimodal Human-Computer Interaction | Rainer Stiefelhagen, Jie Yang | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| Context-Dependent Hybrid HME/HMM Speech Recognition Using Polyphone Clustering Decision Trees | Jürgen Fritsch, Michael Finke, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| A Contextual Blind Separation Of Delayed And Convolved Sources | Te-Won Lee, Reinhold Orglmeister | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| Confidence Measures For Spontaneous Speech Recognition | Thomas Schaaf, Thomas Kemp | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, April 21-24, 1997 | 
| High Performance Segmentation Of Spontaneous Speech Using Part Of Speech And Trigger Word Information | Marsal Gavalda, Klaus Zechner, Gregory Aist | Proceedings of the 5th Conference on Applied Natural Language Processing, ANLP 1997, Washington D.C., USA, March 31 - April 3, 1997 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Adaptively Growing Hierarchical Mixtures Of Experts | Jürgen Fritsch, Michael Finke, Alex Waibel | Proceedings of the Conference on Neural Information Processing Systems, NIPS 1996, Denver, Colorado, USA, December 2-5, 1996 | 
| Blind Separation Of Delayed And Convolved Sources | Te-Won Lee, Anthony J. Bell, Russell H. Lambert | Proceedings of the Conference on Neural Information Processing Systems, NIPS 1996, Denver, Colorado, USA, December 2-5, 1996 | 
| A Real-Time Face Tracker | Jie Yang, Alex Waibel | Proceedings of the 3rd IEEE Workshop on Applications of Computer Vision, WACV 1996, Sarasota, Florida, USA, December 2-4, 1996 | 
| A Model-Based Gaze Tracking System | Rainer Stiefelhagen, Jie Yang, Alex Waibel | Proceedings of the IEEE International Joint Symposia on Intelligence and Systems, IJSIS 1996, Rockville, Maryland, USA, November 4-5, 1996 | 
| Automatische Identifizierung spontan gesprochener Sprachen mit neuronalen Netzen | Tanja Schultz, Hagen Soltau | Proceedings of the 3rd KONVENS Conference (Konferenz zur Verarbeitung natürlicher Sprache), KONVENS 1996, Bielefeld, Germany, October 1996 | 
| Regelbasiert Generierte Aussprachevarianten Für Spontansprache | Thomas Kemp | Proceedings of the 3rd KONVENS Conference (Konferenz zur Verarbeitung natürlicher Sprache), KONVENS 1996, Bielefeld, Germany, October 1996 | 
| A Stochastic Case Frame Approach For Natural Language Understanding | Wolfgang Minker, Samir Bennacef, Jean-Luc Gauvain | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Class Phrase Models For Language Modeling | Klaus Ries, Finn Dag Buø, Alex Waibel | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Dialogue Processing In A Conversational Speech Translation System | Alon Lavie, Lori Levin, Yan Qu, Alex Waibel, Donna Gates, Marsal Gavalda, Laura Mayfield, Maite Taboada | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Dictionary Learning for Spontaneous Speech Recognition | Tilo Sloboda, Alex Waibel | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Interactive Recovery From Speech Recognition Errors In Speech User Interfaces | Bernhard Suhm, Brad Myers, Alex Waibel | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Introducing Linguistic Constraints Into Statistical Language Modeling | Petra Geutner | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| JANUS II: Towards Spontaneous Spanish Speech Recognition | Puming Zhan, Klaus Ries, Marsal Gavalda, Donna Gates, Alon Lavie, Alex Waibel | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Learning To Parse Spontaneous Speech | Finn Dag Buø, Alex Waibel | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Recognition Of Spelled Names Over The Telephone | Hermann Hild, Alex Waibel | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Recognizing Emotion in Speech | Frank Dellaert, Thomas Polzin, Alex Waibel | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Translation Of Conversational Speech With Janus-II | Alon Lavie, Alex Waibel, Lori Levin, Donna Gates, Marsal Gavalda, Torsten Zeppenfeld, Puming Zhan, Oren Glickman | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| Word Clustering With Parallel Spoken Language Corpora | Ye-Yi Wang, John Lafferty, Alex Waibel | Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996 | 
| JANUS: a Multi-lingual Speech-to-speech Translation System for Spontaneously Spoken Language in a Limited Domain | Alon Lavie, Lori Levin, Alex Waibel, Donna Gates, Marsal Gavalda, Laura Mayfield | Proceedings of the 2nd Conference of the Association for Machine Translation in the Americas, AMTA 1996, Montreal, Canada, October 2-5, 1996 | 
| System Description JANUS. Multi-lingual Translation of Spontaneous Speech in a Limited Domain | Alon Lavie, Lori Levin, Alex Waibel, Donna Gates, Marsal Gavalda, Laura Mayfield | Proceedings of the 2nd Conference of the Association for Machine Translation in the Americas, AMTA 1996, Montreal, Canada, October 2-5, 1996 | 
| A Fast Search Technique for Large Vocabulary On-Line Handwriting Recognition | Stefan Manke, Michael Finke, Alex Waibel | Proceedings of the 5th International Workshop on Frontiers in Handwriting Recognition, IWFHR 1996, Colchester, United Kingdom, September 2-5, 1996 | 
| Focus Of Attention: Towards Low Bitrate Video Tele-Conferencing | Jie Yang, Leejay Wu, Alex Waibel | Proceedings of the 3rd IEEE International Conference on Image Processing, ICIP 1996, Lausanne, Switzerland, September 16-19, 1996 | 
| Minimizing Cumulative Error In Discourse Context | Yan Qu, Barbara Di Eugenio, Alon Lavie, Lori Levin, Carolyn P. Rosé | Proceedings of the European Conference on Artificial Intelligence (ECAI) Workshop on Dialogue Processing in Spoken Language Systems, Budapest, Hungary, August 13, 1996 | 
| Input Segmentation of Spontaneous Speech In Janus: A Speech-to-Speech Translation System | Alon Lavie, Donna Gates, Noah Coccaro, Lori Levin | Proceedings of the European Conference on Artificial Intelligence (ECAI) Workshop on Dialogue Processing in Spoken Language Systems, Budapest, Hungary, August 13, 1996 | 
| End-To-End Evaluation In Janus: A Speech-to-Speech Translation System | Donna Gates, Alon Lavie, Lori Levin, Alex Waibel, Marsal Gavalda, Laura Mayfield, Monika Woszczyna, Puming Zhan | Proceedings of the European Conference on Artificial Intelligence (ECAI) Workshop on Dialogue Processing in Spoken Language Systems, Budapest, Hungary, August 13, 1996 | 
| Multi-lingual Translation of Spontaneously Spoken Language in a Limited Domain | Alon Lavie, Donna Gates, Marsal Gavalda, Laura Mayfield, Alex Waibel, Lori Levin | Proceedings of the 16th International Conference on Computational Linguistics, COLING 1996, Copenhagen, Denmark, August 5-9, 1996 | 
| FeasPar - A Feature Structure Parser Learning to Parse Spoken Language | Finn Dag Buø, Alex Waibel | Proceedings of the 16th International Conference on Computational Linguistics, COLING 1996, Copenhagen, Denmark, August 5-9, 1996 | 
| GLR*: A Robust Parser For Spontaneously Spoken Language | Alon Lavie | 8th European Summer School in Logic, Language, and Information, ESSLLI 1996, Prague, Czech Republic, August 12-23, 1996 | 
| Search in a Learnable Spoken Language Parser | Finn Dag Buø, Alex Waibel | Proceedings of the 12th European Conference on Artificial Intelligence, ECAI 1996, Budapest, Hungary, August 11-16, 1996 | 
| Using Discourse Predictions For Ambiguity Resolution | Yan Qu, Carolyn P. Rosé, Barbara Di Eugenio | Proceedings of the 16th International Conference on Computational Linguistics, COLING 1996, Copenhagen, Denmark, August 5-9, 1996 | 
| The Rhythm Of Lexical Stress In Prose | Doug Beeferman | Proceedings of the 34th Annual Meeting of the Association for Computational Linguistics, ACL 1996, Santa Cruz, California, USA, June 24-27, 1996 | 
| Compréhension Et Évaluation Dans Le Domaine Atis | Wolfgang Minker, Samir Bennacef | Proceedings of the 21st Journées d'Etude sur la Parole 1996, JEP 1996, Avignon, France, June 10-14, 1996 | 
| The Bucket Box Intersection (BBI) Algorithm For Fast Approximative Evaluation Of Diagonal Mixture Gaussians | Jürgen Fritsch, Ivica Rogina | Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996 | 
| Modelling Unknown Words In Spontaneous Speech | Thomas Kemp, Andreas Jusek | Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996 | 
| Minimizing Search Errors Due To Delayed Bigrams In Real-Time Speech Recognition Systems | Monika Woszczyna, Michael Finke | Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996 | 
| LVCSR-Based Language Identification | Tanja Schultz, Ivica Rogina, Alex Waibel | Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996 | 
| JANUS-II - Advances in Spontaneous Speech Translation | Monika Woszczyna, Michael Finke, Thomas Kemp, Arthur McNair, Alon Lavie, Laura Mayfield, Martin Maier, Ivica Rogina, Tilo Sloboda, Alex Waibel, Puming Zhan, Torsten Zeppenfeld | Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996 | 
| JANUS-II – Translation of Spontaneous Conversational Speech | Alex Waibel, Michael Finke, Donna Gates, Marsal Gavalda, Thomas Kemp, Alon Lavie, Lori Levin, Martin Maier, Laura Mayfield, Arthur McNair, Ivica Rogina, Kaori Shima, Tilo Sloboda, Monika Woszczyna, Torsten Zeppenfeld, Puming Zhan | Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996 | 
| Building An Application Framework For Speech And Pen Input Integration In Multimodal Learning Interfaces | Minh Tue Vo, Cindy Wood | Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996 | 
| Adaptive Bimodal Sensor Fusion For Automatic Speechreading | Uwe Meier, Wolfgang Hürst, Paul Duchnowski | Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996 | 
| Designing Interactive Error Recovery Methods for Speech Interfaces | Bernhard Suhm, Brad Myers, Alex Waibel | Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI), Workshop on Designing the User Interface for Speech Recognition Applications, CHI 1996, Vancouver, Canada, April 13-18, 1996 | 
| Switchboard evaluation report | Michael Finke, Torsten Zeppenfeld, Martin Maier, Laura Mayfield, Klaus Ries, Puming Zhan, John Lafferty, Alex Waibel | Proceedings of the LVCSR Hub 5 Workshop, April 1996 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Tracking Human Faces in Real-Time | Jie Yang, Alex Waibel | Carnegie Mellon University, Department of Computer Science | 
| Natural speech processing in practice: Experiences with the Verbmobil / Janus-2 System | Alex Waibel, Michael Finke, Donna Gates, Marsal Gavalda, Petra Geutner, Thomas Kemp, Alon Lavie, Arthur McNair, Laura Mayfield, Martin Maier, Ivica Rogina, Kaori Shima, Tilo Sloboda, Monika Woszczyna, Torsten Zeppenfeld, Puming Zhan | Proceedings of the Verbmobil TP13 Workshop, Hamburg, Germany, October 18-20, 1995 | 
| Speeding Up The Score Computation Of HMM Speech Recognizers With The Bucket Voronoi Intersection Algorithm | Jürgen Fritsch, Ivica Rogina, Tilo Sloboda, Alex Waibel | Proceedings of the 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995, Madrid, Spain, September 18-21, 1995 | 
| Integrating Spelling Into Spoken Dialogue Recognition | Hermann Hild, Alex Waibel | Proceedings of the 4th European Conference on Speech Communication and Technology, EUROSPEECH 1995, Madrid, Spain, September 18-21, 1995 | 
| Connectionist Transfer in Machine Translation | Ye-Yi Wang, Alex Waibel | Proceedings of the International Conference on Recent Advances in Natural Language Processing, RANLP 1995, Tzigov Chark, Bulgaria, September 14-16, 1995 | 
| Integrating Different Learning Approaches Into A Multilingual Spoken Translation System | Petra Geutner, Bernhard Suhm, Finn Dag Buø, Thomas Kemp, Laura Mayfield, Arthur E. McNair, Ivica Rogina, Tanja Schultz, Tilo Sloboda, Wayne Ward, Monika Woszczyna, Alex Waibel | Proceedings of the 14th International Joint Conference on Artificial Intelligence, IJCAI 1995, Montreal, Quebec, Canada, August 20-25, 1995 | 
| NPen++: A Writer Independent, Large Vocabulary On-Line Cursive Handwriting Recognition System | Stefan Manke, Michael Finke, Alex Waibel | Proceedings of the 3rd International Conference on Document Analysis and Recognition, ICDAR 1995, Montreal, Canada, August 14-16, 1995 | 
| Efficient Iterative Scaling Of A Class Of Maximum Entropy Language Models | John D. Lafferty, Bernhard Suhm | Proceedings of the 15th Workshop on Maximum Entropy and Bayesian Methods, WSME 1995, Santa Fe, New Mexico, USA, July 31 - August 4, 1995 | 
| Parsing Real Input in JANUS: A Concept-Based Approach to Spoken Language Translation | Laura Tomokiyo, Marsal Gavalda, Y-H. Seo, Bernhard Suhm, Wayne Ward, Alex Waibel | Proceedings of the 6th International Conference on Theoretical and Methodological Issues in Machine Translation, TMI 1995, Leuven, Belgium, July 5-7, 1995 | 
| Using Context in Machine Translation of Spoken Language | Lori Levin, Oren Glickman, Yan Qu, Donna Gates, Alon Lavie, Carolyn P. Rosé, Carol van Ess-Dykema, Alex Waibel | Proceedings of the 6th International Conference on Theoretical and Methodological Issues in Machine Translation, TMI 1995, Leuven, Belgium, July 5-7, 1995 | 
| Estimation of the Head Orientation Based on a Face-Color-Intensifier | Bernt Schiele, Alex Waibel | Proceedings of the 3rd International Symposium on Intelligent Robotic Systems, SIRS 1995, Pisa, Italy, July 1995 | 
| Gaze-Tracking based on Face-Color | Bernt Schiele, Alex Waibel | Proceedings of the International Workshop on Automatic Face- and Gesture-Recognition, IWAFGR 1995, Zurich, Switzerland, June 1995 | 
| Using Morphology Towards Better Large-Vocabulary Speech Recognition Systems | Petra Geutner | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Toward Movement-Invariant Automatic Lip-Reading And Speech Recognition | Paul Duchnowski, Martin Hunke, Dietrich Büsching, Uwe Meier, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Language Models For A Spelled Letter Recognizer | Martin Betz, Hermann Hild | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Knowing Who To Listen To In Speech Recognition: Visually Guided Beamforming | Udo Bub, Martin Hunke, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Improved Language Modeling By Unsupervised Acquisition Of Structure | Klaus Ries, Finn Dag Buø, Ye-Yi Wang | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Dictionary Learning: Performance Through Consistency | Tilo Sloboda | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Data-Driven Codebook Adaptation In Phonetically Tied SCHMMs | Thomas Kemp | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Concept-Based Speech Translation | Laura Mayfield, Marsal Gavalda, Wayne Ward, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Acoustic And Language Modeling Of Human And Nonhuman Noises For Human-To-Human Spontaneous Speech Recognition | Tanja Schultz, Ivica Rogina | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995 | 
| Experiments With LVCSR Based Language Identification | Tanja Schultz, Ivica Rogina, Alex Waibel | Proceedings of the 15th Annual Speech Research Symposium, SRS 1995, Baltimore, Maryland, USA, January 1995 | 
| The Janus Speech Recognizer | Ivica Rogina, Alex Waibel | Proceedings of the ARPA Spoken Language Technology Workshop 1995, Austin, Texas, USA, January 22-25, 1995 | 
| Multimodal Learning Interfaces | Minh Tue Vo, Ricky Houghton, Jie Yang, Udo Bub, Uwe Meier, Alex Waibel, Paul Duchnowski | Proceedings of the ARPA Spoken Language Technology Workshop 1995, Austin, Texas, USA, January 22-25, 1995 | 
| JANUS: Towards Multi-Lingual Spoken Language Translation | Bernhard Suhm, Petra Geutner, Thomas Kemp, Alon Lavie, Laura Mayfield, Arthur E. McNair, Ivica Rogina, Tanja Schultz, Tilo Sloboda, Wayne Ward, Monika Woszczyna, Alex Waibel | Proceedings of the ARPA Spoken Language Technology Workshop 1995, Austin, Texas, USA, January 22-25, 1995 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Connectionist Models In Multimodal Human-Computer Interaction | Alex Waibel, Paul Duchnowski | Proceedings of the Government Microcircuit Applications Conference, GOMAC 1994, San Diego, California, USA, November 7-10, 1994 | 
| Face Locating And Tracking For Human-Computer Interaction | Martin Hunke, Alex Waibel | Conference Record of the 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994, San Diego, California, USA, October 31 - November 2, 1994 | 
| Hybrid Connectionist and Classical Approaches in JANUS. An Advanced Speech-to-Speech Translation System | A. Waibel, T. S. Polzin, U. Bodenhausen, F. D. Buø, N. Coccaro, H. Hild, B. Suhm | Proceedings of the International Conference on Neural Information Processing, ICONIP 1994, Seoul, South Korea, October 17-20, 1994 | 
| The Use Of Dynamic Writing Information In A Connectionist On-Line Cursive Handwriting Recognition System | Stefan Manke, Michael Finke, Alex Waibel | Proceedings of the Conference on Neural Information Processing Systems, NIPS 1994, Denver, Colorado, USA, November 1994 | 
| Combining Bitmaps With Dynamic Writing Information For On-Line Handwriting Recognition | Stefan Manke, Michael Finke, Alex Waibel | Proceedings of the 12th International Conference on Pattern Recognition, ICPR 1994, Jerusalem, Israel, October 9-13, 1994 | 
| Improving Recognizer Acceptance Through Robust, Natural Speech Repair | Arthur E. McNair, Alex Waibel | Proceedings of the 3rd International Conference on Spoken Language Processing, ICSLP 1994, Yokohama, Japan, September 18-22, 1994 | 
| See Me, Hear Me: Integrating Automatic Speech Recognition And Lip-Reading | Paul Duchnowski, Uwe Meier, Alex Waibel | Proceedings of the 3rd International Conference on Spoken Language Processing, ICSLP 1994, Yokohama, Japan, September 18-22, 1994 | 
| Towards Better Language Models For Spontaneous Speech | Bernhard Suhm, Alex Waibel | Proceedings of the 3rd International Conference on Spoken Language Processing, ICSLP 1994, Yokohama, Japan, September 18-22, 1994 | 
| Inferring Linguistic Structure in Spoken Language | Monika Woszczyna, Alex Waibel | Proceedings of the 3rd International Conference on Spoken Language Processing, ICSLP 1994, Yokohama, Japan, September 18-22, 1994 | 
| Speech-Language Integration In A Multi-Lingual Speech Translation System | Bernhard Suhm, Lori Levin, Noah Coccaro, Jaime Carbonell, Keiko Horiguchi, Ryosuke Isotani, Alon Lavie, Laura Mayfield, Carolyn Rose, Carol Van Ess-Dykema, Alex Waibel | Proceedings of the 12th National Conference on Artificial Intelligence, AAAI 1994, Seattle, Washington, USA, July 31 - August 4, 1994 | 
| Recovering From Parser Failures: A Hybrid Statistical/Symbolic Approach | Carolyn Penstein Rosé, Alex Waibel | Proceedings of the ACL Workshop "The Balancing Act: Combining Symbolic and Statistical Approaches to Language", Las Cruces, New Mexico, USA, July 1, 1994 | 
| An Integrated Heuristic Scheme For Partial Parse Evaluation | Alon Lavie | Proceedings of the 32nd Annual Meeting of the Association for Computational Linguistics, ACL 1994, Las Cruces, New Mexico, USA, June 27-30, 1994 | 
| Learning State-Dependent Stream Weights For Multi-Codebook HMM Speech Recognition Systems | Ivica Rogina, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1994, Adelaide, Australia, April 19-22, 1994 | 
| Learning Complex Output Representations In Connectionist Parsing Of Spoken Language | Finn Dag Buø, Thomas Polzin, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1994, Adelaide, Australia, April 19-22, 1994 | 
| JANUS 93: Towards Spontaneous Speech Translation | M. Woszczyna, N. Aoki-Waibel, F. D. Buø, N. Coccaro, K. Horiguchi, T. Kemp, A. Lavie, A. McNair, T. Polzin, I. Rogina, C. P. Rose, T. Schultz, B. Suhm, M. Tomita, A. Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1994, Adelaide, Australia, April 19-22, 1994 | 
| Incremental Learning Using The Time Delay Neural Network | Minh Tue Vo | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1994, Adelaide, Australia, April 19-22, 1994 | 
| A Connectionist Recognizer For On-Line Cursive Handwriting Recognition | Stefan Manke, Ulrich Bodenhausen | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1994, Adelaide, Australia, April 19-22, 1994 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Multimodal Human-Computer Interaction | Minh Tue Vo, Alex Waibel | Proceedings of the International Symposium on Spoken Dialogue, ISSD 1993, Waseda, Japan, November 10-12, 1993 | 
| Tuning by Doing: Flexibility Through Automatic Structure Optimization | Ulrich Bodenhausen, Alex Waibel | Proceedings of the European Conference on Speech Communication and Technology, EUROSPEECH 1993, Berlin, Germany, September 22-25, 1993 | 
| Detection And Transcription Of New Words | Bernhard Suhm, Alex Waibel, Monika Woszczyna | Proceedings of the European Conference on Speech Communication and Technology, EUROSPEECH 1993, Berlin, Germany, September 22-25, 1993 | 
| Recent Advances In Janus: A Speech Translation System | M. Woszczyna, N. Coccaro, A. Eisele, A. Lavie, A. McNair, T. Polzin, I. Rogina, C. P. Rose, T. Sloboda, M. Tomita, J. Tsutsumi, N. Aoki-Waibel, A. Waibel, W. Ward | Proceedings of the European Conference on Speech Communication and Technology, EUROSPEECH 1993, Berlin, Germany, September 22-25, 1993 | 
| Speaker-Independent Connected Letter Recognition With A Multi-State Time Delay Neural Network | Hermann Hild, Alex Waibel | Proceedings of the European Conference on Speech Communication and Technology, EUROSPEECH 1993, Berlin, Germany, September 22-25, 1993 | 
| GLR* - An Efficient Noise-skipping Parsing Algorithm For Context Free Grammars | Alon Lavie, Masaru Tomita | Proceedings of the 3rd International Workshop on Parsing Technologies 1993, IWPT 1993, Tilburg, The Netherlands and Durbuy, Belgium, August 10-13, 1993 | 
| Frequency Estimation of Verb Subcategorization Frames Based on Syntactic and Multidimensional Statistical Analysis | Akira Ushioda, David A. Evans, Ted Gibson, Alex Waibel | Proceedings of the 3rd International Workshop on Parsing Technologies 1993, IWPT 1993, Tilburg, The Netherlands and Durbuy, Belgium, August 10-13, 1993 | 
| Flexibility Through Incremental Learning: Neural Networks For Text Categorization | P. Geutner, U. Bodenhausen, A. Waibel | Proceedings of the World Congress on Neural Networks, WCNN 1993, Portland, Oregon, USA, July 11-15, 1993 | 
| The Automatic Acquisition of Frequencies of Verb Subcategorization Frames from Tagged Corpora | Akira Ushioda, David A. Evans, Ted Gibson, Alex Waibel | Proceedings of the Workshop "Acquisition of Lexical Knowledge from Text" sponsored by the Association for Computational Linguistics (ACL), Columbus, Ohio, USA, June 1993 | 
| Improving the MS-TDNN for Word Spotting | Torsten Zeppenfeld, Rick Houghton, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1993, Minneapolis, Minnesota, USA, April 27-30, 1993 | 
| A Multi-Modal Human-Computer Interface: Combination of Gesture and Speech Recognition | Minh Tue Vo, Alex Waibel | Proceedings of the International Conference on Human-Computer Interaction (INTERACT), jointly organised with ACM Conference on Human Aspects in Computing Systems (CHI), INTERCHI 1993, Amsterdam, The Netherlands, April 24-29, 1993 | 
| Multi-Speaker/Speaker-Independent Architectures for the Multi-State Time Delay Neural Network | Hermann Hild, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1993, Minneapolis, Minnesota, USA, April 27-30, 1993 | 
| Improving Connected Letter Recognition By Lipreading | Christoph Bregler, Hermann Hild, Stefan Manke, Alex Waibel | Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1993, Minneapolis, Minnesota, USA, April 27-30, 1993 | 
| Bimodal Sensor Integration On The Example Of Speech-Reading | Christoph Bregler, Stefan Manke, Hermann Hild, Alex Waibel | Proceedings of the International Conference on Neural Networks, ICNN 1993, San Francisco, California, USA, March 28 - April 1, 1993 | 
| Application Oriented Automatic Structuring of Time-Delay Neural Networks for High Performance Character and Speech Recognition | Ulrich Bodenhausen, Alex Waibel | Proceedings of the International Conference on Neural Networks, ICNN 1993, San Francisco, California, USA, March 28 - April 1, 1993 | 
| Recent Advances in JANUS: A Speech Translation System | Thomas Polzin, Noah Coccaro, N. Aoki-Waibel, Monika Woszczyna, M. Tomita, J. Tsutsumi, Ivica Rogina, Carolyn Rose, Alex Waibel, Arthur McNair, Alon Lavie, A. Eisele, Tilo Sloboda, Wayne Ward | Proceedings of the Workshop on Human Language Technology, HLT 1993, Plainsboro, New Jersey, USA, March 21-24, 1993 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| PARSEC: A Structured Connectionist Parsing System for Spoken Language | Ajay N. Jain, Alex Waibel, David S. Touretzky | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992, San Francisco, California, USA, March 23-26, 1992 | 
| A Hybrid Neural Network, Dynamic Programming Word Spotter | Torsten Zeppenfeld, Alex H. Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992, San Francisco, California, USA, March 23-26, 1992 | 
| Testing Generality In Janus: A Multi-Lingual Speech Translation System | Louise Osterholtz, Charles Augustine, Arthur McNair, Ivica Rogina, Hiroaki Saito, Tilo Sloboda, Joe Tebelskis, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1992, San Francisco, California, USA, March 23-26, 1992 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Connectionist Approaches to Large Vocabulary Continuous Speech Recognition | Hidefumi Sawai, Yasuhiro Minami, Masanori Miyatake, Alex Waibel, Kiyohiro Shikano | ATR Interpreting Telephony Research Laboratories - IEICE Transactions on Communications Electronics Information and Systems | 
| Fast Back-Propagation Learning Methods For Neural Networks in Speech | P. Haffner, A. Waibel, K. Shikano | ATR Interpreting Telephony Research Laboratories | 
| Evaluation Of Speaker Independent Phoneme Recognition On TIMIT Database Using TDNNs | Nobuo Hataoka, Alexander H. Waibel | 2nd European Conference on Speech Communication and Technology, EUROSPEECH '91, Genova, Italy | 
| Continuous Speech Recognition with the Connectionist Viterbi Training Procedure: A Summary of Recent Work | Michael Franzini, Alex Waibel, Kai-Fu Lee | Proceedings of the IEEE International Joint Conference on Neural Networks, IJCNN 1991, Singapore, November 18-22, 1991 | 
| Connectionist Speaker Normalization and its Applications to Speech Recognition | X. D. Huang, K. F. Lee, A. Waibel | Proceedings of the IEEE Workshop on Neural Networks for Signal Processing, NNSP 1991, Princeton, New Jersey, USA, September 29 - October 2, 1991 | 
| Effectiveness of the Neural Fuzzy Training Method for Continuous Speech Recognition | K. Fukuzawa, Y. Komori, M. Sugiyama, S. Sagayama, A. Waibel | Proceedings of the Fall Meeting of the Acoustical Society of Japan, Japan, 1991 | 
| Time-Delay Neural Networks Embedding Time Alignment: A Performance Analysis | Patrick Haffner, Alex Waibel | Proceedings of the 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991, Genoa, Italy, September 24-26, 1991 | 
| Integrated Phoneme-Function Word Architecture of Hidden Control Neural Networks for Continuous Speech Recognition | Bojan Petek, Alex H. Waibel, Joseph M. Tebelskis | Proceedings of the 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991, Genoa, Italy, September 24-26, 1991 | 
| Recent Work in Continuous Speech Recognition using the Connectionist Viterbi Training Procedure | Michael A. Franzini, Alex H. Waibel, Kai-Fu Lee | Proceedings of the 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991, Genoa, Italy, September 24-26, 1991 | 
| Speaker-Independent Phoneme Recognition on TIMIT Database Using Integrated Time-Delay Neural Networks | Nobuo Hataoka, Alex H. Waibel | Proceedings of the 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991, Genoa, Italy, September 24-26, 1991 | 
| Evaluation of Speaker-Independent Phoneme Recognition on TIMIT Database Using TDNNs | Nobuo Hataoka, Alexander H. Waibel | Proceedings of the 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991, Genoa, Italy, September 24-26, 1991 | 
| An Improvement of TDNN-LR Continuous Speech Recognition System Using a Neural Fuzzy Training Approach | Yasuhiro Komori, Alex H. Waibel, Shigeki Sagayama | Proceedings of the IEICE workshop, SP91-24, pp. 49-56, June 1991 | 
| Review of TDNN (Time-Delay Neural Network) Architectures for Speech Recognition | Masahide Sugiyama, Hidehumi Sawai, Alexander H. Waibel | ATR Interpreting Telephony Research Laboratories | 
| JANUS: A Speech-to-Speech Translation System Using Connectionist and Symbolic Processing Strategies | Alex Waibel, Ajay N. Jain, Arthur E. McNair, Hiroaki Saito, Alexander G. Hauptmann, Joe Tebelskis | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1991, Toronto, Canada, May 14-17, 1991 | 
| Integrating Time Alignment and Neural Networks for High Performance Continuous Speech Recognition | Patrick Haffner, Michael Franzini, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1991, Toronto, Canada, May 14-17, 1991 | 
| Continuous Speech Recognition Using Linked Predictive Neural Networks | Joe Tebelskis, Alex Waibel, Bojan Petek, Otto Schmidbauer | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1991, Toronto, Canada, May 14-17, 1991 | 
| Learning the Architecture of Neural Networks for Speech Recognition | Ulrich Bodenhausen, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1991, Toronto, Canada, May 14-17, 1991 | 
| A Connectionist Model for Dialog Processing | Ye-Yi Wang, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1991, Toronto, Canada, May 14-17, 1991 | 
| A New Fuzzy Training Method for Phoneme Identification Neural Networks | Y. Komori, S. Sagayama, A. Waibel | Proceedings of the Spring Meeting of the Acoustical Society of Japan, Japan, 1991 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Speech Recognition Using Sub-Phoneme Recognition Neural Network | Kiyoaki Aikawa, Alexander H. Waibel | Proceedings of the 1st International Conference on Spoken Language Processing, ICSLP 1990, Kobe, Japan, November 18-22, 1990 | 
| Connectionist Large Vocabulary Speech Recognition | Alex Waibel | Proceedings of International Conference Commemorating the 30th Anniversary of the Information Processing Society of Japan (IPSJ), InfoJapan 1990, Japan, October 1990 | 
| Phoneme-Based Word Recognition by Neural Network - A Step Toward Large Vocabulary Recognition | Akihiro Hirai, Alexander Waibel | Proceedings of the International Joint Conference on Neural Networks, IJCNN 1990, San Diego, California, USA, June 17-21, 1990 | 
| Speaker-Independent Phoneme Recognition on TIMIT Database Using Integrated Time-Delay Neural Networks (TDNNs) | Nobuo Hataoka, Alex H. Waibel | Proceedings of the International Joint Conference on Neural Networks, IJCNN 1990, San Diego, California, USA, June 17-21, 1990 | 
| Large Vocabulary Recognition Using Linked Predictive Neural Networks | Joe Tebelskis, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1990, Albuquerque, New Mexico, USA, April 3-6, 1990 | 
| The Meta-Pi Network: Connectionist Rapid Adaptation for High-Performance Multi-Speaker Phoneme Recognition | John B. Hampshire II, Alex H. Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1990, Albuquerque, New Mexico, USA, April 3-6, 1990 | 
| Connectionist Viterbi Training: A new Hybrid Method for Continuous Speech Recognition | Michael Franzini, Kai-Fu Lee, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1990, Albuquerque, New Mexico, USA, April 3-6, 1990 | 
| Robust Connectionist Parsing of Spoken Language | Ajay N. Jain, Alex H. Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1990, Albuquerque, New Mexico, USA, April 3-6, 1990 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Phoneme-based Word Recognition by Neural Network - A step toward large vocabulary recognition | Akihiro Hirai, Alexander Waibel | ATR Interpreting Telephony Research Laboratories | 
| Connectionist Large Vocabulary Word Recognition | Alex Waibel | ATR Interpreting Telephony Research Laboratories, Technical Report TR-1-0120 | 
| A Preliminary Study On Spotting Japanese CV Syllables By Time Delay Neural Networks | Alexander Waibel | ATR Interpreting Telephony Research Laboratories - ASJ Fall Meeting, Hakata, Oct 1988 | 
| Connectionist Architectures for Multi-Speaker Phoneme Recognition | John B. Hampshire II, Alex Waibel | Advances in Neural Information Processing Systems 2 – NIPS | 
| Fast Back-Propagation Learning Methods for Large Phonemic Neural Networks | Patrick Haffner, Alex Waibel, Hidefumi Sawai, Kiyohiro Shikano | Proceedings of the 1st European Conference on Speech Technology, EUROSPEECH 1989, Paris, France, September 27-29, 1989 | 
| A Connectionist Parser Aimed at Spoken Language | Ajay Jain, Alex Waibel | Proceedings of the 1st International Workshop on Parsing Technologies, IWPT 1989, Pittsburgh, Pennsylvania, USA, August 28-31, 1989 | 
| A Novel Objective Function for Improved Phoneme Recognition Using Time-Delay Neural Networks | J. B. Hampshire II, A. H. Waibel | Proceedings of the International Joint Conference on Neural Networks, IJCNN 1989, Washington, D.C., USA, June 1989 | 
| Spotting Japanese CV-Syllables and Phonemes Using Time-Delay Neural Networks | Hidefumi Sawai, Alex Waibel, Masanori Miyatake, Kiyohiro Shikano | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1989, Glasgow, Scotland, May 23-26, 1989 | 
| Consonant Recognition by Modular Construction of Large Phonemic Time-Delay Neural Networks | Alex Waibel, Hidefumi Sawai, Kiyohiro Shikano | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1989, Glasgow, Scotland, May 23-26, 1989 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Neural Network Application to Speech Processing | K. Shikano, M. Nakamura, S. Tamura, A. Waibel | ATR Interpreting Telephony Research Laboratories - Journal of Acoustic Society of Japan | 
| Fast Back-Propagation Learning Methods For Neural Networks in Speech | P. Haffner, A. Waibel, K. Shikano | ATR Interpreting Telephony Research Laboratories - ASJ Fall Meeting, Hakata, Oct 1988 | 
| Modularity and Scaling in Large Phonemic Neural Networks | Alex Waibel, Hidefumi Sawai, Kiyohiro Shikano | ATR Interpreting Telephony Research Laboratories - IEEE Transactions on Acoustics, Speech and Signal Processing | 
| Speech Recognition Using Time-Delay Neural Networks - Invited Talk | Alex Waibel | ATR Interpreting Telephony Research Laboratories, Snowbird Conference | 
| Speech Recognition Research At ATR | Kiyohiro Shikano, Takeshi Kawabata, Alex Waibel, Kaichiro Hatazaki, Hidefumi Sawai, Satoshi Nakamura, Toshiyuki Hanazawa, Kenji Kita, Akira Kurematsu | ATR Interpreting Telephony Research Laboratories - 2nd Symposium on Advanced Man-Machine Interface through Spoken Language, Makaha, Hawaii, Nov 1988 | 
| Speech Recognition Using Time-Delay Neural Networks - Invited Talk | A. Waibel | ASA Spring Meeting | 
| Phoneme Recognition by Scaling up Modular Time-Delay Neural Networks | Hidefumi Sawai, Alex Waibel, Masanori Miyatake, Kiyohiro Shikano | ATR Interpreting Telephony Research Laboratories, SP88-105 | 
| Neural Network Applications To Speech Processing | Kiyohiro Shikano, Masami Nakamura, Shinichi Tamura, Alex Waibel | ATR Interpreting Telephony Research Laboratories - IEICE Special Seminar on Neural Networks, Osaka, Sep 1988 | 
| Modularity in Neural Networks for Speech Recognition | Alexander Waibel | IEEE Conference on NIPS | 
| Incremental Learning Of Large Phonetic Neural Networks From Smaller Subnets | Alexander Waibel | USA-Japan Joint Acoustical Society Meeting, Nov. 14, 1988 | 
| Phoneme Recognition by Scaling up Modular Time-Delay Neural Networks | Masanori Miyatake, Hidefumi Sawai, Kiyohiro Shikano, Alex Waibel | Meeting of the Institute of Electrical, Information and Communication Engineers (IEICE), December 1988 | 
| Fast Back-Propagation Learning Methods for Neural Networks in Speech | Patrick Haffner, Kiyohiro Shikano, Alex Waibel, Hidehumi Sawai | ATR Interpreting Telephony Research Laboratories, Technical Report TR-1-0058 | 
| DyNet, a Fast Program for Learning in Neural Networks | P. Haffner | ATR Interpreting Telephony Research Laboratories, ATR Technical Report, TR-1-0059 | 
| Phoneme Recognition by Modular Construction of Time-Delay Neural Networks | Alex Waibel, Hidefumi Sawai, Kiyohiro Shikano | Proceedings of the Fall Meeting of the Acoustical Society of Japan, October 1988 | 
| A Preliminary Study on Spotting Japanese CV-Syllables by Time-Delay Neural Networks | Hidefumi Sawai, Kiyohiro Shikano, Alex Waibel | Proceedings of the Fall Meeting of the Acoustical Society of Japan, October 1988 | 
| Connectionist Glue: Modular Design of Neural Speech Systems | Alex Waibel | Proceedings of the 1988 Connectionist Models Summer School, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, June 17-26, 1988 | 
| Noise Reduction Using Connectionist Models | Shin'ichi Tamura, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1988, New York, New York, USA, April 11-14, 1988 | 
| Phoneme Recognition: Neural Networks vs. Hidden Markov Models | T. Hanazawa, G. Hinton, K. Lang, K. Shikano, A. Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1988, New York, New York, USA, April 11-14, 1988 | 
| Noise Reduction through Waveform Input and Output Using Neural Networks | Shin'ichi Tamura, Alex Waibel | ATR Interpreting Telephony Research Laboratories, ASJ Spring Meeting, Tamagawa University | 
| Phoneme Recognition Using Time-Delay Neural Networks | Alex Waibel | Proceedings of the first Symposium on Advanced Man-Machine Interface Through Spoken Language, Gakushikaikan, Tokyo, Japan, January 1988 | 
| Noise Reduction by Neural Networks | Shin'ichi Tamura, Alex Waibel | Meeting of the Institute of Electrical, Information and Communication Engineers (IEICE), Osaka, Japan, January 1988 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Phoneme Recognition Using Time-Delay Neural Networks | Alex Waibel | Meeting of the Institute of Electrical, Information and Communication Engineers (IEICE), Tokyo, Japan, December 1987 | 
| Noise Reduction Using Neural Networks | Shin'ichi Tamura, Alex Waibel | ATR Interpreting Telephony Research Laboratories, IEICE Technical Report SP87-112 | 
| Phoneme Recognition Using Time-Delay Neural Networks | A. Waibel, T. Hanazawa, G. Hinton, K. Shikano, K. Lang | ATR Interpreting Telephony Research Laboratories, ATR Technical Report, TR-I-0006, October 30, 1987 | 
| Learned Phonetic Discrimination Using Connectionist Networks | R. L. Watrous, L. Shastri, A. H. Waibel | Proceedings of the European Conference on Speech Technology, Edinburgh, Great Britain, September 1987 | 
| Prosodic Knowledge Sources for Word Hypothesization in a Continuous Speech Recognition System | Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1987, Dallas, Texas, USA, April 6-9, 1987 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Recognition of Lexical Stress in a Continuous Speech Understanding System - A Pattern Recognition Approach | Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1986, Tokyo, Japan, April 7-11, 1986 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| A Coarse Phonetic Knowledge Source for Template Independent Large Vocabulary Word Recognition | Helmut Lagger, Alex Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, Tampa, Florida, USA, March 26-29, 1985 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Suprasegmentals in Very Large Vocabulary Isolated Word Recognition | A. Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1984, San Diego, California, USA, March 19-21, 1984 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Comparative Study of Nonlinear Time Warping Techniques in Isolated Word Speech Recognition Systems | A. Waibel, B. Yegnanarayana | IEEE Transactions on Acoustics, Speech and Signal Processing, Volume 31, Number 6, pgs. 1582-1586, December 1983 | 
| Title of Paper | Author | Conference | 
|---|---|---|
| Performance Trade-Offs in Search Techniques for Isolated Word Speech Recognition | R. Bisiani, A. Waibel | Proceedings of the International Conference on Acoustics, Speech and Signal Processing, ICASSP 1982, Paris, France, May 3-5, 1982 | 
