Journals                 

 

Publikationsliste


2025
Episodic Memory Verbalization Using Hierarchical Representations of Life-Long Robot Experience
Bärmann, L.; DeChant, C.; Plewnia, J.; Peller-Konrad, F.; Bauer, D.; Asfour, T.; Waibel, A. H.
2025. 2025 IEEE-RAS 24th International Conference on Humanoid Robots (Humanoids), Seoul, 30th September - 2nd October 2025, 783–790, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/Humanoids65713.2025.11203101
KIT’s Offline Speech Translation and Instruction Following Submission for IWSLT 2025
Koneru, S.; Züfle, M.; Binh Nguyen, T.; Akti, S.; Niehues, J.; Waibel, A.
2025. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025). Ed.: E. Salesky, 232–244, Association for Computational Linguistics (ACL). doi:10.18653/v1/2025.iwslt-1.22
Factorized-VITS: Decoupling Prosody and Text in End-to-End Speech Synthesis without External or Secondary Aligner
Liu, Y.; Waibel, A.
2025. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP49660.2025.10890003
Improving Pronunciation and Accent Conversion through Knowledge Distillation And Synthetic Ground-Truth from Native TTS
Nguyen, T. N.; Akti, S.; Pham, N. Q.; Waibel, A.
2025. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP49660.2025.10890229
PIER: A Novel Metric for Evaluating What Matters in Code-Switching
Ugan, E. Y.; Pham, N.-Q.; Bärmann, L.; Waibel, A.
2025. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP49660.2025.10889660
MSA-ASR: Efficient Multilingual Speaker Attribution with frozen ASR Models
Nguyen, T.-B.; Waibel, A.
2025. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP49660.2025.10889116
Cocktail-Party Audio-Visual Speech Recognition
Nguyen, T.-B.; Pham, N.-Q.; Waibel, A.
2025. nterspeech 2025, 17-21 August 2025, Rotterdam, The Netherlands, 1828–1832, ISCA. doi:10.21437/Interspeech.2025-676
Weight Factorization and Centralization for Continual Learning in Speech Recognition
Ugan, E. Y.; Pham, N.-Q.; Waibel, A.
2025. Interspeech 2025, 17-21 August 2025, Rotterdam, The Netherlands, 2200–2204, ISCA. doi:10.21437/Interspeech.2025-1701
Streaming Non-Autoregressive Model for Accent Conversion and Pronunciation Improvement
Nguyen, T.-N.; Pham, N.-Q.; Akti, Ş.; Waibel, A.
2025. nterspeech 2025, 17-21 August 2025, Rotterdam, The Netherlands, 4163–4167, ISCA. doi:10.21437/Interspeech.2025-1403
Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion
Akti, Ş.; Nguyen, T.-N.; Waibel, A.
2025. Interspeech 2025, 17-21 August 2025, Rotterdam, The Netherlands, 1358–1362, ISCA. doi:10.21437/Interspeech.2025-815
Summarizing Speech: A Comprehensive Survey
Retkowski, F.; Züfle, M.; Sudmann, A.; Pfau, D.; Watanabe, S.; Niehues, J.; Waibel, A.
2025. C. Christodoulopoulos, T. Chakraborty, C. Rose & V. Peng (Eds.), Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing. Hrsg.: Christodoulopoulos, Christos; Chakraborty, Tanmoy; Rose, Carolyn; Peng, Violet, 27263–27294, Association for Computational Linguistics (ACL). doi:10.18653/v1/2025.emnlp-main.1388
Findings of the IWSLT 2025 Evaluation Campaign
Abdulmumin, I.; Agostinelli, V.; Alumäe, T.; Anastasopoulos, A.; Bentivogli, L.; Bojar, O.; Borg, C.; Bougares, F.; Cattoni, R.; Cettolo, M.; Chen, L.; Chen, W.; Dabre, R.; Estève, Y.; Federico, M.; Fishel, M.; Gaido, M.; Javorský, D.; Kasztelnik, M.; Kponou, F.; Krubiński, M.; Kin Lam, T.; Liu, D.; Matusov, E.; Kumar Maurya, C.; P. McCrae, J.; Mdhaffar, S.; Moslem, Y.; Murray, K.; Nakamura, S.; Negri, M.; Niehues, J.; Kr. Ojha, A.; Ortega, J. E.; Papi, S.; Pecina, P.; Polák, P.; Połeć, P.; Sankar, A.; Savoldi, B.; Sethiya, N.; Sikasote, C.; Sperber, M.; Stüker, S.; Sudoh, K.; Thompson, B.; Turchi, M.; Waibel, A.; Wilken, P.; Zevallos, R.; Zouhar, V.; Züfle, M.
2025. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025), Hrsg.: Salesky, Elizabeth; Federico, Marcello; Anastasopoulos, Antonis, 412–481, Association for Computational Linguistics (ACL). doi:10.18653/v1/2025.iwslt-1.44
KIT’s Low-resource Speech Translation Systems for IWSLT2025: System Enhancement with Synthetic Data and Model Regularization
Li, Z.; Liu, Y.; Liu, D.; Nam Nguyen, T.; Yavuz Ugan, E.; Anh Dinh, T.; Mullov, C.; Waibel, A.; Niehues, J.
2025. Proceedings of the 22nd International Conference on Spoken Language Translation (IWSLT 2025), 212–221, Association for Computational Linguistics (ACL). doi:10.18653/v1/2025.iwslt-1.20
Continuously Learning New Words in Automatic Speech Recognition
Huber, C.; Waibel, A.
2025. ICASSP 2025 - 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 1–5, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP49660.2025.10889216
From Speech to Summary: A Comprehensive Survey of Speech Summarization
Retkowski, F.; Züfle, M.; Sudmann, A.; Pfau, D.; Niehues, J.; Waibel, A.
2025. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000180972
2024
Decoupled Vocabulary Learning Enables Zero-Shot Translation from Unseen Languages
Mullov, C.; Pham, Q.; Waibel, A.
2024. Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), Ed.: L. Ku, A. Martins, V. Srikumar, 6693–6709, Association for Computational Linguistics (ACL). doi:10.18653/v1/2024.acl-long.362
ConVoiFilter: A Case Study of Doing Cocktail Party Speech Recognition
Nguyen, T.-B.; Waibel, A.
2024. 2024 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW) : Proceedings, Seoul, 14th-19th April 2024, 565 – 569, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSPW62465.2024.10626098
Audio-Visual Speech Representation Expert for Enhanced Talking Face Video Generation and Evaluation
Yaman, D.; Eyiokur, F. I.; Bärmann, L.; Aktı, S.; Ekenel, H. K.; Waibel, A.
2024. 2024 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 6003 – 6013, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/CVPRW63382.2024.00607
DECM: Evaluating Bilingual ASR Performance on a Code-switching/mixing Benchmark
Ugan, E. Y.; Pham, N.-Q.; Waibel, A.
2024. 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation, LREC-COLING 2024 - Main Conference Proceedings, Hybrid, Torino, 20th-25th May 2024, 4468 – 4475, European Language Resources Association (ELRA)
Synthetic Conversations Improve Multi-Talker ASR
Nguyen, T.-B.; Waibel, A.
2024. ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Seoul, 14th-19th April 2024, 10461–10465, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP48485.2024.10446589
Incremental learning of humanoid robot behavior from natural interaction and large language models
Bärmann, L.; Kartmann, R.; Peller-Konrad, F.; Niehues, J.; Waibel, A.; Asfour, T.
2024. Frontiers in Robotics and AI, 11, Art.-Nr.: 1455375. doi:10.3389/frobt.2024.1455375
SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
Dinh, T. A.; Mullov, C.; Bärmann, L.; Li, Z.; Liu, D.; Reiß, S.; Lee, J.; Lerzer, N.; Ternava, F.; Gao, J.; Röddiger, T.; Waibel, A.; Asfour, T.; Beigl, M.; Stiefelhagen, R.; Dachsbacher, C.; Böhm, K.; Niehues, J.
2024. arxiv. doi:10.48550/arXiv.2406.10421
SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading
Dinh, T. A.; Mullov, C.; Bärmann, L.; Li, Z.; Liu, D.; Reiß, S.; Lee, J.; Lerzer, N.; Gao, J.; Peller-Konrad, F.; Röddiger, T.; Waibel, A.; Asfour, T.; Beigl, M.; Stiefelhagen, R.; Dachsbacher, C.; Böhm, K.; Niehues, J.
2024. Y. Al-Onaizan, M. Bansal & Y.-N. Chen (Eds.), Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, Miami, 12th-16th November 2024, Hrsg.: Al-Onaizan, Y., Bansal, M., Chen, Y.-N., 11592–11610, Association for Computational Linguistics (ACL). doi:10.18653/v1/2024.emnlp-main.647
Charles Locock, Lowcock or Lockhart? Offline Speech Translation: Test Suite for Named Entities
Awiszus, M.; Niehues, J.; Turchi, M.; Stüker, S.; Waibel, A.
2024. roceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024), 15th-16th August 2024, 339 – 345, Association for Computational Linguistics (ACL)
Blending LLMs into Cascaded Speech Translation: KIT’s Offline Speech Translation System for IWSLT 2024
Koneru, S.; Binh Nguyen, T.; Pham, N.-Q.; Liu, D.; Li, Z.; Waibel, A.; Niehues, J.
2024. Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024). Ed.: Elizabeth Salesky, Marcello Federico, Marine Carpuat, 183–191, Association for Computational Linguistics (ACL). doi:10.18653/v1/2024.iwslt-1.24
The KIT Speech Translation Systems for IWSLT 2024 Dialectal and Low-resource Track
Li, Z.; Ugan, E. Y.; Liu, D.; Mullov, C.; Dinh, T. A.; Koneru, S.; Waibel, A.; Niehues, J.
2024. Proceedings of the 21st International Conference on Spoken Language Translation (IWSLT 2024). Ed.: E. Salesky, 221–228, Association for Computational Linguistics (ACL). doi:10.18653/v1/2024.iwslt-1.27
2023
Continually learning new languages. PhD dissertation
Pham, N. Q.
2023, December 1. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000164125
Incremental Learning of Humanoid Robot Behavior from Natural Interaction and Large Language Models
Bärmann, L.; Kartmann, R.; Peller-Konrad, F.; Waibel, A.; Asfour, T.
2023, October 21. 7th Annual Conference on Robot Learning (CoRL 2023), Atlanta, GA, USA, November 6–9, 2023
Multimodal Error Correction with Natural Language and Pointing Gestures
Constantin, S.; Eyiokur, F. I.; Yaman, D.; Bärmann, L.; Waibel, A.
2023. 2023 IEEE/CVF International Conference on Computer Vision (ICCV), 1968–1979, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICCVW60793.2023.00212
Towards continually learning new languages
Pham, N. Q.; Niehues, J.; Waibel, A.
2023. INTERSPEECH 2023, Dublin, 20th-24th August 2023, 3262–3266, International Society for Computers and Their Applications (ISCA)
KIT’s Multilingual Speech Translation System for IWSLT 2023
Liu, D.; Nguyen, T. B.; Koneru, S.; Yavuz Ugan, E.; Pham, N.-Q.; Nguyen, T. N.; Dinh, T. A.; Mullov, C.; Waibel, A.; Niehues, J.
2023. Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023). Ed.: E. Salesky, 113–122, Association for Computational Linguistics (ACL)
SYNTACC : Synthesizing Multi-Accent Speech By Weight Factorization
Nguyen, T.-N.; Pham, N.-Q.; Waibel, A.
2023. ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Rhodos, 04th-10th June 2023, 1–5, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP49357.2023.10096431
A survey on computer vision based human analysis in the COVID-19 era
Eyiokur, F. I.; Kantarcı, A.; Erakın, M. E.; Damer, N.; Ofli, F.; Imran, M.; Križaj, J.; Salah, A. A.; Waibel, A.; Štruc, V.; Ekenel, H. K.
2023. Image and Vision Computing, 130, Art.-Nr.: 104610. doi:10.1016/j.imavis.2022.104610
End-to-End Evaluation for Low-Latency Simultaneous Speech Translation
Huber, C.; Dinh, T. A.; Mullov, C.; Pham, N.-Q.; Nguyen, T. B.; Retkowski, F.; Constantin, S.; Ugan, E.; Liu, D.; Li, Z.; Koneru, S.; Niehues, J.; Waibel, A.
2023. Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: System Demonstrations. Ed.: Yansong Feng, Els Lefever, 12–20, Association for Computational Linguistics (ACL). doi:10.18653/v1/2023.emnlp-demo.2
Train Global, Tailor Local: Minimalist Multilingual Translation into Endangered Languages
Zhou, Z.; Niehues, J.; Waibel, A.
2023. 6th Workshop on Technologies for Machine Translation of Low-Resource Languages, LoResMT 2023 - Proceedings. Ed.: Atul Kr. Ojha, 1 – 15, Association for Computational Linguistics (ACL)
Towards continually learning new languages
Pham, Q.; Niehues, J.; Waibel, A.
2023. Proc. INTERSPEECH 2023, 3262–3266, ISCA. doi:10.21437/Interspeech.2023-1867
Face-Dubbing++: LIP-Synchronous, Voice Preserving Translation Of Videos
Waibel, A.; Behr, M.; Yaman, D.; Eyiokur, F. I.; Nguyen, T.-N.; Mullov, C.; Demirtas, M. A.; Kantarci, A.; Constantin, S.; Ekenel, H. K.
2023. 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), 1–5, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSPW59220.2023.10193719
Towards Efficient Simultaneous Speech Translation: CUNI-KIT System for Simultaneous Track at IWSLT 2023
Polak, P.; Liu, D.; Pham, N.-Q.; Niehues, J.; Waibel, A.; Bojar, O.
2023. The 20th International Conference on Spoken Language Translation : Proceedings of the Conference. Ed.: E. Salesky, 389–396, Association for Computational Linguistics (ACL)
Findings of the IWSLT 2023 Evaluation Campaign
Agarwal, M.; Agrawal, S.; Anastasopoulos, A.; Bentivogli, L.; Bojar, O.; Borg, C.; Carpuat, M.; Cattoni, R.; Cettolo, M.; Chen, M.; Chen, W.; Choukri, K.; Chronopoulou, A.; Currey, A.; Declerck, T.; Dong, Q.; Duh, K.; Estève, Y.; Federico, M.; Gahbiche, S.; Haddow, B.; Hsu, B.; Mon Htut, P.; Inaguma, H.; Javorský, D.; Judge, J.; Kano, Y.; Ko, T.; Kumar, R.; Li, P.; Ma, X.; Mathur, P.; Matusov, E.; McNamee, P.; P. McCrae, J.; Murray, K.; Nadejde, M.; Nakamura, S.; Negri, M.; Nguyen, H.; Niehues, J.; Niu, X.; Kr. Ojha, A.; E. Ortega, J.; Pal, P.; Pino, J.; Plas, L. van der; Polák, P.; Rippeth, E.; Salesky, E.; Shi, J.; Sperber, M.; Stüker, S.; Sudoh, K.; Tang, Y.; Thompson, B.; Tran, K.; Turchi, M.; Waibel, A.; Wang, M.; Watanabe, S.; Zevallos, R.
2023. Proceedings of the 20th International Conference on Spoken Language Translation (IWSLT 2023), 1–61, Association for Computational Linguistics (ACL)
Train Global, Tailor Local: Minimalist Multilingual Translation into Endangered Languages
Zhou, Z.; Niehues, J.; Waibel, A.
2023. The Sixth Workshop on Technologies for Machine Translation of Low-Resource Languages (LoResMT 2023) : Proceedings of the Workshop. Ed.: A. K. Ojha, 1–15, Association for Computational Linguistics (ACL). doi:10.18653/v1/2023.loresmt-1.1
Interactive Multimodal Robot Dialog Using Pointing Gesture Recognition
Constantin, S.; Eyiokur, F. I.; Yaman, D.; Bärmann, L.; Waibel, A.
2023. Computer Vision – ECCV 2022 Workshops. Tel Aviv, Israel, October 23-27, 2022, Proceedings, Part V, Ed.: L. Karlinsky, 640–657, Springer Nature Switzerland. doi:10.1007/978-3-031-25075-0_43
2022
Exposure Correction Model to Enhance Image Quality
Eyiokur, F. I.; Yaman, D.; Ekenel, H. K.; Waibel, A.
2022. Proceedings 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW): New Orleans, Louisiana, 19–24 June 2022, 676–686, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/CVPRW56347.2022.00083
Alpha Matte Generation from Single Input for Portrait Matting
Yaman, D.; Ekenel, H. K.; Waibel, A.
2022. Proceedings 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW): New Orleans, Louisiana, 19–24 June 2022, 696–705, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/CVPRW56347.2022.00085
Face-Dubbing++: LIP-Synchronous, Voice Preserving Translation Of Videos
Waibel, A.; Behr, M.; Yaman, D.; Eyiokur, F. I.; Nguyen, T.-N.; Mullov, C.; Demirtas, M. A.; Kantarci, A.; Constantin, S.; Ekenel, H. K.
2022. doi:10.48550/arXiv.2206.04523
Error correction and extraction in request dialogs
Constantin, S.; Waibel, A.
2022. Proceedings of the 5th International Conference on Natural Language and Speech Processing (ICNLSP 2022), 2–11, Association for Computational Linguistics (ACL)
A survey on computer vision based human analysis in the COVID-19 era
Eyiokur, F. I.; Kantarcı, A.; Erakın, M. E.; Damer, N.; Ofli, F.; Imran, M.; Križaj, J.; Salah, A. A.; Waibel, A.; Štruc, V.; Ekenel, H. K.
2022. doi:10.48550/arXiv.2211.03705
Machine Translation from Standard German to Alemannic Dialects
Lambrecht, L.; Schneider, F.; Waibel, A.
2022. Proceedings of the 1st Annual Meeting of the ELRA/ISCA Special Interest Group on Under-Resourced Languages, 129–136, Association for Computational Linguistics (ACL)
Accent Conversion using Pre-trained Model and Synthesized Data from Voice Conversion
Nguyen, T. N.; Pham, N.-Q.; Waibel, A.
2022. Proc. Interspeech 2022, 2583–2587, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2022-10729
Where did I leave my keys? Episodic-Memory-Based Question Answering on Egocentric Videos
Barmann, L.; Waibel, A.
2022. Proceedings 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW): New Orleans, Louisiana, 19–24 June 2022, 1559–1567, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/CVPRW56347.2022.00162
Adaptive multilingual speech recognition with pretrained models
Pham, N.-Q.; Waibel, A.; Niehues, J.
2022. Interspeech 2022, 3879–3883, ISCA. doi:10.21437/Interspeech.2022-872
CUNI-KIT System for Simultaneous Speech Translation Task at IWSLT 2022
Polák, P.; Pham, N.-Q.; Nguyen, T. N.; Liu, D.; Mullov, C.; Niehues, J.; Bojar, O.; Waibel, A.
2022. Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022). Ed.: E. Salesky, 277–285, Association for Computational Linguistics (ACL). doi:10.18653/v1/2022.iwslt-1.24
Effective combination of pretrained models - KIT@IWSLT2022
Pham, N.-Q.; Nguyen, T. N.; Nguyen, T.-B.; Liu, D.; Mullov, C.; Niehues, J.; Waibel, A.
2022. Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022). Ed.: E. Salesky, 190–197, Association for Computational Linguistics (ACL). doi:10.18653/v1/2022.iwslt-1.14
Findings of the IWSLT 2022 Evaluation Campaign
Anastasopoulos, A.; Barrault, L.; Bentivogli, L.; Zanon Boito, M.; Bojar, O.; Cattoni, R.; Currey, A.; Dinu, G.; Duh, K.; Elbayad, M.; Emmanuel, C.; Estève, Y.; Federico, M.; Federmann, C.; Gahbiche, S.; Gong, H.; Grundkiewicz, R.; Haddow, B.; Hsu, B.; Javorský, D.; Kloudová, V.; Lakew, S.; Ma, X.; Mathur, P.; McNamee, P.; Murray, K.; Nǎdejde, M.; Nakamura, S.; Negri, M.; Niehues, J.; Niu, X.; Ortega, J.; Pino, J.; Salesky, E.; Shi, J.; Sperber, M.; Stüker, S.; Sudoh, K.; Turchi, M.; Virkar, Y.; Waibel, A.; Wang, C.; Watanabe, S.
2022. Proceedings of the 19th International Conference on Spoken Language Translation (IWSLT 2022). Ed.: E. Salesky, 98–157, Association for Computational Linguistics (ACL). doi:10.18653/v1/2022.iwslt-1.10
2021
Text and Synthetic Data for Domain Adaptation in End-to-End Speech Recognition
Hussain, J.; Huber, C.; Stüker, S.; Waibel, A.
2021. Speech and Computer: 23rd International Conference, SPECOM 2021, St. Petersburg, Russia, September 27–30, 2021, Proceedings. Ed.: A. Karpov, 271–278, Springer-Verlag. doi:10.1007/978-3-030-87802-3_25
Deep Episodic Memory for Verbalization of Robot Experience
Barmann, L.; Peller-Konrad, F.; Constantin, S.; Asfour, T.; Waibel, A.
2021. IEEE Robotics and Automation Letters, 6 (3), 5808–5815. doi:10.1109/LRA.2021.3085166
Adapting Automatic Speech Recognition for Foreign Language Learners in a Serious Game
Winebarger, J.; Stüker, S.; Waibel, A.
2021. AIIDE Workshop Games and Natural Language Processing, 10. AAAI Conference on Artificial Intelligence and Interactive Digital Entertainment, Raleigh; NC, United States; 3 - 7 October 2014. Ed.: N. Tomuro, 38–40, AAAI Press
High Performance Neural Networks for Online Speech Recognizer. PhD dissertation
Nguyen, T.-S.
2021, February 2. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000128854
Deep Identification of Arabic Dialects. bachelor’s thesis
Mousa, A.
2021. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166144
Cross-lingual Transfer Learning for Low-Resource Natural Language Processing Tasks. master’s thesis
Wang, J.
2021. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166142
Multilingual Sequence-To-Sequence Speech Recognition. master’s thesis
Gremmelmaier, H.
2021. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166138
Long-Term Compressive Memory Transformer for Encoding and Verbalizing Robot Experiences. master’s thesis
Bärmann, L.
2021. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166136
Code-Switching using Language-Agnostic Speech Modeling. master’s thesis
Ugan, E. Y.
2021. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166135
Phoneme classification and alignment through recognition on TIMIT. bachelor’s thesis
Schilpp, L.
2021. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166134
Fine-Grained Prosody Control in Neural TTS Systems. bachelor’s thesis
Behr, M.
2021. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166133
Instant One-Shot Word-Learning for Context-Specific Neural Sequence-to-Sequence Speech Recognition
Huber, C.; Hussain, J.; Stüker, S.; Waibel, A.
2021. 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Cartagena, Colombia, 12-17 December 2021, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU51503.2021.9687898
KIT’s IWSLT 2021 Offline Speech Translation System
Nguyen, T. N.; Huber, C.; Awiszus, M.; Pham, N.-Q.; Ha, T.-L.; Schneider, F.; Stüker, S.; Waibel, A.; Nguyen, T.-S.
2021. Proceedings of the 18th International Conference on Spoken Language Translation. Ed.: M. Federico, A. Waibel, M. R. Costa-jussà, J. Niehues, S. Stuker, E. Salesky, 125–130, Association for Computational Linguistics (ACL). doi:10.18653/v1/2021.iwslt-1.13
Multilingual Speech Translation KIT @ IWSLT2021
Pham, N.-Q.; Ha, T.-L.; Stüker, S.; Waibel, A.; He, D.; Nguyen, T. N.
2021. Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021). Ed.: M. Federico, A. Waibel, M. R. Costa-jussà, J. Niehues, S. Stuker, E. Salesky, 154–159, Association for Computational Linguistics (ACL). doi:10.18653/v1/2021.iwslt-1.18
Cross-lingual, Language-independent Phoneme Alignment. bachelor’s thesis
Bühler, N.; Waibel, A.; Asfour, T.
2021. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166128
Value-Based Reinforcement Learning for Sequence-to-Sequence Models
Retkowski, F.; Waibel, A.
2021. Adaptive and Learning Agent Workshop at AAMAS
Findings of the IWSLT 2021 Evaluation campaign
Anastasopoulos, A.; Bojar, O.; Bremerman, J.; Cattoni, R.; Elbayad, M.; Federico, M.; Ma, X.; Nakamura, S.; Negri, M.; Niehues, J.; Pino, J.; Salesky, E.; Stüker, S.; Sudoh, K.; Turchi, M.; Waibel, A.; Wang, C.; Wiesner, M.
2021. The 18th International Conference on Spoken Language Translation - proceedings of the conference : August 5-6, 2021, Bangkok, Thailand (online) : IWSLT 2021. Ed.: M. Federico, 1–29. doi:10.18653/v1/2021.iwslt-1.1
Efficientweight factorization for multilingual speech recognition
Pham, N.-Q.; Nguyen, T.-N.; Stueker, S.; Waibel, A.
2021. 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) : Brno, Czech Republic, 30 August-3 September 2021, 386–390, Curran. doi:10.21437/Interspeech.2021-216
Super-human performance in online low-latency recognition of conversational speech
Nguyen, T.-S.; Stüker, S.; Waibel, A.
2021. 22nd Annual Conference of the International Speech Communication Association (INTERSPEECH 2021) : Brno, Czech Republic, 30 August-3 September 2021, 4131–4135, Curran. doi:10.21437/Interspeech.2021-1114
ELITR multilingual live subtitling: Demo and strategy
Bojar, O.; Macháček, D.; Sagar, S.; Smrž, O.; Kratochvíl, J.; Polák, P.; Ansari, E.; Mahmoudi, M.; Kumar, R.; Franceschini, D.; Canton, C.; Simonini, I.; Nguyen, T.-S.; Schneider, F.; Stüker, S.; Waibel, A.; Haddow, B.; Sennrich, R.; Williams, P.
2021. Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations, 271–277
2020
Findings of the IWSLT 2020 Evaluation campaign
Ansari, E.; Axelrod, A.; Bach, N.; Bojar, O.; Cattoni, R.; Dalvi, F.; Durrani, N.; Federico, M.; Federmann, C.; Gu, J.; Huang, F.; Knight, K.; Ma, X.; Nagesh, A.; Negri, M.; Niehues, J.; Pino, J.; Salesky, E.; Shi, X.; Stüker, S.; Turchi, M.; Waibel, A.; Wang, C.
2020. IWSLT 2020 : The 17th International Conference on Spoken Language Translation, Seattle - colocated with ACL 2020, Seattle, USA, July 9 - 10, 2020. Ed.: M. Federico, 1–34, Association for Computational Linguistics (ACL)
Towards Stream Translation: Adaptive Computation Time for Simultaneous Machine Translation
Schneider, F.; Waibel, A.
2020. Proceedings of the 17th International Conference on Spoken Language Translation. Ed.: A. Waibel, 228–236, Association for Computational Linguistics (ACL). doi:10.18653/v1/2020.iwslt-1.28
Improving Sequence-To-Sequence Speech Recognition Training with On-The-Fly Data Augmentation
Nguyen, T.-S.; Stueker, S.; Niehues, J.; Waibel, A.
2020. ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Barcelona, 4th-8th May 2020, 7689–7693, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP40776.2020.9054130
Gun source and muzzle head detection
Zhou, Z.; Etinger, I. C.; Metze, F.; Hauptmann, A.; Waibel, A.
2020. 2020 Imaging and Multimedia Analytics in a Web and Mobile World Conference, IMAWM 2020, Hyatt Regency San Francisco Airport, Burlingame, United States, 26 - 30 January 2020, 187–1, Society for Imaging Science and Technology (IS&T). doi:10.2352/ISSN.2470-1173.2020.8.IMAWM-187
Revised Speech Chain Loop featuring Quality Estimation. bachelor’s thesis
Wirth, O.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166156
Towards Diversity and Relevance in Neural Natural Language Response Generation. master’s thesis
Handloser, D.; Waibel, A.; Asfour, T.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166154
Using Scene-Aware Voice Dialogs in Human-Drone Interaction. master’s thesis
Fuhrmann, T.; Waibel, A.; Scherer, U.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166153
A Knowledge-grounded Conversation Model with Word-level Incorporation. master’s thesis
Mingyu, Z.; Waibel, A.; Asfour, T.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166152
Reinforcement Learning for Sequence-to-Sequence Dialogue Systems. master’s thesis
Retkowski, F.; Waibel; Asfour; Pham, N.-Q.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166150
Supervised Adaptation of Sequence-to-Sequence Speech Recognition Systems using Batch-Weighting
Huber, C.; Nguyen, T.-N.; Song, K.; Stüker, S.; Hussain, J.; Waibel, A.
2020. Proceedings of the 2nd Workshop on Life-long Learning for Spoken Language Systems. Ed.: W. M. Campbell, A. Waibel, D. Hakkani-Tur, T. J. Hazen, K. Kilgour, E. Cho, V. Kumar, H. Glaude, 9–17, Association for Computational Linguistics (ACL)
Robust Voice Activity Detection in the Presence of Music using Neural Networks. master’s thesis
Beeking, M.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166147
Sample-Incremental Meta-Learning: Weight-Mapping with Deep Learning. master’s thesis
Huber, C.; Waibel, A.; Thäter, G.; Hussain, J.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166145
German-Arabic Speech-to-Speech Translation for Psychiatric Diagnosis
Hussain, J.; Behr, M.; Cheragui, M. A.; Stüker, S.; Waibel, A.; Mediani, M.
2020. Proceedings of the Fifth Arabic Natural Language Processing Workshop. Ed.: I. Zitouni, M. Abdul-Mageed, H. Bouamor, F. Bougares, M. El-Haj, N. Tomeh, W. Zaghouani, Association for Computational Linguistics (ACL)
Improvement of the Translation of Named Entities in Neural Machine Translation. master’s thesis
Modrzejewski, M.; Waibel, A.; Asfour, T.; Ha, T.-L.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166141
CAGAN: Text-To-Image Generation with Combined Attention Generative Adversarial Networks. master’s thesis
Schulze, H.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166140
KIT’s IWSLT 2020 SLT Translation System
Pham, N.-Q.; Nguyen, T.-N.; Ha, T.-L.; Nguyen, T.-S.; Awiszus, M.; Stüker, S.; Waibel, A.; Schneider, F.
2020. Proceedings of the 17th International Conference on Spoken Language Translation (IWSLT 2020). Ed.: M. Federico, A. Waibel, K. Knight, S. Nakamura, H. Ney, J. Niehues, S. Stüker, D. Wu, J. Mariani, F. Yvon, Association for Computational Linguistics (ACL). doi:10.18653/v1/2020.iwslt-1.4
Relative Positional Encoding for Speech Recognition and Direct Translation
Pham, N.-Q.; Ha, T.-L.; Nguyen, T.-N.; Nguyen, T.-S.; Salesky, E.; Stüker, S.; Niehues, J.; Waibel, A.
2020. Cognitive intelligence for speech processing : 21st Annual Conference of the International Speech Communication Association (INTERSPEECH 2020) : held online due to Covid-19 : Shanghai, China, 25-29 October 2020, 31–35, Red Hook. doi:10.21437/Interspeech.2020-2526
High Performance Sequence-to-Sequence Model for Streaming Speech Recognition
Nguyen, T.-S.; Pham, N.-Q.; Stüker, S.; Waibel, A.
2020. Proceedings of the 21st Annual Conference of the International Speech Communication Association, INTERSPEECH 2020; Shanghai; China; 25 - 29 October 2020, 2147–2151, ISCA. doi:10.21437/Interspeech.2020-1863
DaCToR: A data collection tool for the RELATER project
Hussain, J.; Zenkri, O.; Stüker, S.; Waibel, A.
2020. Proceedings of the 12th Language Resources and Evaluation Conference. Ed.: N. Calzolari, 6627–6632, European Language Resources Association (ELRA)
Self-attentional models for lattice inputs
Sperber, M.; Neubig, G.; Pham, N.-Q.; Waibel, A.
2020. The 57th Annual Meeting of the Association for Computational Linguistics - proceedings of the conference : July 28-August 2, 2019, Florence, Italy. Ed.: A. Korhonen, 1185–1197, Association for Computational Linguistics (ACL)
Multilingual Neural Translation. PhD dissertation
Ha, T.-L.
2020. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000104498
2019
The IWSLT 2019 KIT Speech Translation System
Pham, N.-Q.; Nguyen, T.-S.; Ha, T.-L.; Hussain, J.; Schneider, F.; Niehues, J.; Stüker, S.; Waibel, A.
2019. Proceedings of the 16th International Conference on Spoken Language Translation (IWSLT 2019), Hong Kong, November 2-3. Ed.: J. Niehues. doi:10.5281/zenodo.3525564
21: Kognitive Systeme, Vorlesung, SS 2019, 22.07.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-701
20: Kognitive Systeme, Übung, SS 2019, 17.07.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-679
19: Kognitive Systeme, Vorlesung, SS 2019, 15.07.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-666
18: Kognitive Systeme, Vorlesung, SS 2019, 10.07.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-645
17: Kognitive Systeme, Übung, SS 2019, 08.07.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-636
16: Kognitive Systeme, Vorlesung, SS 2019, 03.07.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-616
15: Kognitive Systeme, Vorlesung, SS 2019, 01.07.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-607
14: Kognitive Systeme, Vorlesung, SS 2019, 26.06.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-585
13: Kognitive Systeme, Vorlesung, SS 2019, 24.06.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (Zentrum für Mediales Lernen (ZML), Ed.). doi:10.5445/DIVA/2019-576
12: Kognitive Systeme, Vorlesung, SS 2019, 19.06.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-570
11: Kognitive Systeme, Vorlesung, SS 2019, 17.06.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-554
10: Kognitive Systeme, Vorlesung, SS 2019, 12.06.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-540
09: Kognitive Systeme, Vorlesung, SS 2019, 05.06.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-520
08: Kognitive Systeme, Vorlesung, SS 2019, 03.06.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-511
Attention-Passing Models for Robust and Data-Efficient End-to-End Speech Translation
Sperber, M.; Neubig, G.; Niehues, J.; Waibel, A.
2019. Transactions of the Association for Computational Linguistics, 7, 313–325. doi:10.1162/tacl_a_00270
07: Kognitive Systeme, Vorlesung, SS 2019, 22.05.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (Zentrum für Mediales Lernen (ZML), Ed.). doi:10.5445/DIVA/2019-422
06: Kognitive Systeme, Vorlesung, SS 2019, 20.05.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-402
05: Kognitive Systeme, Vorlesung, SS 2019, 15.05.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-367
04: Kognitive Systeme, Vorlesung, SS 2019, 13.05.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-347
03: Kognitive Systeme, Vorlesung, SS 2019, 08.05.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-322
02: Kognitive Systeme, Vorlesung, SS 2019, 06.05.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-313
01: Kognitive Systeme, Vorlesung, SS 2019, 29.04.2019
Dillmann, R.; Waibel, A.; Stüker, S.; Meißner, P.
2019. (KIT | Webcast, Ed.). doi:10.5445/DIVA/2019-283
Optimization of DNN Acoustic Models for Low Resource and Mobile Environments. PhD dissertation
Tu, A.; Waibel, A.; Asfour, T.
2019. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166170
KIT’s Submission to the IWSLT 2019 Shared Task on Text Translation
Schneider, F.; Waibel, A.
2019. Proceedings of the 16th International Conference on Spoken Language Translation. Ed.: J. Niehues, R. Cattoni, S. Stüker, M. Negri, M. Turchi, T.-L. Ha, E. Salesky, R. Sanabria, L. Barrault, L. Specia, M. Federico, Association for Computational Linguistics (ACL)
Bimodal Speech Emotion Recognition Using Pre-Trained Language Models
Heusser, V.; Freymuth, N.; Constantin, S.; Waibel, A.
2019. arxiv. doi:10.5445/IR/1000166167
Multimodal Dialogue Processing for Machine Translation
Waibel, A.
2019. The Handbook of Multimodal-Multisensor Interfaces, Vol.: 3. Ed.: S. Oviatt, 577–620, Association for Computing Machinery (ACM). doi:10.1145/3233795.3233811
Unsupervised Style Transfer. bachelor’s thesis
Rublack, V.
2019. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166160
Application of Neural Networks for Heading Direction Estimation. bachelor’s thesis
Nguyen, X. T.
2019. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166159
A Text-to-Speech system based on Deep Neural Networks. bachelor’s thesis
Dunaev, A.; Waibel, A.; Asfour, T.; Constantin, S.
2019. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166158
A Study on Semantic Parsing of Cooking Recipes. bachelor’s thesis
Pfisterer, F.; Waibel, A.; Asfour, T.; Hovy, E.; Otani, N.
2019. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166157
Automatic music transcription using sequence to sequence learning. master’s thesis
Awiszus, M.
2019. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166155
Improving Zero-shot Translation with Language-Independent Constraints
Pham, N.-Q.; Niehues, J.; Ha, T.-L.; Waibel, A.
2019. Proceedings of the Fourth Conference on Machine Translation. Vol. 1. Ed.: O. Bojar, 13–23, Association for Computational Linguistics (ACL). doi:10.18653/v1/W19-5202
Very Deep Self-Attention Networks for End-to-End Speech Recognition
Pham, N.-Q.; Nguyen, T.-S.; Niehues, J.; Müller, M.; Stüker, S.; Waibel, A.
2019. Crossroads of speech and language : 20th Annual Conference of the International Speech Communication Association (INTERSPEECH 2019) : Graz, Austria, 15-19 September 2019, 66–70, Curran. doi:10.21437/Interspeech.2019-2702
Multi-task learning to improve natural language understanding
Constantin, S.; Niehues, J.; Waibel, A.
2019. Proceedings of the International Workshop Series on Spoken Dialogue System Technology (IWSDS 2019)
Incremental processing of noisy user utterances in the spoken language understanding task
Constantin, S.; Niehues, J.; Waibel, A.
2019. The Fifth Workshop on Noisy User-generated Text (W-NUT 2019) - proceedings of the workshop : Nov 4, 2019, Hong Kong, China : W-NUT 2019. Ed.: W. Xu, 265–274, Association for Computational Linguistics (ACL). doi:10.18653/v1/D19-5535
Fluent translations from disfluent speech in end-to-end speech translation
Salesky, E.; Sperber, M.; Waibel, A.
2019. The 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - proceedings of the conference, NAACL HLT 2019, June 2-7, 2019, Minneapolis, USA. Vol. 1 (long and short papers). Ed. J. Burstein, 2786–2792, ACL
Paraphrases as foreign languages in multilingual neural machine translation
Zhou, Z.; Sperber, M.; Waibel, A.
2019. The 57th Annual Meeting of the Association for Computational Linguistics - proceedings of the Student Research Workshop : July 28-August 2, 2019, Florence, Italy. Ed.: F. Alva-Manchego, 113–122, Association for Computational Linguistics (ACL)
An End-to-End Goal-Oriented Dialog System with a Generative Natural Language Response Generation
Constantin, S.; Niehues, J.; Waibel, A.
2019. 9th International Workshop on Spoken Dialogue System Technology. Ed.: L. F. D’Haro, 209–219, Springer. doi:10.1007/978-981-13-9443-0_18
Neural Codes to Factor Language in Multilingual Speech Recognition
Müller, M.; Stüker, S.; Waibel, A.
2019. 2019 IEEE International Conference on Acoustics, Speech, and Signal Processing: Proceedings ; May 12-17, 2019, Brighton Conference Centre, Brighton, United Kingdom, 8638–8642, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2019.8683484
End-to-End Neural Speech Translation. PhD dissertation
Sperber, M.
2019. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000095218
Towards Fluent Translations from Disfluent Speech
Salesky, E.; Burger, S.; Niehues, J.; Waibel, A.
2019. 2018 IEEE Workshop on Spoken Language Technology, SLT 2018: Proceedings, 921–926, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/SLT.2018.8639661
Bulbasaa: A bilingual Bàsàá-French speech corpus for the evaluation of language documentation tools
Hamlaoui, F.; Makasso, E.-M.; Müller, M.; Engelmann, J.; Adda, G.; Waibel, A.; Stüker, S.
2019. 11th International Conference on Language Resources and Evaluation, LREC 2018; Phoenix Seagaia Conference CenterMiyazaki; Japan; 7 May 2018 through 12 May 2018. Ed.: H. Isahara, 3377–3381, European Language Resources Association (ELRA)
Yeah, right, uh-huh: A deep learning backchannel predictor
Ruede, R.; Müller, M.; Stüker, S.; Waibel, A.
2019. 8th International Workshop on Spoken Dialogue Systems, IWSDS 2017; Farmington; United States; 6 June 2017 through 9 June 2017, 247–258, Springer. doi:10.1007/978-3-319-92108-2_25
2018
Open Source Toolkit for Speech to Text Translation
Zenkel, T.; Sperber, M.; Niehues, J.; Müller, M.; Pham, N.-Q.; Stüker, S.; Waibel, A.
2018. The Prague Bulletin of Mathematical Linguistics, 111 (1), 125–135. doi:10.2478/pralin-2018-0011
Multilingual Adaptation of RNN Based ASR Systems
Miiller, M.; Stiiker, S.; Waibel, A.
2018. 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Calgary, 15th-20th April 2018, 5219–5223, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2018.8461614
Towards one-shot learning for rare-word translation with external experts
Pham, N.-Q.; Niehues, J.; Waibel, A.
2018. Neural Machine Translation and Generation : Proceedings of the 2nd Workshop, July 20, 2018, Melbourne, Australia. Ed.: A. Alexandra, 100–109, Association for Computational Linguistics (ACL). doi:10.18653/v1/W18-2712
Automated evaluation of out-of-context errors
Huber, P.; Niehues, J.; Waibel, A.
2018. 11th International Conference on Language Resources and Evaluation, LREC 2018; Phoenix Seagaia Conference CenterMiyazaki; Japan; 7 May 2018 through 12 May 2018. Ed.: H. Isahara, 2022–2026, European Language Resources Association (ELRA)
Kit-Multi: A translation-oriented multilingual embedding corpus
Ha, T.-L.; Niehues, J.; Sperber, M.; Pham, N. Q.; Waibel, A.
2018. 11th International Conference on Language Resources and Evaluation, LREC 2018; Phoenix Seagaia Conference CenterMiyazaki; Japan; 7 May 2018 through 12 May 2018. Ed.: H. Isahara, 3904–3907, European Language Resources Association (ELRA)
Attention Neural Network-Based Abstractive Summarization and Headline Generation. master’s thesis
Douma, N.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166202
Analysis and Advancement of Differentiable Neural Computers for Question Answering. master’s thesis
Franke, J.; Waibel, A.; Asfour, T.; Niehues, J.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166201
Enhancing Multilingual Graphemic RNN based ASR Systems using Phone Information
Müller, M.; Stüker, S.; Waibel, A.
2018. Elektronische Sprachsignalverarbeitung 2018 : Tagungsband der 29. Konferenz, Ulm, 7.-9. März 2018. Ed.: A. Berton, 30–37, TUDpress
Exploring CTC-Network Derived Features With Conventional Hybrid System
Nguyen, T.-S.; Waibel, A.; Stüker, S.
2018. International Conference on Acoustics, Speech, and Signal Processing 2018 - ICASSP, Calgary, Canada, 15-20 April 2018, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2018.8461437
Parameter Optimization for CTC Acoustic Models in a Less-resourced Scenario: An Empirical Study
Müller, M.; Stüker, S.; Waibel, A.
2018. Speech communication: 13. ITG-Fachtagung Sprachkommunikation: 10.-12. Oktober 2018 in Oldenburg. Ed.: S. Doclo, 181–185, VDE Verlag
Measuring MT Quality using vector representations. master’s thesis
Huang, Y.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166177
Neuronale Textklassifikation mittels Wissen aus der maschinellen Übersetzung. master’s thesis
Kleiser, D.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166176
Term Extraction as Sequence Labeling Task using Recurrent Neural Networks. master’s thesis
Kucza, M.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166175
KIT’s IWSLT 2018 SLT Translation System
Sperber, M.; Pham, N. Q.; Nguyen, T. S.; Niehues, J.; Müller, M.; Ha, T.-L.; Stücker, S.; Waibel, A.
2018. Proceedings of the International Workshop on Spoken Language Translation. Ed.: M. Turchi; J. Niehues, 131–135, ACL Anthology
Keyword Based Document Retrieval via Document Embeddings. bachelor’s thesis
Brendl, J.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166163
Creating Audio Level Dependency Parse Trees from Speech. bachelor’s thesis
Beffart, T.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166162
Multimodal goal-oriented dialog using Encoder-Decoder-Networks. bachelor’s thesis
Baermann, L.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166161
Robust and Scalable Differentiable Neural Computer for Question Answering
Franke, J.; Niehues, J.; Waibel, A.
2018. Machine Reading for Question Answering : proceedings of the workshop, July 19, 2018, Melbourne, Australia. Ed.: E. Choi, 47–59, Association for Computational Linguistics (ACL). doi:10.18653/v1/W18-2606
Inspection of Multilingual Neural Machine Translation
Mullov, C.; Niehues, J.; Waibel, A.
2018. Proceedings of the Second Workshop on Multi-Language Processing in a Globalising World and the First Workshop on Multilingualism at the intersection of Knowledge Bases and Machine Translation (MLP-MomenT 2018) Hrsg.: J. Du
The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2018
Pham, N.-Q.; Niehues, J.; Waibel, A.
2018. Proceedings of the Third Conference on Machine Translation (WMT) Vol.: 2. Ed.: O. Bojar, 467–472, Association for Computational Linguistics (ACL). doi:10.18653/v1/W18-64049
KIT Lecture Translator: Multilingual Speech Translation with One-Shot Learning
Dessloch, F.; Ha, T.-L.; Müller, M.; Niehues, J.; Nguyen, T.-S.; Pham, N.-Q.; Salesky, E.; Sperber, M.; Stüker, S.; Zenkel, T.; Waibel, A.
2018. The 27th International Conference on Computational Linguistics : proceedings of System Demonstrations, August 20-26, 2018, Santa Fe, New Mexico, USA, COLING 2018. Ed.: D. Zhao, 89–93, Association for Computational Linguistics (ACL)
Multilingual Modulation by Neural Language Codes. PhD dissertation
Müller, M.
2018. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000088486
Low-latency neural speech translation
Niehues, J.; Pham, N.-Q.; Ha, T.-L.; Sperber, M.; Waibel, A.
2018. 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018; Hyderabad International Convention Centre (HICC)Hyderabad; India; 2 September 2018 through 6 September 2018. Ed.: C.C. Sekhar, 1293–1297, ISCA. doi:10.21437/Interspeech.2018-1055
Neural language codes for multilingual acoustic models
Müller, M.; Stüker, S.; Waibel, A.
2018. 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018; Hyderabad International Convention Centre (HICC)Hyderabad; India; 2 September 2018 through 6 September 2018. Ed.: C. C. Sekhar, 2419–2423, ISCA. doi:10.21437/Interspeech.2018-1241
Self-attentional acoustic models
Sperber, M.; Niehues, J.; Neubig, G.; Stüker, S.; Waibel, A.
2018. 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018; Hyderabad International Convention Centre (HICC)Hyderabad; India; 2 September 2018 through 6 September 2018. Ed.: C.C. Sekhar, 3723–3727, ISCA. doi:10.21437/Interspeech.2018-1910
Subword and crossword units for CTC acoustic models
Zenkel, T.; Sanabria, R.; Metze, F.; Waibel, A.
2018. 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018; Hyderabad International Convention Centre (HICC)Hyderabad; India; 2 September 2018 through 6 September 2018. Ed.: C. C. Sekhar, 396–400, ISCA. doi:10.21437/Interspeech.2018-2057
Term extraction via neural sequence labeling a comparative evaluation of strategies using recurrent neural networks
Kucza, M.; Niehues, J.; Zenkel, T.; Waibel, A.; Stüker, S.
2018. 19th Annual Conference of the International Speech Communication, INTERSPEECH 2018; Hyderabad International Convention Centre (HICC)Hyderabad; India; 2 September 2018 through 6 September 2018, 2072–2076, ISCA. doi:10.21437/Interspeech.2018-2017
Building Real-Time Speech Recognition Without CMVN
Nguyen, T. S.; Sperber, M.; Stüker, S.; Waibel, A.
2018. 20th International Conference on Speech and Computer, SPECOM 2018; Leipzig; Germany; 18 September 2018 through 22 September 201, 451–460, Springer. doi:10.1007/978-3-319-99579-3_47
DBLSTM based multilingual articulatory feature extraction for language documentation
Müller, M.; Stiiker, S.; Waibel, A.
2018. 2017 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), Okinawa, J, December 16-20, 2017, 417–423, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2017.8268966
Speech interaction strategies for a humanoid assistant
Stüker, S.; Constantin, S.; Niehues, J.; Nguyen, T.-S.; Muller, M.; Pham, N. Q.; Rüde, R.; Waibel, A.
2018. 13th International Scientific-Technical Conference on Electromechanics and Robotics “Zavalishin’s Readings”, St. Petersburg, Russia, 18th - 21st April 2018, Art.Nr. 01002, EDP Sciences. doi:10.1051/matecconf/201816101002
2017
Transcribing against time
Sperber, M.; Neubig, G.; Niehues, J.; Nakamura, S.; Waibel, A.
2017. Speech communication, 93, 20–30. doi:10.1016/j.specom.2017.07.006
TAUS Speech-to-Speech Translation Technology Report
Seligman, M.; Waibel, A.; Joscelyne, A.
2017. TAUS
Topic Prediction in Dialogs using Convolutional Neural Networks. master’s thesis
Jin, K.; Waibel, A.; Asfour, T.
2017. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166207
Domain-independent Punctuation and Segmentation Insertion
Waibel, A.; Cho, E.; Niehues, J.
2017. Proceedings of the 14th International Conference on Spoken Language Translation. Ed.: S. Sakti, M. Utiyama, 74–81, Association for Computational Linguistics (ACL)
The 2017 KIT IWSLT Speech-to-Text Systems for English and German
Nguyen, T.-S.; Sperber, S.; Zenkel, T.; Stüker, S.; Müller, M.; Waibel, A.
2017. Proceedings of the 14th International Conference on Spoken Language Translation. Ed.: S. Sakti, M. Utiyama, 60–64, Association for Computational Linguistics (ACL)
KIT’s Multilingual Neural Machine Translation systems for IWSLT 2017
Pham, N.-Q.; Salesky, E.; Ha, T.-L.; Niehues, J.; Waibel, A.; Sperber, M.
2017. Proceedings of the 14th International Conference on Spoken Language Translation. Ed.: S. Sakti, M. Utiyama, 42–47, Association for Computational Linguistics (ACL)
Target Factors for Neural Machine Translation. diploma thesis
Wagner, M.
2017. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166198
Towards Tonality Detection in Unseen Languages. master’s thesis
Deßloch, F.
2017. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166194
Online Neural Network-based Language Identification. master’s thesis
Draper, D. H.
2017. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166193
Character Based Language Modeling and Applications in Speech Recognition. master’s thesis
Zenkel, T.; Waibel, A.; Tichy, W.
2017. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166192
Neural Network-based Small-Footprint Flexible Keyword Spotting. master’s thesis
Zhu, L.; Waibel, A.; Asfour, T.; Müller, M.
2017. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166191
Improving phoneme set discovery for documenting unwritten languages
Müller, M.; Stüker, S.; Waibel, A.; Franke, J.
2017. Elektronische Sprachsignalverarbeitung 2017: Tagungsband der 28. Konferenz, Saarbrücken, 15.-17. März 2017. Ed.: J. Trouvain, I. Steiner, B. Möbius, 202–209, TUDpress
Backchannel Prediction for Conversational Speech Using Recurrent Neural Networks. bachelor’s thesis
Ruede, R.; Waibel, A.; Asfour, T.; Müller, M.
2017. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166173
Toward Robust Neural Machine Translation for Noisy Input Sequences
Sperber, M.; Niehues, J.; Waibel, A.
2017. Proceedings of the International 14th International Workshop on Spoken Language Translation : 14th-15th December, 2017 Tokyo, Japan : IWSLT 2017. Ed.: S. Sakti, 90–96, ACL Anthology
Analyzing Neural MT Search and Model Performance
Niehues, J.; Cho, E.; Ha, T.-L.; Waibel, A.
2017. ACL 2017 The First Workshop on Neural Machine Translation : Proceedings of the Workshop, August 4, 2017, Vancouver, Canada. Ed. T. Luong, 11–17, Association for Computational Linguistics (ACL). doi:10.18653/v1/W17-3202
Effective Strategies in Zero-Shot Neural Machine Translation
Ha, T.-L.; Niehues, J.; Waibel, A.
2017. Proceedings of the International Workshop on Spoken Language Translation 14th-15th December, 2017 Tokyo, Japan. Ed.: S. Sakti, 105–112, ACL Anthology
The QT21 Combined Machine Translation System for English to Latvian
Peter, J.-T.; Ney, H.; Bojar, O.; Pham, N.-Q.; Niehues, J.; Waibel, A.; Burlot, F.; Yvon, F.; Pinnis, M.; Sics, V.; Bastings, J.; Rios, M.; Aziz, W.; Williams, P.; Blain, F.; Specia, L.
2017. WMT 2017 Second Conference on Machine Translation : Proceedings, September 7-8, 2017, Copenhagen, Denmark. Ed.: O. Bojar, 348–357, Association for Computational Linguistics (ACL). doi:10.18653/v1/W17-4734
The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2017
Pham, N.-Q.; Niehues, J.; Ha, T.-L.; Cho, E.; Sperber, M.; Waibel, A.
2017. WMT 2017 Second Conference on Machine Translation : Proceedings, September 7-8, 2017, Copenhagen, Denmark. Ed.: O. Bojar, 366–373, Association for Computational Linguistics (ACL). doi:10.18653/v1/W17-4736
Neural Lattice-to-Sequence Models for Uncertain Inputs
Sperber, M.; Neubig, G.; Niehues, J.; Waibel, A.
2017. emnlp 2017 : Conference on Empirical Methods in Natural Language Processing : conference proceedings : Copenhagen, Denmark, September 7-11, 2017. Ed.: M. Palmer, 1380–1389, Association for Computational Linguistics (ACL). doi:10.18653/v1/D17-1145
NMT-based segmentation and punctuation insertion for real-Time spoken language translation
Cho, E.; Niehues, J.; Waibel, A.
2017. Situated Interaction : Interspeech 2017, Stockholm, Sweden, 20th - 24th August 2017. Ed.: F. Lacerda, 2645–2649, ISCA. doi:10.21437/Interspeech.2017-1320
Enhancing backchannel prediction using word embeddings
Ruede, R.; Müller, M.; Stüker, S.; Waibel, A.
2017. Situated interaction : Interspeech 2017, Stockholm, Sweden, 20th - 24th August 2017. Ed.: F. Lacerda, 879–883, ISCA. doi:10.21437/Interspeech.2017-1606
Comparison of decoding strategies for CTC acoustic models
Zenkel, T.; Sanabria, R.; Metze, F.; Niehues, J.; Sperber, M.; Stüker, S.; Waibel, A.
2017. Situated interaction : Interspeech 2017, Stockholm, Sweden, 20th - 24th August 2017. Ed.: F. Lacerda, 513–517, ISCA. doi:10.21437/Interspeech.2017-1683
Language adaptive multilingual CTC speech recognition
Müller, M.; Stüker, S.; Waibel, A.
2017. Speech and Computer: 19th International Conference, SPECOM 2017 Hatfield, UK, September 12–16, 2017, Proceedings. Ed.: A. Karpov, 473–482, Springer. doi:10.1007/978-3-319-66429-3_47
Improved Speaker Adaptation by Combining I-vector and fMLLR with Deep Bottleneck Networks
Nguyen, T. S.; Kilgour, K.; Sperber, M.; Waibel, A.
2017. Speech and Computer : Proceedings of the 19th International Conference, SPECOM 2017, Hatfield, United Kingdom, 12th - 16th September 2017. Ed.: A. Karpov, 417–426, Springer. doi:10.1007/978-3-319-66429-3_41
Towards phoneme inventory discovery for documentation of unwritten languages
Müller, M.; Franke, J.; Waibel, A.; Stueker, S.
2017. ICASSP 2017 : IEEE International Conference on Acoustics, Speech and Signal Processing, New Orleans, USA, 5th - 9th March 2017, 5200–5204, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2017.7953148
Learning from Noisy Data in Statistical Machine Translation. PhD dissertation
Mediani, M.
2017. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000072805
Towards an Open-Domain Social Dialog System
Schmidt, M.; Niehues, J.; Waibel, A.
2017. Dialogues with Social Robots. Ed.: K. Jokinen, 271–278, Springer. doi:10.1007/978-981-10-2585-3_21
2016
Audio Segmentation for Robust Real-Time Speech Recognition Based on Neural Networks. bachelor’s thesis
Wetzel, M.
2016. Association for Computational Linguistics (ACL). doi:10.5445/IR/1000183883
Unsupervised Phoneme Segmentation of Previously Unseen Languages. master’s thesis
Vetter, M.
2016. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000183875
Error analysis of a statistical machine translation system. bachelor’s thesis
Chelbi, S.
2016. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166798
Personalized News Event Retrieval for Small Talk in Social Dialog Systems. master’s thesis
Bechberger, L.
2016. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166675
Analysis on Punctuation Prediction Performance given Latency Contraints. master’s thesis
Wang, Y.
2016. Karlsruher Institut für Technologie (KIT)
Optimizing Computer-Assisted Transcription Quality with Iterative User Interfaces
Sperber, M.; Neubig, G.; Nakamura, S.; Waibel, A.
2016. Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC’16). Ed.: N. Calzolari, K. Choukri, T. Declerck, S. Goggi, M. Grobelnik, B. Maegaard, J. Mariani, H. Mazo, A, Moreno, J. Odijk, S. Piperidis, 1986–1992, European Language Resources Association (ELRA)
Evaluation of the KIT Lecture Translation System
Müller, M.; Fünfer, S.; Stüker, S.; Waibel, A.
2016. Proceedings of the 10th Language Resources and Evaluation Conference (LREC’16). Ed.: N. Calzolari, K. Choukri, T. Declerck, S. Goggi, M. Grobelnik, B. Maegaard, J. Mariani, H. Mazo, A. Moreno, J. Odijk, S. Piperidis, 1856–1861, European Language Resources Association (ELRA)
Training Deep Neural Networks for Reverberation Robust Speech Recognition
Ritter, M.; Müller, M.; Stüker, S.; Metze, F.; Waibel, A.
2016. Speech communication : 12. ITG-Fachtagung Sprachkommunikation, 5.-7. Oktober 2016 in Paderborn, 382–386, VDE Verlag
Language Feature Vectors for Resource Constraint Speech Recognition
Müller, M.; Stüker, S.; Waibel, A.
2016. Speech communication : 12. ITG-Fachtagung Sprachkommunikation, 5.-7. Oktober 2016 in Paderborn, 357–361, VDE Verlag
Phoneme Boundary Detection using Deep Bidirectional LSTMs
Franke, J.; Müller, M.; Hamaloui, F.; Stüker, S.; Waibel, A.
2016. Speech communication : 12. ITG-Fachtagung Sprachkommunikation, 5.-7. Oktober 2016 in Paderborn, 377–381, VDE Verlag
Audio Segmentation for Robust Real-Time Speech Recognition Based on Neural Networks
Wetzel, M.; Sperber, M.; Waibel, A.
2016. Proceedings of the 13th International Conference on Spoken Language Translation. Ed.: M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, R. Cattoni, M. Federico, Association for Computational Linguistics (ACL)
Towards Improving Low-Resource Speech Recognition Using Articulatory and Language Features
Müller, M.; Stüker, S.; Waibel, A.
2016. Proceedings of the 13th International Conference on Spoken Language Translation. Ed.: M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, R. Cattoni, M. Federico, Association for Computational Linguistics (ACL)
The 2016 KIT IWSLT Speech-to-Text Systems for English and German
Nguyen, T.-S.; Müller, M.; Sperber, M.; Zenkel, T.; Kilgour, K.; Stüker, S.; Waibel, A.
2016. Proceedings of the 13th International Conference on Spoken Language Translation. Ed.: M. Cettolo, J. Niehues, S. Stüker, L. Bentivogli, R. Cattoni, M. Federico, Association for Computational Linguistics (ACL)
Challenges for automatic speech recognition of non-native adolescent speech
Winebarger, J.; Mediani, M.; Stüker, S.; Waibel, A.
2016. Environnements numériques et interactions en langue étrangère : du formel à linformel, du réel à la réalité virtuelle. Ed.: M. Roy, 97–124, Peter Lang
An empirical exploration of CTC acoustic models
Miao, Y.; Gowayyed, M.; Na, X.; Ko, T.; Metze, F.; Waibel, A.
2016. 2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Shanghai, China, 20-25 March 2016, 2623–2627, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2016.7472152
Integrating Encyclopedic Knowledge into Neural Language Models
Zhang, Y.; Niehues, J.; Waibel, A.
2016. Proceedings of the 13th International Workshop on Spoken Language Translation – IWSLT, Seattle, USA, December 8-9, Association for Computational Linguistics (ACL)
Adaptation and Combination of NMT Systems: The KIT Translation Systems for IWSLT 2016
Cho, E.; Niehues, J.; Ha, T.-L.; Sperber, M.; Mediani, M.; Waibel, A.
2016. Proceedings of the 13th International Conference on Spoken Language Translation. Ed.: M. Cettolo
Lecture Translator Speech translation framework for simultaneous lecture translation
Müller, M.; Nguyen, T.-S.; Niehues, J.; Cho, E.; Krüger, B.; Ha, T.-L.; Kilgour, K.; Sperber, M.; Mediani, M.; Stüker, S.; Waibel, A.
2016. The 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies - proceedings of the conference : NAACL HLT 2016 : June 12-17, 2016, San Diego, California, USA. Ed.: K. Knight, 82–86, Association for Computational Linguistics (ACL). doi:10.18653/v1/N16-3017
Lightly Supervised Quality Estimation
Sperber, M.; Neubig, G.; Niehues, J.; Stüker, S.; Waibel, A.
2016. The 26th International Conference on Computational Linguistics : proceedings of COLING 2016: technical papers, December 11-16, 2016, Osaka, Japan : COLING 2016. Ed.: Y. Matsumoto, 3103–3113, Association for Computational Linguistics (ACL)
Pre-translation for neural machine translation
Niehues, J.; Cho, E.; Ha, T.-L.; Waibel, A.
2016. The 26th International Conference on Computational Linguistics - proceedings of COLING 2016: technical papers : December 11-16, 2016, Osaka, Japan : COLING 2016. Ed.: Y. Matsumoto, 1828–1836, Association for Computational Linguistics (ACL)
Using Tweets as "Ice-Breaking" Sentences in a Social Dialog System
Andonov, A.; Schmidt, M.; Niehues, J.; Waibel, A.
2016. Speech communication : 12. ITG-Fachtagung Sprachkommunikation, 5.-7. Oktober 2016 in Paderborn, 125–129, VDE Verlag
Multilingual Disfluency Removal using NMT
Cho, E.; Niehues, J.; Ha, T.-L.; Waibel, A.
2016. Proceedings of the 13th International Conference on Spoken Language Translation. Ed.: M. Cettolo
The Karlsruhe Institute of Technology Systems for the News Translation Task in WMT 2016
Ha, T.-L.; Cho, E.; Niehues, J.; Mediani, M.; Sperber, M.; Allauzen, A.; Waibel, A.
2016. Proceedings of the First Conference on Machine Translation, Volume 2. Ed.: O. Bojar, 303–310, Association for Computational Linguistics (ACL). doi:10.18653/v1/W16-2314
Using Factored Word Representation in Neural Network Language Models
Niehues, J.; Ha, T.-L.; Cho, E.; Waibel, A.
2016. Proceedings of the First Conference on Machine Translation, August 2016, Berlin, Germany. Vol.: 1. Ed.: O. Bojar, 74–82, Association for Computational Linguistics (ACL). doi:10.18653/v1/W16-2208
The QT21/HimL Combined Machine Translation System
Peter, J.-T.; Alkhouli, T.; Ney, H.; Huck, M.; Braune, F.; Fraser, A.; Tamchyna, A.; Bojar, O.; Haddow, B.; Sennrich, R.; Blain, F.; Specia, L.; Niehues, J.; Waibel, A.; Allauzen, A.; Aufrant, L.; Burlot, F.; knyazeva, elena; Lavergne, T.; Yvon, F.; Pinnis, M.; Frank, S.
2016. Proceedings of the First Conference on Machine Translation, August 2016, Berlin, Germany. Vol.: 2. Ed.: O. Bojar, 344–355, Association for Computational Linguistics (ACL). doi:10.18653/v1/W16-2320
Toward Multilingual Neural Machine Translation with Universal Encoder and Decoder
Ha, T.-L.; Niehues, J.; Waibel, A.
2016. Proceedings of the 13th International Conference on Spoken Language Translation. Ed.: M. Cettolo
Unsupervised phoneme segmentation of previously unseen languages
Vetter, M.; Müller, M.; Hamlaoui, F.; Neubig, G.; Nakamura, S.; Stüker, S.; Waibel, A.
2016. 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016; Hyatt Regency San FranciscoSan Francisco; United States; 8 September 2016 through 16 September 2016. Ed. : N. Morgan, 3544–3548. doi:10.21437/Interspeech.2016-1440
Dynamic transcription for low-latency speech translation
Niehues, J.; Nguyen, T. S.; Cho, E.; Ha, T.-L.; Kilgour, K.; Müller, M.; Sperber, M.; Stüker, S.; Waibel, A.
2016. 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016; Hyatt Regency San Francisco; United States; 8 September 2016 through 16 September 2016, 2513–2517, International Speech Communication Association. doi:10.21437/Interspeech.2016-154
Language adaptive DNNs for improved low resource speech recognition
Müller, M.; Stüker, S.; Waibel, A.
2016. 17th Annual Conference of the International Speech Communication Association, INTERSPEECH 2016; Hyatt Regency San FranciscoSan Francisco; United States; 8 September 2016 through 16 September 2016. Ed. : N. Morgan, 3878–3882, International Speech Communication Association. doi:10.21437/Interspeech.2016-1143
Machine Translation of Spontaneous Speech. PhD dissertation
Cho, E.
2016. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000056203
2015
Online Incremental Machine Translation. PhD dissertation
Rottmann, K.
2015, June. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000052162
A Novel Greeting Selection System for a Culture-Adaptive Humanoid Robot
Trovato, G.; Waibel, A.; Asfour, T.; Zecca, M.; Do, Ö. T. M.; Takanishi, A.; Terlemez, Ö.; Kuramochi, M.
2015. International journal of advanced robotic systems, 12, 13 S. doi:10.5772/60117
EU-BRIDGE Bridges Across the Language Divide : Final Evaluation Report | Version 1.0
Koehn, P.; Dugast, C.; Gauthier, J.; Grimsey, S.; Fünfer, S.; Müller, M.; Stüker, S.; Steinbiss, V.; Zhang, Y.
2015. (A. Waibel, Ed.). doi:10.5445/IR/1000166184
Using Tweets as "Ice-Breaking" Sentences in a Social Dialog System. bachelor’s thesis
Andonov, A.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000183919
GMM free ASR using DNN based Cluster Trees. bachelor’s thesis
Zhu, L.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166808
Klassenbasierte Sprachmodellierung mit neuronalen Netzen. bachelor’s thesis
Zenkel, T.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166807
Development of a Domain-Independent Interactive Question Answering System. bachelor’s thesis
Kaiser, F.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166806
Towards a Persuasive Dialog System Supporting Personal Health Management. bachelor’s thesis
Götzmann, V.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166805
Homogenization of Arabic Corpora for Machine Translation. student research project
Khelifi, M. Y.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166804
Predicting Clarification Questions in a Social Dialog System. bachelor’s thesis
Lian, X.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166802
Automatische Eingabeselektion in der Maschinellen Übersetzung. bachelor’s thesis
Weller, B.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166801
Easily Bootstrappable Statistical Spoken Dialogue System. master’s thesis
Valev, K.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166682
Active Learning For Acoustic Models in Automatic Speech Recognition. master’s thesis
Schulze, M. A.; Waibel, A.; Stüker, S.; Sperber, M.
2015. Karlsruher Institut für Technologie (KIT)
Parallelisierung für statistische maschinelle Übersetzung. diploma thesis
Wening, C.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166680
Sub-word Language Models for German LVCSR. diploma thesis
Zairi, A.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166679
Data Selection For Machine Translation With Paraphrasing. master’s thesis
Koch, M.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166678
Recurrent Neural Networks in Speech Disfluency Detection and Punctuation Prediction. master’s thesis
Reisser, M.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166677
Neural Network-based Multilingual Translation Models. master’s thesis
Nguyen, D. T.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166676
Linguistic Structure in Statistical Machine Translation. PhD dissertation
Herrmann, T.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000054573
Source Discriminative Word Lexicon for Translation Disambiguation
Herrmann, T.; Niehues, J.; Waibel, A.
2015. Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), December 3-4 2015, Da Nang, Vietnam. Ed.: M. Federico, 135–142
The KIT Translation Systems for IWSLT 2015
Ha, T.-L.; Niehues, J.; Cho, E.; Mediani, M.; Waibel, A.
2015. Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), December 3-4 2015, Da Nang, Vietnam. Ed.: M. Federico, 62–69, Association for Computational Linguistics (ACL)
Using Language Adaptive Deep Neural Networks for Improved Multilingual Speech Recognition
Müller, M.; Waibel, A.
2015. Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), December 3-4 2015, Da Nang, Vietnam. Ed.: M. Federico, 167–172, Association for Computational Linguistics (ACL)
The 2015 KIT IWSLT Speech-to-Text Systems for English and German
Müller, M.; Nguyen, T.-S.; Sperber, M.; Kilgour, K.; Stüker, S.; Waibel, A.
2015. Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), December 3-4 2015, Da Nang, Vietnam. Ed.: M. Federico, 70–75
Punctuation Insertion for Real-time Spoken Language Translation
Cho, E.; Niehues, J.; Kilgour, K.; Waibel, A.
2015. Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), December 3-4 2015, Da Nang, Vietnam. Ed.: M. Federico, 173–179
Multifeature Modular Deep Neural Network Acoustic Models
Kilgour, K.; Waibel, A.
2015. Proceedings of the 12th International Workshop on Spoken Language Translation (IWSLT), December 3-4 2015, Da Nang, Vietnam. Ed.: M. Federico, 159–166, Association for Computational Linguistics (ACL)
Effectiveness of Histogram Equalization and SyDOCC Features on Speech Recognition Performance on a Real-World Noisy Speech Task
Müller, M.; Wagner, M.; Hussain, J.; Stüker, S.; Waibel, A.
2015. 41. Jahrestagung für Akustik,(DAGA 2015) : DAGA-Conference, annual general meeting of the DEGA (German Acoustical Society), 16. - 19. March 2015, Nürnberg, Germany
Effectiveness of Histogram Equalization and SyDOCC Features on Speech Recognition Performance on a Real-World Noisy Speech Task
Müller, M.; Wagner, M.; Hussain, J.; Stüker, S.; Waibel, A.
2015. Programm : DAGA 2015 : 41. Jahrestagung für Akustik, DAGA-Conference, annual general meeting of the DEGA (German Acoustical Society), 16. - 19. März 2015, Nürnberg, Germany, 131–134, Druckhaus Garlev
Stripping Adjectives: Integration Techniques for Selective Stemming in SMT Systems
Slawik, I.; Niehues, J.; Waibel, A.
2015. Proceedings of the 18th Annual Conference of the European Association for Machine Translation (EAMT 2015), May 1-13 2015, Antalya, Turkey. Ed.: İ. El-Kahlout, 129–136, European Association for Machine Translation (EAMT)
A Semi-Automatic Word-Level Annotation and Transcription Tool for Spelling Error Categories
Müller, M.; Leuschner, D.; Briem, L.; Schmidt, M.; Kilgour, K.; Stüker, S.; Waibel, A.
2015. HCI International 2015 - Posters’ Extended Abstracts : International Conference, HCI International 2015, Los Angeles, CA, USA, August 2-7, 2015. Proceedings, Part I. Ed.: C. Stephanidis, 587–592, Springer International Publishing. doi:10.1007/978-3-319-21380-4_100
Using Neural Networks for Data-Driven Backchannel Prediction: A Survey on Input Features and Training Techniques
Müller, M.; Leuschner, D.; Briem, L.; Schmidt, M.; Kilgour, K.; Stüker, S.; Waibel, A.
2015. Human-Computer Interaction: Interaction Technologies : 17th International Conference, HCI International 2015, Los Angeles, CA, USA, August 2-7, 2015, Proceedings, Part II. Ed.: M. Kurosu, 329–340, Springer. doi:10.1007/978-3-319-20916-6 31
Evaluation of Crowdsourced User Input Data for Spoken Dialog Systems
Schmidt, M.; Müller, M.; Wagner, M.; Stüker, S.; Waibel, A.; Hofmann, H.; Werner, S.
2015. Proceedings of the 16th Annual SIGdial Meeting on Discourse and Dialogue (SIGDIAL 2015), September 2-4 2015, Prague, Czech Republic, 427–431, Association for Computational Linguistics (ACL). doi:10.18653/v1/W15-4657
Gaussian Free Cluster Tree Construction using Deep Neural Network
Zhu, L.; Kilgour, K.; Stüker, S.; Waibel, A.
2015. Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH), September 6-10 2015, Dresden, Germany
Gaussian Free Cluster Tree Construction using Deep Neural Network
Zhu, L.; Kilgour, K.; Stüker, S.; Waibel, A.
2015. Abstacts : Speech Beyond Speech Towards A Better Understanding Of The Most Important Biosigna, 16th Annual Conference of the International Speech Communication Association (INTERSPEECH), September 6-10 2015, Dresden, Germany, 214
Combination of NN and CRF Models for Joint Detection of Punctuation and Disfluencies
Cho, E.; Kilgour, K.; Niehues, J.; Waibel, A.
2015. Speech beyond speech towards a better understanding of the most important biosignal : 16th Annual Conference of the International Speech Communication Association (INTERSPEECH 2015) : Dresden, Germany, 6-10 September 2015, 3650–3654, Red Hook
Combination of NN and CRF Models for Joint Detection of Punctuation and Disfluencies
Cho, E.; Kilgour, K.; Niehues, J.; Waibel, A.
2015. Proceedings of the 16th Annual Conference of the International Speech Communication Association (INTERSPEECH), September 6-10 2015, Dresden, Germany
The KIT-LIMSI Translation System for WMT 2015
Ha, T.-L.; Do, Q.-K.; Cho, E.; Niehues, J.; Allauzen, A.; Yvon, F.; Waibel, A.
2015. EMNLP 2015 : Proceedings of the 10th Workshop on Statistical Machine Translation (WMT), September 17-18 2015, Lisboa, Portugal, 120–125, Association for Computational Linguistics (ACL). doi:10.18653/v1/W15-3012
ListNet-based MT Rescoring
Niehues, J.; Do, Q.-K.; Allauzen, A.; Waibel, A.
2015. Proceedings of the 10th Workshop on Statistical Machine Translation (WMT), September 17-18 2015, Lisboa, Portugal, 248–255, Association for Computational Linguistics (ACL). doi:10.18653/v1/W15-3030
The Karlsruhe Institute of Technology Translation Systems for the WMT 2015
Cho, E.; Ha, T.-L.; Niehues, J.; Herrmann, T.; Mediani, M.; Zhang, Y.; Waibel, A.
2015. EMNLP 2015 : Proceedings of the 10th Workshop on Statistical Machine Translation (WMT), September 17-18 2015, Lisboa, Portugal. Ed.: O. Bojar, 92–97, Association for Computational Linguistics (ACL). doi:10.18653/v1/W15-3008
Sprachbarrieren durchbrechen: Traum oder Wirklichkeit?
Waibel, A.
2015. Wahrnehmen und Steuern. Sensorsysteme in Biologie und Technik Band 122: Vorträge anlässlich der Jahresversammlung vom 19. bis 21. September 2014 in Rostock, Deutschland. Hrsg.: J. Hacker, 101–123, Wissenschaftliche Verlagsgesellschaft
On-line Recognition of Handwritten Mathematical Symbols
Thoma, M.; Kilgour, K.; Stüker, S.; Waibel, A.
2015. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000048047
2014
Sesla transcriber : A speech transcription tool that adapts to your skill and time budget
Sperber, M.; Neubig, G.; Nakamura, S.; Waibel, A.
2014. Proceedings of the Spoken Language Technology Workshop (SLT 2014)
On-line Recognition of Handwritten Mathematical Symbols. bachelor’s thesis
Thoma, M.
2014. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000183920
Analyse von Methoden zum Graphemclustering bei der automatischen Spracherkennung. bachelor’s thesis
Wallisch, C.; Waibel, A.; Stüker, S.; Müller, M.
2014. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166812
An Optimization of Deep Neural Networks in ASR Using Singular Value Decomposition. bachelor’s thesis
Tseyzer, I.; Waibel, A.; Stüker, S.; Kilgour, K.
2014. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166811
Pronominal Anaphora in Machine Translation. master’s thesis
Weiner, J.; Waibel, A.; Stüker, S.; Niehues, J.; Herrmann, T.
2014. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166685
A System for Recognizing Natural Spelling of English Words. diploma thesis
Czech, L.
2014. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166684
Optimization of Neural Network Language Models for Keyword Search
Gandhe, A.; Metze, F.; Waibel, A.; Lane, I.
2014. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), Florence, Italy, 04-09 May 2014, 4888–4892, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2014.6854531
A Neural Network Keyword Search System for Telephone Speech
Kilgour, K.; Waibel, A.
2014. Speech and Computer. 16th International Conference, SPECOM 2014, Novi Sad, Serbia, October 5-9, 2014. Proceedings. Ed.: A. Ronzhin, 58–65, Springer. doi:10.1007/978-3-319-11581-8_7
Multilingual Deep Bottle Neck Features: A Study on Language Selection and Training Techniques
Müller, M.; Stüker, S.; Sheikh, Z.; Metze, F.; Waibel, A.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation: Papers. Ed.: M. Federico, S. Stüker, F. Yvon, 257–264, Association for Computational Linguistics (ACL)
Rule-Based Preordering on Multiple Syntactic Levels in Statistical Machine Translation
Wu, G.; Zhang, Y.; Waibel, A.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation (IWSLT), 279–286
Improving In-Domain Data Selection for Small In-Domain Sets
Mediani, M.; Winebarger, J.; Waibel, A.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation: Papers. Ed.: M. Federico, S. Stüker, F. Yvon, 249–256, Association for Computational Linguistics (ACL)
The 2014 KIT IWSLT Speech-to-Text Systems for English, German and Italian
Kilgour, K.; Heck, M.; Müller, M.; Sperber, M.; Stüker, S.; Waibel, A.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign. Ed.: M. Federico, S. Stüker, F. Yvon, 73–79, Association for Computational Linguistics (ACL)
Combined Spoken Language Translation
Freitag, M.; Wuebker, J.; Peitz, S.; Ney, H.; Huck, M.; Birch, A.; Durrani, N.; Koehn, P.; Mediani, M.; Slawik, I.; Niehues, J.; Cho, E.; Waibel, A.; Bertoldi, N.; Cettolo, M.; Frederico, M.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation: Evaluation Campaign. Ed.: M. Federico, S. Stüker, F. Yvon, 57–64, Association for Computational Linguistics (ACL)
On-the-fly user modeling for cost-sensitive correction of speech transcripts
Sperber, M.; Neubig, G.; Nakamura, S.; Waibel, A.
2014. Proceedings of the Spoken Language Technology Workshop (SLT 2014), Lake Tahoe, USA, 07-10 December 2014, 460–465, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/SLT.2014.7078618
Extracting Translation Pairs from Social Network Content
Eck, M.; Zhang, J.; Waibel, A.; Zemlyanskiy, Y.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation: Papers. Ed.: M. Federico, S. Stüker, F. Yvon, 200–205, Association for Computational Linguistics (ACL)
Lexical Translation Model Using A Deep Neural Network Architecture
Ha, T.-L.; Niehues, J.; Waibel, A.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation : Papers, December 4-5, 2014, Lake Tahoe, California. Ed.: M. Federico, 223–229, ACL
Machine translation of multi-party meetings: Segmentation and disfluency removal strategies
Cho, E.; Niehues, J.; Waibel, A.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation: Papers. Ed.: M. Federico, 176–183, ACL
The KIT Translation Systems for IWSLT 2014
Slawik, I.; Mediani, M.; Niehues, J.; Zhang, Y.; Cho, E.; Herrmann, T.; Ha, T.-L.; Waibel, A.
2014. Proceedings of the 11th International Workshop on Spoken Language Translation : Evaluation Campaign. Ed.: M. Federico, 119–126, ACL
Ein System zur automatischen simultanen Übersetzung deutscher Vorlesungen
Stüker, S.; Cho, E.; Fügen, C.; Hermann, T.; Kilgour, K.; Mediani, M.; Mohr, C.; Niehues, J.; Rottmann, K.; Waibel, A.
2014. Translationswissenschaftliches Kolloquium III - Beiträge zur Übersetzungs- und Dolmetschwissenschaft. Hrsg.: B. Ahrens, 267–280, Lang u. Peter-Verl
Segmentation for Efficient Supervised Language Annotation with an Explicit Cost-Utility Tradeoff
Sperber, M.; Simantzik, M.; Neubig, G.; Nakamura, S.; Waibel, A.
2014. Transactions of the Association for Computational Linguistics, 2, 169–180
Tight Integration of Speech Disfluency Removal into SMT
Cho, E.; Niehues, J.; Waibel, A.
2014. EACL 2014 14th Conference of the European Chapter of the Association for Computational Linguistics : Proceedings of the Conference, April 26-30, 2014, Gothenburg, Sweden. Vol.: 2. Ed.: S. Wintner, 43–47, Association for Computational Linguistics (ACL). doi:10.3115/v1/E14-4009
Multilingual shifting deep bottleneck features for low-resource ASR
Nguyen, Q. B.; Gehring, J.; Mueller, M.; Stuker, S.; Waibel, A.
2014. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’14), Florence, Italy, May 4-9, 2014, 5607–5611, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2014.6854676
Training time reduction and performance improvements from multilingual techniques on the BABEL ASR task
Stuker, S.; Muller, M.; Nguyen, Q. B.; Waibel, A.
2014. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’14), Florence, Italy, May 4-9, 2014, 6374–6378, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2014.6854831
A Corpus of Spontaneous Speech in Lectures: The KIT Lecture Corpus for Spoken Language Processing and Translation
Cho, E.; Fünfer, S.; Stüker, S.; Waibel, A.
2014. 9th International Conference on Language Resources and Evaluation (LREC’14), Reykjavik, Iceland, May 26-31, 2014. Ed.: N. Calzolari, 1554–1559, European Language Resources Association (ELRA)
Manual Analysis of Structurally Informed Reordering in German-English Machine Translation
Herrmann, T.; Niehues, J.; Waibel, A.
2014. Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC’14), Reykjavik, Iceland, May 26-31, 2014. Ed.: N.Calzolari, 4379–4386, European Language Resources Association (ELRA)
Combining Techniques from different NN-based Language Models for Machine Translation
Niehues, J.; Allauzen, A.; Yvon, F.; Waibel, A.
2014. 11th Biennial Conference of the Association for Machine Translation in the Americas (AMTA’14), Vancouver, Canada, October 22-26, 2014. Ed.: Y. Al-Onaizan, 222–233, Association for Machine Translation in the Americas (AMTA)
The KIT-LIMSI Translation System for WMT 2014
Do, Q. K.; Herrmann, T.; Niehues, J.; Allauzen, A.; Yvon, F.; Waibel, A.
2014. ACL 2014 Ninth Workshop on Statistical Machine Translation : Proceedings of the Workshop June 26-27, 2014, Baltimore, Maryland, USA, 84–89, Association for Computational Linguistics (ACL). doi:10.3115/v1/W14-3307
EU-Bridge MT: Combined Machine Translation
Freitag, M.; Peitz, S.; Wuebker, J.; Ney, H.; Huck, M.; Sennrich, R.; Durrani, N.; Nadejde, M.; Williams, P.; Koehn, P.; Herrmann, T.; Cho, E.; Waibel, A.
2014. 52nd Annual Meeting of the Association for Computational Linguistics Linguistics (ACL’14), Baltimore, Maryland/USA, June 22-27, 2014, 105–113, ACL. doi:10.3115/v1/W14-3310
The Karlsruhe Institute of Technology Translation Systems for the WMT 2014
Herrmann, T.; Mediani, M.; Cho, E.; Ha, T.-L.; Niehues, J.; Slawik, I.; Zhang, Y.; Waibel, A.
2014. 52nd Annual Meeting of the Association for Computational Linguistics (ACL’14), Baltimore, Maryland/USA, June 22-27, 2014, 130–135, ACL. doi:10.3115/v1/W14-3313
Adaptation in Machine Translation. PhD dissertation
Niehues, J.
2014. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000042129
2013
Letter N-Gram-based Input Encoding for Continuous Space Language Models. diploma thesis
Sperr, H.
2013. Association for Computational Linguistics (ACL). doi:10.5445/IR/1000183878
Optimizing Deep Bottleneck Feature Extraction
Nguyen, Q. B.; Gehring, J.; Kilgour, K.; Waibel, A.
2013. The 10th IEEE RIVF International Conference on Computing and Communication Technologies, Hanoi, Vietnam, 10-13 November 2013, 152–156, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/RIVF.2013.6719885
CRF-based Disfluency Detection using Semantic Features for German to English Spoken Language Translation
Cho, E.; Ha, T.-L.; Waibel, A.
2013. Proceedings of the 10th International Workshop on Spoken Language Translation: Papers. Ed.: J. Y. Zhang, Association for Computational Linguistics (ACL)
The KIT Translation Systems for IWSLT 2013
Ha, T.-L.; Herrmann, T.; Niehues, J.; Mediani, M.; Cho, E.; Zhang, Y.; Slawik, I.; Waibel, A.
2013. Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign. Ed.: J. Y. Zhang, Association for Computational Linguistics (ACL)
Incremental Unsupervised Training for University Lecture Recognition
Heck, M.; Stüker, S.; Sakti, S.; Waibel, A.; Nakamura, S.
2013. Proceedings of the 10th International Workshop on Spoken Language Translation: Papers. Ed.: J. Y. Zhang, Association for Computational Linguistics (ACL)
The 2013 KIT Quaero Speech-to-Text System for French
Winebarger, J.; Nguyen, B.; Gehring, J.; Stüker, S.; Waibel, A.
2013. Proceedings of the 10th International Workshop on Spoken Language Translation: Papers. Ed.: J. Y. Zhang, Association for Computational Linguistics (ACL)
Maximum Entropy Language Modeling for Russian ASR
Shin, E.; Stüker, S.; Kilgour, K.; Fügen, C.; Waibel, A.
2013. Proceedings of the 10th International Workshop on Spoken Language Translation: Papers. Ed.: J. Y. Zhang, Association for Computational Linguistics (ACL)
The 2013 KIT IWSLT Speech-to-Text Systems for German and English
Kilgour, K.; Mohr, C.; Heck, M.; Nguyen, Q. B.; Nguyen, V. H.; Shin, E.; Tseyzer, I.; Gehring, J.; Müller, M.; Sperber, M.; Stüker, S.; Waibel, A.
2013. Proceedings of the 10th International Workshop on Spoken Language Translation: Evaluation Campaign. Ed.: J. Y. Zhang, Association for Computational Linguistics (ACL)
DNN Acoustic Modeling with Modular Multi-Lingual Feature Extraction Networks
Gehring, J.; Nguyen, Q. B.; Metze, F.; Waibel, A.
2013. Automatic Speech Recognition and Understanding Workshop (ASRU 2013), Olomouc, Czech Republic, 08-12 December 2013, 344–349, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2013.6707754
Models of tone for tonal and non-tonal languages
Metze, F.; Sheihk, Z. A. W.; Waibel, A.; Gehring, J.; Kilgour, K.; Nguyen, Q. B.; Nguyen, V. H.
2013. IEEE Automatic Speech Recognition and Understanding Workshop (ASRU 2013), Olomouc, Czech Republic, 08-12 December 2013, 261–266, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2013.6707740
Analyzing the Potential of Source Sentence Reordering in Statistical Machine Translation
Herrmann, T.; Weiner, J.; Niehues, J.; Waibel, A.
2013. Proceedings of the 10th International Workshop on Spoken Language Translation : Papers, December 5-6, 2013, Heidelberg, Germany. Ed.: J. Y. Zhang, ACL
EU-BRIDGE MT: Text translation of talks in the EU-BRIDGE project
Freitag, M.; Peitz, S.; Wuebker, J.; Ney, H.; Durrani, N.; Huck, M.; Koehn, P.; Ha, T.-L.; Niehues, J.; Mediani, M.; Herrmann, T.; Waibel, A.; Bertoldi, N.; Cettolo, M.; Federico, M.
2013. Proceedings of the Tenth International Workshop on Spoken Language Translation (IWSLT 2013), Heidelberg, 5th-6th December 2013, ACL Anthology
Extracting deep bottleneck features using stacked auto-encoders
Gehring, J.; Miao, Y.; Metze, F.; Waibel, A.
2013. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’13), Vancouver, Canada, May 26-31, 2013, 3377–3381, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2013.6638284
Warped Minimum Variance Distortionless Response based bottle neck features for LVCSR
Kilgour, K.; Tseyzer, I.; Nguyen, Q. B.; Waibel, A.
2013. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’13), Vancouver, Canada, May 26-31, 2013, 6990–6994, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2013.6639017
Learning discriminative basis coefficients for eigenspace MLLR unsupervised adaptation
Miao, Y.; Metze, F.; Waibel, A.
2013. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’13), Vancouver, Canada, May 26-31, 2013, 7927–7931, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2013.6639208
Subspace Mixture Model For Low-Resource Speech Recognition In Cross-Lingual Settings
Miao, Y.; Metze, F.; Waibel, A.
2013. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP’13), Vancouver, Canada, May 26-31, 2013, 7339–7343, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2013.6639088
Measuring the Structural Importance through Rhetorical Structure Index
Kokhlikyan, N.; Waibel, A.; Zhang, Y.; Zhang, J. Y.
2013. Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation (SSST). Ed.: M. Carpuat, L. Specia, D. Wu, 783–788, Curran
Combining Word Reordering Methods on different Linguistic Abstraction Levels for Statistical Machine Translation
Herrmann, T.; Niehues, J.; Waibel, A.
2013. Proceedings of the Seventh Workshop on Syntax, Semantics and Structure in Statistical Translation. Ed.: M. Carpuat, L. Specia, D. Wu, 39–47, Association for Computational Linguistics (ACL)
Modular Combination of Deep Neural Networks for Acoustic Modeling
Gehring, J.; Lee, W.; Kilgour, K.; Lane, I.; Miao, Y.; Waibel, A.
2013. 14th Annual Conference of the International Speech Communication Association (INTERSPEECH’13), Lyon, France, August 25-29, 2013. Ed.: F. Bimbot, 1–5, International Society for Computers and Their Applications (ISCA). doi:10.21437/Interspeech.2013-45
An MT Error-driven Discriminative Word Lexicon using Sentence Structure Features
Niehues, J.; Waibel, A.
2013. 8th Workshop on Statistical Machine Translation (WMT’13), Sofia, Bulgaria, August 8-9, 2013, 512–520, Curran
Letter N-Gram-based Input Encoding for Continuous Space Language Models
Sperr, H.; Niehues, J.; Waibel, A.
2013. ACL 2013 : 51st Annual Meeting of the Association for Computational Linguistics : Proceedings of the Workshop on Continuous Vector Space Models and their Compositionality, August 9, 2013, Sofia, Bulgaria, 30–39, Association for Computational Linguistics (ACL)
The Karlsruhe Institute of Technology Translation Systems for the WMT 2013
Cho, E.; Ha, T.-L.; Mediani, M.; Niehues, J.; Herrmann, T.; Slawik, I.; Waibel, A.
2013. 8th Workshop on Statistical Machine Translation (WMT’13), Sofia, Bulgaria, August 8-9, 2013, 104–108, Curran
Joint WMT 2013 Submission of the QUAERO Project
Cho, E.; Herrmann, T.; Mediani, M.; Niehues, J.; Waibel, A.; Allauzen, A.; Khanh Do, Q.; Buschbeck, B.; Wandmacher, T.
2013. 8th Workshop on Statistical Machine Translation (WMT’13), Sofia, Bulgaria, August 8-9, 2013, 185–192, Curran
Slightly Supervised Adaptation of Acoustic Models on Captioned BBC Weather Forecasts
Mohr, C.; Saam, C.; Kilgour, K.; Gehring, J.; Stüker, S.; Waibel, A.
2013. Proceedings of the First Workshop on Speech, Language and Audio in Multimedia Marseille, France, August 22-23, 2013. Ed.:G. Gravier, 32–36, RWTH Aachen
A Real-World System for Simultaneous Translation of German Lectures
Eunah, C.; Fügen, C.; Hermann, T.; Kilgour, K.; Mediani, M.; Mohr, C.; Niehues, J.; Rottmann, K.; Saam, C.; Stüker, S.; Waibel, A.
2013. 14th Annual Conference of the International Speech Communication Association (INTERSPEECH’13), Lyon, France, August 25-29, 2013. Ed.: F. Bimbot, 3473–3477, International Society for Computers and Their Applications (ISCA). doi:10.21437/Interspeech.2013-612
Segmentation of Telephone Speech Based on Speech and Non-speech Models
Heck, M.; Mohr, C.; Stüker, S.; Müller, M.; Kilgour, K.; Gehring, J.; Nguyen, Q. B.; Nguyen, V. H.; Waibel, A.
2013. Speech and Computer - 15th International Conference, SPECOM 2013, Pilsen, Czech Republic, September 1-5, 2013 - Proceedings. Ed.: M. Zelezny, 286–293, Springer-Verlag. doi:10.1007/978-3-319-01931-4_38
2012
Was braucht man für einen humanoiden Roboter?
Pham Huu, T. M.; Dillmann, R.; Bretthauer, G.; Waibel, A.
2012. doi:10.5445/DIVA/2012-362
Blind Dereverberation of Sinusoid Signals using PLL-based combined Phase and Amplitude Analysis
Huber, R.; Kraft, F.; Waibel, A.
2012. 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4549–4552, IEEE Computer Society. doi:10.1109/icassp.2012.6288930
Efficient Speech Transcription Through Respeaking. master’s thesis
Sperber, M.
2012. International Society for Computers and Their Applications (ISCA). doi:10.5445/IR/1000183879
Getting Bilingual Information from the Web. bachelor’s thesis
Koch, M.
2012. Karlsruher Institut für Technologie (KIT)
Japanese-English Machine Translation for a Humanoid Robot Moderator. student research project
Sperr, H. D.
2012. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166818
Approaches to Compound Splitting in German Spoken Term Detection. bachelor’s thesis
Wu, G.
2012. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166817
Ambiguity Resolution with User Interaction. student research project
Hundemer, M.; Waibel, A.; Rottmann, K.
2012. Karlsruher Institut für Technologie (KIT)
A Study of Distance Measures for Clustering Generalized Polyphones. student research project
Czech, L.; Waibel, A.; Stüker, S.
2012. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166814
Unsupervised Vocabulary Selection for Speech Recognition of Lectures. diploma thesis
Märgner, P.
2012. Karlsruher Institut für Technologie (KIT)
High-Accuracy Frequency, Phase and Amplitude Estimation for Robust Speech Recognition. diploma thesis
Huber, R.; Waibel, A.; Stern, R.; Kraft, F.
2012. Karlsruher Institut für Technologie (KIT)
Training Deep Neural Networks for Bottleneck Feature Extraction. master’s thesis
Gehring, J.; Waibel, A.; Metze, F.; Stüker, S.
2012. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166691
Unsupervised Acoustic Model Training for Simultaneous Lecture Translation in Incremental and Batch Mode. diploma thesis
Heck, M.; Stüker, S.; Sakti, S.; Waibel, A.; Nakamura, S.
2012. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166689
Unsupervised Vocabulary Selection for Real-Time Speech Recognition of Lectures
Märgner, P.; Waibel, A.; Lane, I.
2012. Proceedings of the 2012 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), Kyoto, Japan, 25-30 March 2012, 4417–4420, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2012.6288899
Automatische Zeichensetzung in Spracherkennungssystemen: Entscheidungsbaum und Sprachmodell im Vergleich. bachelor’s thesis
Adel, H.; Kilgour, K.; Stüker, S.; Waibel, A.
2012. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166336
Evaluation of Interactive User Corrections for Lecture Transcription
Kolkhorst, H.; Kilgour, K.; Stüker, S.; Waibel, A.
2012. Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2012), 217–221, Association for Computational Linguistics (ACL)
The KIT-NAIST (Contrastive) English ASR System for IWSLT 2012
Heck, M.; Kubo, K.; Sperber, M.; Sakti, S.; Stüker, S.; Saam, C.; Kilgour, K.; Mohr, C.; Neubig, G.; Toda, T.; Nakamura, S.; Waibel, A.
2012. Proceedings of the 9th International Workshop on Spoken Language Translation: Evaluation Campaign, 91–95, Association for Computational Linguistics (ACL)
The 2012 KIT and KIT-NAIST English ASR Systems for the IWSLT Evaluation
Saam, C.; Mohr, C.; Kilgour, K.; Heck, M.; Sperber, M.; Kubo, K.; Stüker, S.; Sakti, S.; Neubig, G.; Toda, T.; Nakamura, S.; Waibel, A.
2012. Proceedings of the International Workshop for Spoken Language Translation (IWSLT 2012), 87–90, Association for Computational Linguistics (ACL)
The Karlsruhe Institute of Technology Translation Systems for the WMT 2012
Niehues, J.; Zhang, Y.; Mediani, M.; Herrmann, T.; Cho, E.; Waibel, A.
2012. 7th Workshop on Statistical Machine Translation : Proceedings of the Workshop, June 7-8, 2012, Montréal, Canada. Ed.: C. Callison-Burch, 349–355, Association for Computational Linguistics (ACL)
Continuous Space Language Models using Restricted Boltzmann Machines
Niehues, J.; Waibel, A.
2012. Proceedings of the Ninth International Workshop on Spoken Language Translation (IWSLT 2012), 164–170, Association for Computational Linguistics (ACL)
Detailed Analysis of different Strategies for Phrase Table Adaptation in SMT
Niehues, J.; Waibel, A.
2012. Proceedings of the Tenth Conference of the Association for Machine Translation in the Americas (AMTA 2012), San Diego, 28th October -1st November 2012 AMTA 2012 - Proceedings of the 10th Conference of the Association for Machine Translation in the Americas2012 10th Conference of the Association for Machine Translation in the Americas, AMTA 2012San Diego28 October 2012 through 1 November 2012Code 123820, Association for Machine Translation in the Americas (AMTA)
Joint WMT 2012 Submission of the QUAERO Project
Freitag, M.; Peitz, S.; Huck, M.; Ney, H.; Niehues, J.; Herrmann, T.; Waibel, A.; Le, H.- son; Lavergne, T.; Allauzen, A.; Buschbeck, B.; Crego, J. M.; Senellart, J.
2012. 7th Workshop on Statistical Machine Translation : Poroceedings of the Workshop, June 7-8, 2012, Montréal, Canada. Ed.: C. Callison-Burch, 322–329, Association for Computational Linguistics (ACL)
Segmentation and punctuation prediction in speech language translation using a monolingual translation system
Cho, E.; Niehues, J.; Waibel, A.
2012. Proceedings of the 9th International Workshop on Spoken Language Translation: Papers, 252–259, Association for Computational Linguistics (ACL)
Parallel Phrase Scoring for Extra-large Corpora
Mediani, M.; Niehues, J.; Waibel, A.
2012. The Prague bulletin of mathematical linguistics, 98 (1), 87–98. doi:10.2478/v10108-012-0011-z
The KIT Lecture Corpus for Speech Translation
Stüker, S.; Kraft, F.; Mohr, C.; Herrmann, T.; Cho, E.; Waibel, A.
2012. Proceedings of the Eight International Conference on Language Resources and Evaluation (LREC’12) Ed.: N. Calzolari, 3409–3414, European Language Resources Association (ELRA)
A hybrid phonotactic language identification system with an SVM back-end for simultaneous lecture translation
Heck, M.; Stüker, S.; Waibel, A.
2012. Proceedings of the 2012 IEEE International Conference on Acoustics Speech and Signal Processing (ICASSP), March 25-30, 2012, 4857–4860, IEEE Computer Society. doi:10.1109/ICASSP.2012.6289007
2011
The Karlsruhe Institute of Technology translation systems for the WMT 2011
Herrmann, T.; Mediani, M.; Niehues, J.; Waibel, A.
2011. WMT ’11 : Proceedings of the Sixth Workshop on Statistical Machine Translation : Edinburgh Scotland July 30 - 31, 2011. Ed.: C. Callison-Burch, 379–385, Association for Computational Linguistics (ACL)
Unsupervised Vocabulary Selection for Domain-Independent Simultaneous Lecture Translation
Märgner, P.; Kilgour, K.; Lane, I.; Waibel, A.
2011. International Workshop on Spoken Language Translation (IWSLT 2011), San Francisco, 8th - 9th December 2011, 19–23, Association for Computational Linguistics (ACL)
Themenspezifische Vor-Adaptierung von Sprachmodellen. student research project
Xu, Y.
2011. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166827
Combined Phase and Amplitude Analysis in Harmonic Acoustic Signals for Robust Speech Recognition. student research project
Huber, R.
2011. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166826
Distributed N-Gram Language Models: Application of Large Models to Automatic Speech Recognition. student research project
Mandery, C.
2011. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166825
Automatic Language Identification for Natural Speech Processing Systems. student research project
Heck, M.
2011. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166824
Strategies for Out-Of-Vocabulary Words in Spoken Term Detection. bachelor’s thesis
Kolkhorst, H.
2011. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000166823
Predicting, Detecting and Explaining the Occurrence of Vocal Activity in Multi-Party Conversation. PhD dissertation
Laskowski, K.
2011. Carnegie Mellon University
Unsupervised vocabulary selection for simultaneous lecture translation
Märgner, P.; Kilgour, K.; Lane, I.; Waibel, A.
2011. Proceedings of the 8th International Workshop on Spoken Language Translation: Papers. Ed.: M. Federico, M. Hwang, M. Rödder, S. Stüker, 214–221, Association for Computational Linguistics (ACL)
TriS: A Statistical Sentence Simplifier with Log-linear Models and Margin-based Discriminative Training
Bach, N.; Gao, Q.; Vogel, S.; Waibel, A.
2011. Proceedings of 5th International Joint Conference on Natural Language Processing. Ed.: H. Wang, D. Yarowsky, 474–482, Asian Federation of Natural Language Processing
Using Wikipedia to translate domain-specific terms in SMT
Niehues, J.; Waibel, A.
2011. Proceedings of the 8th International Workshop on Spoken Language Translation : Papers, December 8-9 2011, San Francisco, California. Ed.: M. Federico, 230–237
The KIT English-French translation systems for IWSLT 2011
Mediani, M.; Cho, E.; Niehues, J.; Herrmann, T.; Waibel, A.
2011. Proceedings of the 8th International Workshop on Spoken Language Translation: Evaluation Campaign. Ed.: M. Federico, 73–78, Association for Computational Linguistics (ACL)
Joint WMT Submission of the QUAERO Project
Freitag, M.; Leusch, G.; Wuebker, J.; Peitz, S.; Ney, H.; Herrmann, T.; Niehues, J.; Waibel, A.; Allauzen, A.; Adda, G.; Crego, J. M.; Buschbeck, B.; Wandmacher, T.; Senellart, J.
2011. WMT 2011 Sixth Workshop on Statistical Machine Translation : : Proceedings of the Workshop. Ed.: C. Callison-Burch, 358–364, Association for Computational Linguistics (ACL)
Advances on Spoken Language Translation in the Quaero Program
Boudahmane, K.; Buschbeck, B.; Cho, E.; Crego, J.; Freitag, M.; Lavergne, T.; Ney, H.; Niehues, J.; Peitz, S.; Senellart, J.; Sokolov, A.; Waibel, A.; Wandmacher, T.; Wuebker, J.; Yvon, F.
2011. International Workshop on Spoken Language Translation 2011 San Francisco, CA, USA December 8-9, 2011, 114–120, International Speech Communication Association
Wider context by using bilingual language models in machine translation
Niehues, J.; Herrmann, T.; Vogel, S.; Waibel, A.
2011. Sixth Workshop on Statistical Machine Translation (WMT 2011), Edinburgh, UK, 2011, 198–206, Association for Computational Linguistics (ACL)
Multi Domain Language Model Adaptation using Explicit Semantic Analysis
Kilgour, K.; Kraft, F.; Stüker, S.; Waibel, A.
2011. Proceedings of the 14th International Conference "Speech and Computer" (SPECOM 2011), Kasan, Russia, September 27-30, 2011. Ed.: R. K. Potapova, 76–104, Moscow State Linguistic University
The 2011 KIT QUAERO Speech-to-Text System for the Russian Language
Titov, Y.; Kilgour, K.; Stüker, S.; Waibel, A.
2011. Proceedings of the 14th International Conference "Speech and Computer" (SPECOM 2011), Kasan, Russia, September 27-30, 2011. Ed.: R. K. Potapova, 136–143, Moscow State Linguistic University
Speech Recognition for Machine Translation in Quaero
Lamel, L.; Courcinous, S.; Despres, J.; Gauvain, J.-L.; Josse, Y.; Kilgour, K.; Kraft, F.; Ney, H.; Nußbaum-Thom, M.; Oparin, I.; Schlippe, T.; Schlüter, R.; Schultz, T.; Da Silva, T. F.; Stüker, S.; Sundermeyer, M.; Vieru, B.; Vu, N. T.; Waibel, A.; Woehrling, C.
2011. International Workshop on Spoken Language Translation (IWSLT 2011), San Francisco, California, USA, December 8-9 2011, 121–128, ISCA
The 2011 KIT English ASR System for the IWSLT Evaluation
Stüker, S.; Kilgour, K.; Saam, C.; Waibel, A.
2011. International Workshop on Spoken Language Translation (IWSLT 2011), San Francisco, California, USA, December 8-9 2011, 94–97, ISCA
The 2011 KIT QUAERO Speech-to-Text System for Spanish
Kilgour, K.; Saam, C.; Mohr, C.; Stüker, S.; Waibel, A.
2011. International Workshop on Spoken Language Translation (IWSLT 2011), San Francisco, California, USA, December 8-9 2011, 199–205, ISCA
Towards Social Integration of Humanoid Robots by Conversational Concept Learning
Kraft, F.; Kilgour, K.; Saam, R.; Stuker, S.; Wolfel, M.; Asfour, T.; Waibel, A.
2011. 10th IEEE/RAS International Conference on Humanoid Robots (Humanoids 2010), Nashville, TN, USA, December 6-8, 2010, 352–357, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICHR.2010.5686832
Efficient language model lookahead through polymorphic linguistic context assignment
Soltau, H.; Metze, F.; Fuegen, C.; Waibel, A.
2011. 2002 IEEE International Conference of Acoustics, Speech and Signal Processing, ICASSP, Orlando, FL 2002. [CD-ROM], Orlando, 13th-17th May 2002, In: Proceedings of the International Conference of Acoustics, Speech and Signal Processing, ICASSP, Orlando, FL 2002. [CD-ROM]., IEEEXplore. doi:10.1109/ICASSP.2002.5743816
2010
Spoken news queries over the world wide web
Stüker, S.; Heck, M.; Renner, K.; Waibel, A.
2010. SSCS ’10: Proceedings of the 2010 international workshop on Searching spontaneous conversational speech, 61–64, Association for Computing Machinery (ACM). doi:10.1145/1878101.1878115
Verwendung von Körpersprache des Kopfes zur Verbesserung der User-Robot Kommunikation. student research project
Schneyer, B. N.
2010. Karlsruher Institut für Technologie (KIT)
Spoken Language Translation from parallel Speech Audio: Simultaneous Interpretation as SLT Training Data
Paulik, M.; Waibel, A.
2010. IEEE International Conference on Acoustic Speech and Signal Processing (ICASSP 2010), Dallas, USA, 14-19 March 2010, 5210–5213, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2010.5494998
Tools for Collecting Speech Corpora via Mechanical-Turk
Lane, I.; Waibel, A.; Eck, M.; Rottmann, K.
2010. Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon’s Mechanical Turk. Ed.: C. Callison-Burch, M. Dredze, 184–187, Association for Computational Linguistics (ACL)
Named-Entity Projection and Data-Driven Morphological Decomposition for Field Maintainable Speech-to-Speech Translation Systems
Lane, I. R.; Waibel, A.
2010. 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, Japan, 26-30 September 2010, 2882–2885, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2010-687
Rapid Development of Speech Translation Using Consecutive Interpretation
Paulik, M.; Waibel, A.
2010. 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, Japan, 26-30 September 2010, 2534–2537, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2010-680
Jibbigo: Speech-to-Speech Translation on Mobile Devices
Eck, M.; Lane, I.; Zhang, Y.; Waibel, A.
2010. Proceedings of the Spoken Language Technology Workshop (SLT), Berkeley, USA, 12-15 December 2010, 165–166, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/SLT.2010.5700843
The KIT Translation system for IWSLT 2010
Niehues, J.; Mediani, M.; Herrmann, T.; Heck, M.; Herff, C.; Waibel, A.
2010. Proceedings of the Seventh International Workshop on Spoken Language Translation (IWSLT 2010), 93–98, ISCA
Domain Adaptation in Statistical Machine Translation using Factored Translation Models
Niehues, J.; Waibel, A.
2010. Proceedings of the 14th Annual Conference of the European Association for Machine Translation (EAMT’10), Saint-Raphaël, France, May 27-28 2010. Ed.: F. Yvon, 7 S., European Association for Machine Translation (EAMT)
The Karlsruhe Institute for Technology Translation System for the ACL-WMT 2010
Niehues, J.; Herrmann, T.; Mediani, M.; Waibel, A.
2010. Proceedings of the Joint Fifth Workshop on Statistical Machine Translation and MetricsMATR (WMT’10), Uppsala, Sweden, July 15-16, 2010, 138–142, Curran
Learning Speech Translation from Interpretation. PhD dissertation
Paulik, M.
2010. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000018610
2009
Computers in the Human Interaction Loop
Stiefelhagen, R.; Carlson, R.; Casas, J.; Kleindienst, J.; Lamel, L.; Lanz, O.; Mostefa, D.; Omologo, M.; Pianesi, F.; Polymenakos, L.; Potamianos, G.; Soldatos, J.; Sutchet, G.; Terken, J.; Waibel, A.
2009. Computers in the Human Interaction Loop, 1071–1116, Springer-Verlag
Consolidation-Based Speech Translation and Evaluation Approach
Hori, C.; Zhao, B.; Vogel, S.; Waibel, A.; Kashioka, H.; Nakamura, S.
2009. IEICE transactions / E, E92-D (3), 477–488. doi:10.1587/transinf.E92.D.477
Automatic Distortion Correction for a Full Windshield Head-Up Display System. student research project
Blaicher, F.
2009. Universität Karlsruhe (TH). doi:10.5445/IR/1000166830
Using POMDP for Person Identification Dialogs. student research project
Märgner, P.
2009. Universität Karlsruhe (TH). doi:10.5445/IR/1000166829
Language Model Adaptation using Interlinked Semantic Data. diploma thesis
Kilgour, K.
2009. Universität Karlsruhe (TH)
Rapid Unsupervised Topic Adaptation – a Latent Semantic Approach. PhD dissertation
Tam, Y.-C.
2009. Carnegie Mellon University
Incremental Adaptation of Speech-to-Speech Translation
Bach, N.; Hsiao, R.; Eck, M.; Charoenpornsawat, P.; Vogel, S.; Schultz, T.; Lane, I. R.; Waibel, A.; Black, A. W.
2009. Proceedings of NAACL HLT 2009: Short Papers, 149–152, Association for Computational Linguistics (ACL)
Automatic Translation from Parallel Speech: Simultaneous Interpretation as MT Training Data
Paulik, M.; Waibel, A.
2009. IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009, Moreno, Italy, 13 November - 17 December 2009, 496–501, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2009.5372880
Pronunciation Modeling for Dialectal Arabic Speech Recognition
Al-Haj, H.; Hsiao, R.; Lane, I.; Black, A. W.; Waibel, A.
2009. IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2009, Moreno, Italy, 13 November - 17 December 2009, 525–528, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2009.5373245
Beyond CHIL
Waibel, A.
2009. Computers in the Human Interaction Loop. Ed.: A. Waibel, 367–371, Springer. doi:10.1007/978-1-84882-054-8_30
The Universität Karlsruhe Translation System for the EACL-WMT 2009
Niehues, J.; Herrmann, T.; Kolss, M.; Waibel, A.
2009. Proceedings of the Fourth Workshop on Statistical Machine Translation Athens, Greece, 30 March – 31 March 2009, 80–84, Association for Computational Linguistics (ACL)
Porting Speech Recognition Systems to New Languages Supported by Articulatory Feature Models
Stüker, S.; Waibel, A.
2009. Proceedings / SPECOM ’2009: 13th International Conference Speech and Computer, St. Petersburg, Russia, 21 - 25 June 2009, 1 CD-Rom, Institution of the Russian Academy of Sciences
Human Translations Guided Language Discovery for ASR Systems
Stüker, S.; Besacier, L.; Waibel, A.
2009. Proceedings of the 10th Annual Conference of the International Speech Communication Association (InterSpeech’09), Brighton, UK, September 6-10, 2009. Ed.: M. Uther, 1 CD-ROM, ISCA. doi:10.21437/Interspeech.2009-765
Acquiring and Maintaining Knowledge by Natural Multimodal Dialog. PhD dissertation
Holzapfel, H.
2009. Universität Karlsruhe (TH). doi:10.5445/IR/1000019778
Multimodal Probabilistic Person Tracking and Identification in Smart Spaces. PhD dissertation
Bernardin, K.
2009. Universität Karlsruhe (TH). doi:10.5445/IR/1000018373
A Robust Face Recognition Algorithm for Real-World Applications. PhD dissertation
Ekenel, H. K.
2009. Universität Karlsruhe (TH). doi:10.5445/IR/1000014999
Acoustic Modelling for Under-Resourced Languages. PhD dissertation
Stüker, S.
2009. Universität Karlsruhe (TH). doi:10.5445/IR/1000014983
A System for Simultaneous Translation of Lectures and Speeches. PhD dissertation
Fügen, C.
2009. Universität Karlsruhe (TH). doi:10.5445/IR/1000013594
Robust automatic transcription of lectures. PhD dissertation
Wölfel, M.
2009. Universitätsverlag Karlsruhe. doi:10.5445/KSP/1000011992
Robust Automatic Transcription of Lectures. PhD dissertation
Wölfel, M.
2009. Universität Karlsruhe (TH). doi:10.5445/IR/1000011937
2008
A Dialogue Approach to Learning Object Descriptions and Semantic Categories
Holzapfel, H.; Neubig, D.; Waibel, A.
2008. Robotics and autonomous systems, 56 (11), 1004–1013. doi:10.1016/j.robot.2008.08.012
Probabilistic Integration of Sparse Audio-Visual Cues for Identity Tracking
Bernardin, K.; Stiefelhagen, R.; Waibel, A.
2008. MM’08 : proceedings of the 2008 ACM International Conference on Multimedia, with co-located symposium & workshops; Vancouver, BC, Canada, October 27 - 31, 2008, Association for Computing Machinery (ACM). doi:10.1145/1459359.1459380
Spoken Language Translation : Enabling cross-lingual human–human communication
Waibel, A.; Fügen, C.
2008. IEEE Signal Processing Magazine, 25 (3), 70–79. doi:10.1109/MSP.2008.918415
PAT-Trees zur Automatischen Informationsextraktion aus semi-strukturierten Webdaten. student research project
Händel, R.
2008. Universität Karlsruhe (TH)
Open-set Face Recognition. student research project
Szasz-Toth, L.
2008. Universität Karlsruhe (TH)
Personenidentifikation mit Bayesnetzen in Mensch-Roboter-Dialogen. bachelor’s thesis
Hüthwohl, P. K.
2008. Universität Karlsruhe (TH)
Konfidenzbasierte multimodale Fusion von Audio und Video zur Personenidentifikation. student research project
Große, P.
2008. Universität Karlsruhe (TH)
Automatic Detection of Human Faces and Computer Monitors for Privacy Protection. bachelor’s thesis
Greiner, S. A.
2008. Universität Karlsruhe (TH)
Lernen von Vor- und Nachnamen im natürlichsprachigen Mensch-Roboter-Dialog. student research project
Ultes, S.
2008. Universität Karlsruhe (TH)
Gesichtsbasierte Geschlechtserkennung auf Bildsequenzen. student research project
Siebler, C.; Waibel, A.; Stiefelhagen, R.; Bernardin, K.
2008. Universität Karlsruhe (TH)
Vision Based Recognition of Vehicle Types. student research project
Morlock, D.
2008. Carnegie Mellon University
Oberkörperdetektion mittels Histogram of Oriented Gradients und Haar-Feature Kaskaden. student research project
Vu, N. T.
2008. Universität Karlsruhe (TH)
Lernen von Objekten und deren Bedeutung im Dialog. diploma thesis
Neubig, D.
2008. Universität Karlsruhe (TH)
Social User Model Acquisition through Network Analysis and Interactive Learning. diploma thesis
Putze, F.
2008. Universität Karlsruhe (TH)
Statistical Methods for Automatic Diacritization of Arabic Text. diploma thesis
Schlippe, T.
2008. Universität Karlsruhe (TH)
Human Body Tracking with Rank Priors for Continuous Non-Linear Dimensionality Reduction. diploma thesis
Geiger, A.
2008. Universität Karlsruhe (TH)
Hand Tracking for Human-Robot Interaction with Explicit Occlusion Handling. diploma thesis
Schick, A.
2008. Universität Karlsruhe (TH)
Schätzung der Objektkörperorientierung in einem intelligenten Raum. diploma thesis
Rybok, L.
2008. Universität Karlsruhe (TH)
Automatic Speech Recognition on Vibrocervigraphic and Electromyographic Signals. PhD dissertation
Jou, S.-C. S.
2008. Carnegie Mellon University
Learning and Verification of Names with Multimodal User ID in Dialog
Holzapfel, H.; Waibel, A.
2008. Proceedings of the 2008 International Conference on Cognitive Systems, University of Karlsruhe, Karlsruhe, Germany, April 2-4, 2008, 186–191
Extracting Clues from Human Interpreter Speech for Spoken Language Translation
Waibel, A.; Paulik, M.
2008. IEEE International Conference on Acoustics, Speech and Signal Processing, Las Vegas, NV, USA, 31 March 2008 - 04 April 2008, 5097–5051, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2008.4518805
Communicating Unknown Words in Machine Translation
Waibel, A.; Vogel, S.; Eck, M.
2008. Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC’08). Ed.: N. Calzolari, K. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis, D. Tapias, 1542–1547, Association for Computational Linguistics (ACL)
Lightly Supervised Acoustic Model Training on EPPS Recordings
Paulik, M.; Waibel, A.
2008. 9th Annual Conference of the International Speech Communication Association 2008, Brisbane, Australia 22-26 September 2008. Vol.: 1, 224–227, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2008-69
Class-Based Statistical Machine Translation for Field Maintainable Speech-to-Speech Translation
Waibel, A.; Lane, I. R.
2008. 9th Annual Conference of the International Speech Communication Association 2008, Brisbane, Australia 22-26 September 2008, Volume 5, 2362–2365, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2008-602
Stream Decoding for Simultaneous Spoken Language Translation
Kolss, M.; Vogel, S.; Waibel, A.
2008. 9th Annual Conference of the International Speech Communication Association 2008, Brisbane, Australia 22-26 September 2008, Vol.: 5, 2735–2738, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2008-678
Modelling Multimodal User ID in Dialogue
Holzapfel, H.; Waibel, A.
2008. IEEE Workshop on Spoken Language Technology, SLT 2008, Goa, India, 15-19 December 2008, 113–116, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/SLT.2008.4777853
Speech Processing in Support of Human-Human Communication
Waibel, A.
2008. A Second International Symposium on Universal Communication, ISUC 2008, Osaka, Japan, 15-16 December 2008, 11, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ISUC.2008.78
Confidence Based Multimodal Fusion for Person Identification
Große, P. W. L.; Holzapfel, H.; Waibel, A.
2008. MM ’08: Proceedings of the 16th ACM international conference on Multimedia, 885–888, Association for Computing Machinery (ACM). doi:10.1145/1459359.1459513
Simultaneous Translation of Lectures and Speeches
Fügen, C.; Waibel, A.; Kolss, M.
2008. Machine translation, 21 (4), 209–252. doi:10.1007/s10590-008-9047-0
Simultaneous German-English lecture translation
Kolss, M.; Wölfel, M.; Kraft, F.; Niehues, J.; Paulik, M.; Waibel, A.
2008. Proceedings of the Fifth International Workshop on Spoken Language Translation (IWSLT 2008), 174–181, Association for Computational Linguistics (ACL)
Calibration of a hybrid camera network
Chen, X.; Yang, J.; Waibel, A.
2008. Proceedings: Ninth IEEE International Conference on Computer Vision, Nice, France, October 13-16, 2003., 150–155, vol.1, IEEE Computer Society
Towards human translations guided language discovery for ASR
Stüker, S.; Waibel, A.
2008. Proceedings of the first International Workshop on Spoken Languages Technologies for Under-Resourced Languages, SLTU 2008, Hanoi, Vietnam, May 5, 2008, International Speech Communication Association (ISCA)
2007
Enabling Multimodal Human-Robot Interaction for the Karlsruhe Humanoid Robot
Stiefelhagen, R.; Ekenel, H.; Fügen, C.; Gieselmann, P.; Holzapfel, H.; Kraft, F.; Nickel, K.; Voit, M.; Waibel, A.
2007. IEEE Transactions on Robotics, Special Issue on Human-Robot Interaction, 23 (5), 840–851. doi:10.1109/TRO.2007.907484
Far-Field Speaker Recognition
Jin, Q.; Schultz, T.; Waibel, A.
2007. IEEE Transactions on Audio Speech and Language Processing, 15 (7), 2023–2032
Translating language with technology’s help
Paulik, M.; Stüker, S.; Fügen, C.; Schultz, T.; Waibel, A.
2007. IEEE Potentials - the magazine for high-tech inovators, 26 (3), 30–35. doi:10.1109/MP.2007.361642
Automatische Kalibrierung von Kameranetzwerken basierend auf lokaler Bewegung. diploma thesis
Aslan, C. T.; Waibel, A.; Stiefelhagen, R.; Bernardin, K.
2007, May 31. Karlsruher Institut für Technologie (KIT)
Integrating Paraphrasing Features into Statistical Machine Translation. student research project
Bracht, M.; Waibel, A.; Vogel, S.
2007. Universität Karlsruhe (TH)
Applikation von Lerndialogen auf Objektbezeichner mit multimodaler Datenverarbeitung. student research project
Neubig, D.
2007. Universität Karlsruhe (TH)
Facial Feature Localization based on Multi-Stream Gaussian Mixture Model. student research project
Blum, T.
2007. Universität Karlsruhe (TH)
Communication Shields: Design and Prototype Implementation of a Context-Aware Communication Service. student research project
Pathmaperuma, D.
2007. Universität Karlsruhe (TH)
Sprachmodelladaption mit Hilfe des World Wide Web. student research project
Haidan, S.
2007. Universität Karlsruhe (TH). doi:10.5445/IR/1000166842
Speech Feature Enhancement using Particle Filters with Class-Based Phoneme Models. student research project
Heger, D.
2007. Universität Karlsruhe (TH). doi:10.5445/IR/1000166841
Deploying Semantic Resources for Open Domain Question Answering. diploma thesis
Schläfer, N.
2007. Universität Karlsruhe (TH)
Namenserkennung bekannter und unbekannter Namen. diploma thesis
Ziesemer, S.
2007. Universität Karlsruhe (TH)
Reordering Strategies for Statistical Machine Translation. diploma thesis
Rottmann, K.
2007. Universität Karlsruhe (TH)
Discriminative Word Alignment Models. diploma thesis
Niehues, J.
2007. Universität Karlsruhe (TH)
Robust Speaker Recognition. PhD dissertation
Jin, Q.
2007. Carnegie Mellon University
Fehlerbehandlung in Mensch-Maschine-Dialogen. PhD dissertation
Gieselmann, P.
2007. Universität Stuttgart
Statistical Alignment Models for Translational Equivalence. PhD dissertation
Zhao, B.
2007. Carnegie Mellon University
Integrating Face-ID into an Interactive Person-ID Learning System
Könn, S.; Holzapfel, H.; Ekenel, H. K.; Waibel, A.
2007. Vision systems in the real world: adaptation, learning, evaluation : ICVS 2007, 5th International Conference on Computer Vision Systems, 21.03. - 24.03.2007, Bielefeld, Germany ; conference proceedings, Applied Computer Science Group, Bielefeld University. doi:10.2390/biecoll-icvs2007-119
Multi-stream articulatory feature classifiers for surface electromyographic continuous speech recognition with a multi-stream decoding architecture
Jou, S.-C.; Waibel, A.; Schultz, T.
2007. Proceedings of the 32nd IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2007, April 15-20, 2007, Honolulu, Hawaii, USA, IV-401 - IV-404, Institute of Electrical and Electronics Engineers (IEEE)
Behavior Models for Learning and Receptionist Dialogs
Holzapfel, H.; Waibel, A.
2007. Proceedings of the 8th Annual Conference of the International Speech Communication Association (Interspeech 2007), Antwerp, Belgium, 27-31 August 2007, 2189–2192, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2007-596
Consolidation Based Speech Translation
Hori, C.; Zhao, B.; Vogel, S.; Waibel, A.
2007. IEEE Workshop on Automatic Speech Recognition & Understanding, ASRU 2007, Kyoto, Japan, 09-13 December 2007, 380–385, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2007.4430142
Computer-Supported Human-Human Multilingual Communication
Waibel, A.; Bernardin, K.; Wölfel, M.
2007. 50 Years of Artificial Intelligence. Essays Dedicated to the 50th Anniversary of Artificial Intelligence. Ed.: M. Lungarella, F. Iida, J. Bongard, R. Pfeifer, 271–287, Springer. doi:10.1007/978-3-540-77296-5_25
Speech translation enhanced ASR for European parliament speeches - on the influence of ASR performance on speech translation
Stüker, S.; Paulik, M.; Kolss, M.; Fügen, C.; Waibel, A.
2007. Proceedings / ICASSP 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, April 15 - 20, 2007, Honolulu, Hawaii; 1 CD-Rom, IV/1293–1296, Institute of Electrical and Electronics Engineers (IEEE)
Translation model pruning via usage statistics for statistical machine translation
Eck, M.; Vogel, S.; Waibel, A.
2007. Human language technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics, April 22-27, 2007, Rochester, New York, USA. Hrsg.: C. Sidner, 21–24, ACL
Estimating phrase pair relevance for translation model pruning
Eck, M.; Vogel, S.; Waibel, A.
2007. 11th Machine Translation Summit, organized by the EAMT, Copenhagen, Denmark, September 10-14, 2007. Ed.: B. Maegaard, European Association for Machine Translation (EAMT)
The CMU-UKA statistical machine translation systems for IWSLT 2007
Lane, I.; Zollmann, A.; Nguyen, T.; Bach, N.; Venugopal, A.; Vogel, S.; Rottmann, K.; Zhang, Y.; Waibel, A.
2007. International Workshop on Spoken Language Translation, IWSLT 2007, Trento, Italy, October 15-16, 2007
Continuous electromyographic speech recognition with a multi-stream decoding architecture
Schultz, T.; Waibel, A.
2007. Jou, Szu-Chen (Stan) (Ed.), Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2007, Honolulu, Hawaii, US, 20. April 2007, Institute of Electrical and Electronics Engineers (IEEE)
2006
Design of a Decoder for Statistical Machine Translation using Inverse Transduction Grammar. student research project
Rottmann, K.
2006. Universität Karlsruhe (TH)
Dynamische Vokabularerweiterung für ein grammatikbasiertes Dialogsystem durch Online Ressourcen. student research project
Chouambe, L. C.
2006. Universität Karlsruhe (TH)
New Phrase Alignment Methods for Online Phrase Alignment. master’s thesis
Niehues, J. H.
2006. Universität Karlsruhe (TH)
Fuzzy Personenverfolgung mit Aktiven Kameras. student research project
Camp, F. van de
2006. Universität Karlsruhe (TH)
Natürlichsprachliches Reservierungssystem für einen multimodalen Raum. student research project
Sorg, P.
2006. Universität Karlsruhe (TH)
Local Appearance-based 3D Face Recognition. student research project
Gao, H.
2006. Universität Karlsruhe (TH)
Spontanisierung von Textkorpora für die Sprachmodelierung. diploma thesis
Mimer, B.
2006. Universität Karlsruhe (TH)
MyConnector - Design and Implementation of an Adaptive Context-aware Communication Service. diploma thesis
Kluge, T.
2006. Universität Karlsruhe (TH)
Comparison of Adaptive Beamforming Algorithms for Automatic Speech Recognition. diploma thesis
Klee, U.
2006. Universität Karlsruhe (TH)
Audio- Visuelle Aktivitätenerkennung und Personenverfolgung in einer Büroumgebung. diploma thesis
Wojek, C.
2006. Universität Karlsruhe (TH)
Erkennung von lautlos und kontinuierlich gesprochener Sprache mittels Elektromyografie. diploma thesis
Walliczek, M.
2006. Universität Karlsruhe (TH)
User Adaptive Music Similarity with an Application to Playlist Generation. diploma thesis
Gärtner, D.
2006. Universität Karlsruhe (TH)
Speech Feature Enhancement for Speech Recognition by Sequential Monte Carlo Methods. diploma thesis
Faubel, F.
2006. Universität Karlsruhe (TH)
Statistische Modellierung des Dialogverlaufes in natürlichsprachlichen Dialogsystemen. diploma thesis
Krum, U.
2006. Universität Karlsruhe (TH)
Video-based Face Recognition Using Local Appearance-based Models. diploma thesis
Stallkamp, J.
2006. Universität Karlsruhe (TH)
Darstellung von Emotionen als Repräsentation von Dialogerfolg. diploma thesis
Svojanovski, W.
2006. Universität Karlsruhe (TH)
Machine Listening for Context-Aware Computing. PhD dissertation
Malkin, R. G.
2006. Carnegie Mellon University
A Multilingual Expectations Model for Contextual Utterances in Mixed-Initiative Spoken Dialogue
Holzapfel, H.; Waibel, A.
2006. International Conference on Spoken Language Processing, Interspeech 2006, 1942–1945, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2006-533
Rapid Simulation-Driven Reinforcement Learning of Multimodal Dialog Strategies in Human-Robot Interaction
Prommer, T.; Holzapfel, H.; Waibel, A.
2006. International Conference on Spoken Language Processing, Interspeech 2006, 1918–1921, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2006-527
Multimodal Estimation of User Interruptibility for Smart Mobile Telephones
Malkin, R.; Chen, D.; Yang, J.; Waibel, A.
2006. ICMI ’06: Proceedings of the 8th international conference on Multimodal interfaces, Banff, Canada, 02-04 November 2006, 118–125, Association for Computing Machinery (ACM). doi:10.1145/1180995.1181018
Speech-to-Speech Translation
Vogel, S.; Schultz, T.; Waibel, A.; Yamamoto, S.
2006. Multilingual Speech processing. Ed.: T. Schultz, K. Kirchoff, 317–397, Elsevier. doi:10.1016/B978-012088501-5/50013-5
A Robot Learns to Know People - First Contacts of a Robot
Holzapfel, H.; Schaaf, T.; Ekenel, H. K.; Shaa, C.; Waibel, A.
2006. 29th Annual German Conference on AI, KI 2006, Bremen, Germany, June 14-17, 2006. Proceedings, 302–316, Springer-Verlag
Open domain speech recognition & translation - lectures and speeches
Fügen, C.; Kolss, M.; Bernreuther, D.; Paulik, M.; Stüker, S.; Vogel, S.; Waibel, A.
2006. International Conference on Acoustics, Speech, and Signal Processing 2006, ICASSP 2006, Toulouse, France, May 14-19, 2006., IEEEXplore
A flexible online server for machine translation evaluation
Eck, M.; Vogel, S.; Waibel, A.
2006. Proceedings of the 11th Annual Conference of the European Association of Machine Translation, EAMT 2006, Oslo, Norway, June 19-20, 2006., Association for Computational Linguistics (ACL)
Directing attention in online aggregate sensor streams via auditory blind value assignment
Chen, D.; Yang, J.; Waibel, A.; Malkin, R.
2006. Proceedings: 2006 IEEE International Conference on Multimedia and Expo, ICME 2006, July 9-12, 2006, Hilton, Toronto, Ontario, Canada., 2137–2140, IEEE Service Center
Speech-to-speech translation services for the Olympic Games 2008
Stüker, S.; Zong, C.; Reichert, J.; Cao, W.; Kolss, M.; Xie, G.; Peterson, K.; Ding, P.; Arranz, V.; Yu, J.; Waibel, A.
2006. Machine learning for multimodal interaction - 3rd International Workshop (MLMI 2006), May 1 - 4, 2006, Bethesda, MD, USA, 297–308, Springer-Verlag
Dynamic extension of a grammar-based dialogue system - constructing an all-recipes knowing robot
Gieselmann, P.; Waibel, A.
2006. Interspeech 2006. Ninth International Conference on Spoken Language Processing (ICSLP), September 17-21, 2006, Pittsburgh, PA, USA., paper 1091, International Society for Computers and Their Applications (ISCA)
The UKA/CMU statistical machine translation system for IWSLT 2006
Eck, M.; Lane, I. R.; Bach, N.; Hewavitharana, S.; Kolss, M.; Zhao, B.; Hildebrand, A. S.; Vogel, S.; Waibel, A.
2006. Proceedings of the International Workshop on Spoken Language Translation, IWSLT 2006, Kyoto, Japan, November 28, 2006., 130–137, Association for Computational Linguistics (ACL)
A pattern learning approach to question answering within the Ephyra framework
Schläfer, N.; Gieselmann, P.; Schaaf, T.; Waibel, A.
2006. Proceedings / Text, speech and dialogue - 9th International Conference (TSD 2006), September 11 - 15, 2006, Brno, Czech Republic. Ed.: P. Sojka, 687–694, Springer-Verlag. doi:10.1007/11846406_86
Articulatory Feature Classification using Surface Electromyography
Maier-Hein, L.; Schultz, T.; Waibel, A.
2006. Jou, Szu-Chen (Stan) (Ed.), Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2006, Toulouse, France, 15.-19. Mai 2006, Institute of Electrical and Electronics Engineers (IEEE)
Open Domain Speech Translation: From Seminars and Speeches to Lectures
Fügen, C.; Kolss, M.; Paulik, M.; Stüker, S.; Schultz, T.; Waibel, A.
2006. Journies d’E’tude sur la Parole (JEP) Invited paper and keynote talk, JEP 2006, Dinard, France, 14. Juni 2006, IEEEXplore. doi:10.1109/ICASSP.2006.1660084
Optimizing Components for Handheld Two-way Speech Translation for an English-Iraqi Arabic System
Hsiao, R.; Venugopal, A.; Köhler, T.; Zhang, Y.; Charoenpornsawat, P.; Zollmann, A.; Vogel, S.; Black, A. W.; Schultz, T.; Waibel, A.
2006. Proceedings of the 9th ISCA International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh PA, USA, 30. September 2006, 765–768, International Speech Communication Association (ISCA)
Sub-Word Unit based Non-audible Speech Recognition using Surface Electromyography
Walliczek, M.; Kraft, F.; Schultz, T.; Waibel, A.
2006. Jou, Szu-Chen (Stan) (Ed.), Proceedings of the 9th ISCA International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh PA, USA, 30. September 2006, 1487–1490, International Society for Computers and Their Applications (ISCA)
Towards Continuous Speech Recognition Using Surface Electromyography
Schultz, T.; Walliczek, M.; Kraft, F.; Waibel, A.
2006. Jou, Szu-Chen (Stan) (Ed.), Proceedings of the 9th ISCA International Conference on Spoken Language Processing, Interspeech 2006, Pittsburgh PA, USA, 30. September 2006, International Society for Computers and Their Applications (ISCA)
Speech Translation Enhanced Automatic Speech Recognition
Paulik, M.; Stüker, S.; Fügen, C.; Schultz, T.; Schaaf, T.; Waibel, A.
2006. Proceedings of the Automatic Speech Recognition and Understanding Workshop, ASRU 2005, Cancun, Mexico, 30. Dezember 2005, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2005.1566488
2005
The connector - facilitating context-aware communication
Danninger, M.; Flaherty, G.; Bernardin, K.; Ekenel, H. K.; Köhler, T.; Stiefelhagen, R.; Waibel, A.; Malkin, R.
2005. Proceedings of the Seventh International Conference on Multimodal Interfaces, October 4-6, 2005, Trento, Italy, 69–75, Association for Computing Machinery (ACM). doi:10.1145/1088463.1088478
Temporal ICA for classification of acoustic events in a kitchen environment
Kraft, F.; Schaaf, T.; Waibel, A.; Malkin, R.
2005. INTERSPEECH 2005 - Eurospeech: 9th European Conference on Speech Communication and Technology, Lisboa, September 4-8, 2005., International Society for Computers and Their Applications (ISCA). doi:10.21437/Interspeech.2005-261
Training and evaluating error minimization decision rules for statistical machine translation
Venugopal, A.; Waibel, A.
2005. Proceedings of the ACL Workshop on Building and Using Parallel Texts, June 2005, Ann Arbor, 208–215, Association for Computational Linguistics (ACL)
End-to-end evaluation in JANUS: a speech-to-speech translation system: evluation systems
Gates, D.; Lavie, A.; Levin, L.; Waibel, A.; Gavalda, M.; Mayfield, L.; Woszczyna, M.; Zhan, P.
2005. Dialogue Processing in Spoken Language Systems: ECAI’96 Workshop, Budapest, Hungary, August 1996, Revised Papers, In: Proceedings of the ECAI 96, Budapest 1996., Springer Berlin Heidelberg. doi:10.1007/3-540-63175-5_47
Adaptation of the translation model for statistical machine translation based on information retrieval
Hildebrand, A. S.; Eck, M.; Vogel, S.; Waibel, A.
2005. EAMT 2005 - Proceedings of the 10th EAMT Conference: Practical applications of machine translation, Budapest, May 30–31, 2005, Association for Computational Linguistics (ACL)
Adaptation of the translation model for statistical machine translation based on information retrieval
Hildebrand, A. S.; Eck, M.; Vogel, S.; Waibel, A.
2005. IEEE Computer Society Conference on Computer Vision and Pattern Recognition, CVPR, June 20-25, 2005, San Diego, USA., IEEE Computer Society
Automatically Transcribing Meetings Using Distant Microphones
Metze, F.; Fügen, C.; Pan, Y.; Waibel, A.
2005. Proceedings. (ICASSP ’05). IEEE International Conference on Acoustics, Speech, and Signal Processing, 2005, Philadelphia, PA, USA, 23-23 March 2005, 989–992, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2005.1415282
Klärungsfragen und Subdialoge durch Anomalieanalyse für natürlichsprachliche Dialogsysteme. student research project
Krum, U.
2005. Universität Karlsruhe (TH)
Sprecherlokalisierung und Eingabefusion für einen humanoiden Roboter. student research project
Walliczek, M.
2005. Universität Karlsruhe (TH)
Entwicklung eines planbasierten Dialogmodells für Informationssysteme. student research project
Nienhüser, D.; Rutter, I.; Ziesemer, S.
2005. Universität Karlsruhe (TH)
Pattern Learning and Knowledge Annotation for Question Answering. student research project
Schläfer, M.
2005. Universität Karlsruhe (TH). doi:10.5445/IR/1000166860
Akustische Modelle mit Mixturen inverser Kovarianzen. master’s thesis
Eller, A.
2005. Universität Karlsruhe (TH)
Continuous Audio Object Recognition. diploma thesis
Kraft, F.
2005. Universität Karlsruhe (TH)
Mehrpersonentracking mittels Farbe und Detektorkaskaden. diploma thesis
Elbs, A.
2005. Universität Karlsruhe (TH)
Lernen neuer Wörter im Dialog. diploma thesis
Schulz, B.
2005. Universität Karlsruhe (TH)
Lattice Evaluation Techniques Applied to Domains of Statistical Machine Translation. diploma thesis
Bernreuther, D.
2005. Universität Karlsruhe (TH)
Proaktive Initiierung von Dialogen für humanoide Roboter. diploma thesis
Schaa, C.
2005. Universität Karlsruhe (TH)
Translation Model Adaption for Statistical Machine Translation using Information Retrieval. diploma thesis
Hildebrand, A. S.
2005. Universität Karlsruhe (TH)
Tracking der artikularen Bewegung des Oberkörpers in Stereobildfolgen. diploma thesis
Ziegler, J.
2005. Universität Karlsruhe (TH)
Improving Active Appearance Models from 3D Information : With Application to Facial Expression Recognition. diploma thesis
Liebelt, J.; Waibel, A.; Stiefelhagen, R.; Yang, J.
2005. Universität Karlsruhe (TH)
Determining User State and Mental Task Demand From Electroencephalographic Data. diploma thesis
Honal, M.
2005. Universität Karlsruhe (TH)
Multilingual named Entity Extraction and Translation from Text and Speech. PhD dissertation
Huang, F.
2005. Carnegie Mellon University
Augmenting a Statistical Translation System with a Translation Memory
Hewavitharana, S.; Vogel, S.; Waibel, A.
2005. Proceedings of the 10th EAMT Conference: Practical applications of machine translation, Budapest, Hungary, 30-31 May 2005, 126–132, Association for Computational Linguistics (ACL)
The "FAME" Interactive Space
Metze, F.; Gieselmann, P.; Holzapfel, H.; Kluge, T.; Rogina, I.; Waibel, A.; Crowley, J.; Reignier, P.; Vaufreydaz, D.; Berard, F.; Cohen, B.; Coutaz, J.; Rouillard, S.; Arranz, V.; Bertran, M.; Rodriguez, H.
2005. Machine Learning for Multimodal Interaction. Second International Workshop, MLMI 2005, Edinburgh, UK, July 11-13, 2005, Revised Selected Papers. Ed.: S. Renals, 126–137, Springer. doi:10.1007/11677482_11
Spontaneous Speech Consolidation for Spoken Language Applications
Hori, C.; Waibel, A.
2005. 9th Annual Conference of the International Spech Communication Association, Interspeech 2005, Lisbon, Portugal, 04-08 September 2005, 617–620, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2005-65
Learning a Log-Linear Model with Bilingual Phrase-Pair Features for Statistical Machine Translation
Zhao, B.; Waibel, A.
2005. IJCNLP-05: Fourth SIGHAN Workshop on Chinese Language Processing. Proceedings of the Workshop. Ed.: C. Huang, G. Levow, 79–86, Association for Computational Linguistics (ACL)
Clarification Questions to Improve Dialogue Flow and Speech Recognition in Spoken Dialogue Systems
Krum, U.; Holzapfel, H.; Waibel, A.
2005. 9th Annual Conference of the International Spech Communication Association, Interspeech 2005, Lisbon, Portugal, 04-08 September 2005, 3417–3420, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2005-831
Bilingual Word Spectral Clustering for Statistical Machine Translation
Zhao, B.; Xing, E. P.; Waibel, A.
2005. Building and Using Parallel Texts: Data-Driven Machine Translation and Beyond Proceedings of the Workshop 29-30 June 2005, University of Michigan, Ann Arbor, Michigan, USA. Ed.: P. Koehn, J. Martin, R. Mihalcea, C. Monz, T. Pedersen, 25–32, Association for Computational Linguistics (ACL)
Classifying user environment for mobile applications using linear autoencoding of ambient audio
Waibel, A.; Malkin, R.
2005. Proceedings / 2005 IEEE International Conference on Acoustics, Speech, and Signal Processing, May 18 - 23, 2005, Philadelphia, Pennsylvania, USA; Vol. 5, 509–512, IEEE Operations Center. doi:10.1109/ICASSP.2005.1416352
What makes human-robot dialogues struggle?
Gieselmann, P.; Waibel, A.
2005. Proceedings of the 9th Workshop on the Semantics and Pragmatics of Dialogue, SEMDIAL, Nancy, France, June 9-11, 2005, 21–28
Rapid porting of ASR-systems to mobile devices
Köhler, T.; Fügen, C.; Stüker, S.; Waibel, A.
2005. Proceedings of the 9th European Conference on Speech Communication and Technology, Interspeech 2005, September 4-8, 2005, Lisboa, Portugal., International Society for Computers and Their Applications (ISCA). doi:10.21437/Interspeech.2005-116
Document driven machine translation enhanced ASR
Waibel, A.; Paulik, M.; Fügen, C.; Stüker, S.; Schultz, T.; Schaaf, T.
2005. Proceedings / Interspeech’2005 - Eurospeech, 9th European Conference on Speech Communication and Technology, September 4-8, 2005, Lisboa, Portugal; 1 CD-Rom, 2261–2264, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2005-720
The CMU statistical machine translation system for IWSLT 2005
Hewavitharana, S.; Zhao, B.; Eck, M.; Hori, C.; Vogel, S.; Waibel, A.
2005. International Workshop on Spoken Language Translation, IWSLT 2005, October 24-25, 2005, Pittsburgh, PA, USA., Association for Computational Linguistics (ACL)
Low cost portability for statistical machine translation based on n-gram frequency and TF-IDF
Eck, M.; Vogel, S.; Waibel, A.
2005. International Workshop on Spoken Language Translation, IWSLT 2005, 24th-25th October , 2005, Pittsburgh, USA., Association for Computational Linguistics (ACL)
Whispery Speech Recognition using Adapted Articulatory Features
Schultz, T.; Waibel, A.
2005. Jou, Szu-Chen (Stan) (Ed.), Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , ICASSP 2005, Philadelphia, Pennsylvania, 23rd March 2005, IEEEXplore. doi:10.1109/ICASSP.2005.1415287
Document Driven Machine Translation Enhanced Automatic Speech Recognition
Paulik, M.; Fügen, C.; Schaaf, T.; Schultz, T.; Stüker, S.; Waibel, A.
2005. Proceedings of the Eurospeech, Eurospeech 2005, Lisbon, Portugal, 30. September 2005
Articulatory features for conversational speech recognition. PhD dissertation
Metze, F.
2005. Universität Karlsruhe (TH). doi:10.5445/IR/1000005908
Compensating hyperarticulation for automatic speech recognition. PhD dissertation
Soltau, H.
2005. Universität Karlsruhe (TH). doi:10.5445/IR/1000003651
2004
Automatic detection and recognition of signs from natural scenes
Chen, X.; Yang, J.; Zhang, J.; Waibel, A.
2004. IEEE Transactions on Image Processing, 13 (1), 87–99
Design and Implementation of a Cepstral Domain Likelihood Maximizing Beamformer for Speech Recognition. student research project
Raub, D. L.
2004. Universität Karlsruhe (TH)
Visuelle Personenverfolgung mit Partikelfilter. student research project
Wojek, C. A.
2004. Universität Karlsruhe (TH)
Automatic Classification of Non-Verbal Utterances in Japanese Spontaneous Speech. student research project
Svojanovski, W.
2004. Universität Karlsruhe (TH)
Einbettung von Grammatikregeln in Ngramm-Sprachmodelle. student research project
Fleischer, F.
2004. Universität Karlsruhe (TH)
Flexible Ballungsverfahren für Graphembasierte Spracherkennung. student research project
Mimer, B.
2004. Universität Karlsruhe (TH)
Implementierung und Evaluation eines Systems zur Erkennung von Gesichtern. student research project
Harres, D.
2004. Universität Karlsruhe (TH)
Konfidenzmaße im Dialogmanagement. student research project
Hauth, A.
2004. Universität Karlsruhe (TH)
Audio-Visuelle Spracherkennung auf großem Vokabular. diploma thesis
Kratt, J.
2004. Universität Karlsruhe (TH)
Visuelle Schätzung der horizontalen Kopfdrehung in Multikameraumgebungen. diploma thesis
Voit, M.
2004. Universität Karlsruhe (TH)
Recognizing Sloppy Speech. PhD dissertation
Yu, H.
2004. Carnegie Mellon University
Improving Named Entity Translation Combining Phonetic and Semantic Similarities
Huang, F.; Vogel, S.; Waibel, A.
2004. Proceedings of the Human Language Technology Conference of the North American Chapter of the Association for Computational Linguistics, HLT-NAACL 2004, Boston, USA, 02-07 May 2004, 281–288, Association for Computational Linguistics (ACL)
Performance Comparisons of All-pass Transform Adaptation with Maximum Likelihood Linear Regression
McDonough, J.; Waibel, A.
2004. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2004, Volume 1, Montreal, Canada, 17-21 May 2004, I-313-I-316, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2004.1325985
Minimum Kullback-Leibler Distance Based Multivariate Gaussian Feature Adaptation for Distant-Talking Speech Recognition
Pan, Y.; Waibel, A.
2004. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2004, Volume 1, Montreal, Canada, 17-21 May 2004, I-1029 - I-1032, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2004.1326164
Integrating Thumbnail Features For Speech Recognition Using Conditional Exponential Models
Yu, H.; Waibel, A.
2004. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2004, Volume 1, Montreal, Canada, 17-21 May 2004, 893–896, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2004.1326130
Phrase Pair Rescoring with Term Weighting for Statistical Machine Translation
Zhao, B.; Vogel, S.; Waibel, A.; Eck, M.
2004. Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing. Ed.: D. Lin, D. Wu, 206–213, Association for Computational Linguistics (ACL)
Improving Statistical Machine Translation in the Medical Domain using the Unified Medical Language system
Eck, M.; Vogel, S.; Waibel, A.
2004. Proceedings of the 20th International Conference on Computational Linguistics, COLING 2004, Geneva, Switzerland, 23-27 August 2004, 792–798, Association for Computational Linguistics (ACL)
Interpreting BLEU/NIST Scores: How Much Improvement Do We Need to Have a Better System?
Zhang, Y.; Vogel, S.; Waibel, A.
2004. Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04). Ed.: M. Lino, 2051–2054, European Language Resources Association (ELRA)
Natural Human-Robot Interaction Using Speech, Head Pose and Gestures
Stiefelhagen, R.; Fügen, C.; Gieselmann, P.; Holzapfel, H.; Nickel, K.; Waibel, A.
2004. Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS 2004, Sendai, Japan, 28 September - 02 October 2004, 2422–2427, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/IROS.2004.1389771
The ISL EDTRL System
Reichert, J.; Waibel, A.
2004. Proceedings of the International Workshop on Spoken Language Translation, IWSLT 2004, Kyoto, Japan, 30 September - 01 October 2004, 61–64, Association for Computational Linguistics (ACL)
Towards Named Entity Extraction and Translation in Spoken Language Translation
Huang, F.; Vogel, S.; Waibel, A.
2004. Proceedings of the First International Workshop on Spoken Language Translation: Papers, Kyoto, Japan, 30 September - 01 October 2004, 131–137, Association for Computational Linguistics (ACL)
Worldwide Ongoing Activities On Multilingual Speech to Speech Translation
Lazzari, G.; Waibel, A.; Zong, C.
2004. Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, 04-08 October 2004, 373–376, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2004-161
Speech Translation: Past, Present and Future
Waibel, A.
2004. Proceedings of the International Conference on Speech and Language Processing, INTERSPEECH 2004 - ICSLP, Jeju Island, Korea, 04-08 October, 2004, 353–356, International Speech Communication Association (ISCA). doi:10.21437/Interspeech.2004-156
Speaker adaptation with all-pass transforms
McDonough, J.; Schaaf, T.; Waibel, A.
2004. Speech Communication, 42 (1), 75–91. doi:10.1016/j.specom.2003.09.005
Minimum Kullback-Leibler distance based multivariate Gaussian feature adaptation for distant-talking
Pan, Y.; Waibel, A.
2004. Proceedings / 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), May 17 - 21, 2004, Montreal, Quebec, Canada; Vol. 1, 1029–1032, IEEE Operations Center
Integrating thumbnail features for speech recognition using conditional
Yu, H.; Waibel, A.
2004. roceedings / 2004 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2004), May 17 - 21, 2004, Montreal, Quebec, Canada; Vol. 1, 893–896, IEEE Operations Center
Improving statistical machine translation in the medical domain using the unified medical language system
Eck, M.; Vogel, S.; Waibel, A.
2004. Proceedings: COLING Geneva 2004, 20th International Conference on Computational Linguistics, August 23-27, 2004., No. 792, Association for Computational Linguistics (ACL)
The ISL statistical machine translation system for spoken language translation
Waibel, A.; Vogel, S.; Hewavitharana, S.; Kolss, M.
2004. Proceedings of the International Workshop on Spoken Language Translation (IWSLT 2004), September 30 - October 1, 2004, Kyoto, Japan, 65–72, Kyoto
Adaptation for soft whisper recognition using a throat microphone
Jou, S. C.; Schultz, T.; Waibel, A.
2004. Interspeech 2004. Proceedings of the 8th International Conference on Spoken Language Processing, ICSLP, October 4-8, 2004, International Convention Center Jeju, Jeju Island, Korea. Hrsg.: S.H. Kim, 1493–1496, Sunjin Printing. doi:10.21437/Interspeech.2004-565
Low cost portability for statistical machine translation based on n-gram coverage
Eck, M.; Vogel, S.; Waibel, A.
2004. Proceedings of Machine Translation Summit X: Papers, Phuket, 13th-15th September 2005, 227–234, Association for Computational Linguistics (ACL)
A Thai Speech Translation System For Medical Dialogs
Schultz, T.; Alexander, D.; Black, A. W.; Peterson, K.; Suebvisai, S.; Waibel, A.
2004. Demonstration Papers at HLT-NAACL 2004, Boston, Massachusetts. 2nd - 7th May 2004, 34–35, Association for Computational Linguistics (ACL)
Speaker Segmentation and Clustering in Meetings
Jin, Q.; Laskowski, K.; Schultz, T.; Waibel, A.
2004. NIST Meeting Recognition Workshop, NIST 2004, Montreal, Canada, 17. Mai 2004
Towards Language Portability in Statistical Speech Translation
Waibel, A.; Schultz, T.; Fügen, C.; Honal, M.; Kolss, M.; Reichert, J.; Stüker, S.
2004. Invited paper, Special Session on Multilinguality in Speech Processing, Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2004, Montreal, Canada, 17th-21st Mai 2004, IEEEXplore. doi:10.1109/ICASSP.2004.1326657
Adaptation for Soft Whisper Recognition
Schultz, T.; Waibel, A.
2004. Jou, Szu-Chen (Stan) (Ed.), International Conference of Spoken Language Processing , ICSLP 2004, Jeju Island, South Korea, 30. Oktober 2004
Large Vocabulary Audio-Visual Speech Recognition using the Janus Speech Recognition Toolkit
Kratt, J.; Metze, F.; Stiefelhagen, R.; Waibel, A.
2004. Pattern recognition. DAGM Symposium. Teil: 26. Tübingen, Germany, August 30 - September 1, 2004. Ed.: C. E. Rasmussen, 488–495, Springer-Verlag. doi:10.1007/978-3-540-28649-3_60
Tight Coupling of Speech Recognition and Dialog Management - Dialog-Context Dependent Grammar Weighting for Speech Recognition
Fügen, C.; Holzapfel, H.; Waibel, A.
2004. Proceedings of the International Conference of Spoken Language Processing (ICSLP-2004), INTERSPEECH 2004, Jeju Island, Korea, 4th-8th Oct. 2004, 169–172, Jeju Island
Tight Coupling of Speech Recognition and Dialog Management - Dialog-Context Grammar Weighting for Speech Recognition
Fügen, C.; Holzapfel, H.; Waibel, A.
2004. Proceedings of the International Conference on Spoken Language Processing, ICSLP 2004, Jeju Island, Korea, Jeju Island
Fehlertolerante multimodale Fusion von n-besten Listen von Spracheingabe und Zeigegesten
Holzapfel, H.; Waibel, A.
2004. Elektronische Sprachsignalverarbeitung, ESSV 2004, Cottbus, Cottbus
CHIL - Computers in the Human Interaction Loop
Waibel, A.; Steusloff, H.; Stiefelhagen, R.
2004. CHIL Project Consortium (Ed.), Rich Transcription Meeting Recognition Workshop, NIST ICASSP, Montreal, 17th Mai 2004, Montreal
CHIL - Computers in the Human Interaction Loop
Waibel, A.; Steusloff, H.; Stiefelhagen, R.
2004. CHIL Project Consortium (Ed.), 5th International Workshop on Image Analysis for Multimedia Interactive, IEEE WIAMIS 2004, Lisboa, 21st-23rd April 2004, Institute of Electrical and Electronics Engineers (IEEE)
Natürliche Mensch-Roboter Interaktion mittels Sprache, Blickrichtung und Gestik
Stiefelhagen, R.; Fuegen, C.; Gieselmann, P.; Holzapfel, H.; Nickel, K.; Waibel, A.
2004. Robotik 2004 : Leistungsstand - Anwendungen - Visionen - Trends ; Tagung VDI/VDE, München, 17.-18. Juni 2004, VDI Fachmedien GmbH & Co. KG
Natural Human-Robot Interaction using Speech, Gaze and Gestures
Stiefelhagen, R.; Fuegen, C.; Gieselmann, P.; Holzapfel, H.; Nickel, K.; Waibel, A.
2004. IEEE/RSJ International Conference on Intelligent Robots and Systems, Sendai, Japan, Sept. 2004, Sendai
Identifying the Addressee in Human-Human-Robot Interactions based on Head Pose and Speech
Katzenmaier, M.; Stiefelhagen, R.; Schultz, T.; Rogina, I.; Waibel, A.
2004. International Conference on Multimodal Interfaces ICMI, State College, USA, Oct. 2004, 144–151, Association for Computing Machinery (ACM). doi:10.1145/1027933.1027959
Language Model Adaptation For Statistical Machine Translation Based On Information Retrieval
Eck, M.; Vogel, S.; Waibel, A.
2004. 4th International Conference on Language Resources and Evaluation, Lissabon, 24th-28th May 2004, Lissabon
Erkennen und Lernen neuer Wörter. PhD dissertation
Schaaf, T.
2004. Universität Karlsruhe (TH). doi:10.5445/IR/1000001684
2003
Extracting Named Entity Translingual Equivalence with Limited Resources
Huang, F.; Vogel, S.; Waibel, A.
2003. ACM transactions on Asian language information processing, 2 (2), 124–129. doi:10.1145/974740.974745
Efficient optimization for bilingual sentence alignment based on linear regression
Zechner, K.; Vogel, S.; Waibel, A.
2003. Proceedings of the HLT-NAACL 2003 Workshop on Building and Using Parallel Texts: Data Driven Machine Translation and Beyond, 81–87, Association for Computational Linguistics (ACL). doi:10.3115/1118905.1118920
A Statistical Approach to Automatic Speech Summarization
Hori, C.; Furui, S.; Malkin, R.; Yu, H.; Waibel, A.
2003. EURASIP journal on advances in signal processing, 2003 (2), 128–139. doi:10.1155/S1110865703211112
Speechalator: two-way speech-to-speech translation on a consumer PDA
Waibel, A.; Badran, A.; Black, A. W.; Frederking, R.; Gates, D.; Lavie, A.; Levin, L.; Lenzo, K.; Tomokiyo, L.-M.; Reichert, J.; Schultz, T.; Wallace, D.; Woszczyna, M.; Zhang, J.
2003. NAACL-Demonstrations ’03: Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology: Demonstrations - Volume 4, 29–30, Association for Computing Machinery (ACM). doi:10.3115/1073427.1073442
Verfolgen der Sprachaufmerksamkeit mit Hilfe der Ausgabe des Spracherkenners. student research project
Katzenmaier, M. D.
2003. Universität Karlsruhe (TH)
Erkennung von Zeigegesten basierend auf 3D-Traking von Kopf und Händen. diploma thesis
Nickel, K.
2003. Universität Karlsruhe (TH)
Emotionen als Parameter der Dialogverarbeitung. diploma thesis
Holzapfel, H.
2003. Universität Karlsruhe (TH)
Statistische Verfahren zur Bestimmung der Vokalisierung arabischer Texte. diploma thesis
Déltour, A.
2003. Universität Karlsruhe (TH)
Adaptive Verfahren zur kamerabasierten Erkennung von Gesichtsmerkmalen im PKW. diploma thesis
King, A.
2003. Universität Karlsruhe (TH)
Signalbasierte Verfahren zur robusten Spracherkennung im Cockpit von Luftfahrzeugen. diploma thesis
Dambier, M.
2003. Universität Karlsruhe (TH)
Schreiberadaption für Online-Handschrifterkennung. diploma thesis
Hermann, E.
2003. Universität Karlsruhe (TH)
Advances in ISL’s Lecture and Meeting Trackers
Waibel, A.; Rogina, I.
2003. Proceedings of the ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition, SSPR 2003, Tokyo, Japan, 13-16 April 2003, International Speech Communication Association (ISCA)
Maximum Mutual Information Speaker Adapted Training with Semi-Tied Covariance Matrices
McDonough, J.; Waibel, A.
2003. Proceedings of the International Conference on Acoustic, Speech, and Signal Processing, ICASSP 2003, Hong Kong, China, 06-10 April 2003, I-128 - I-131, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2003.1198733
Recent Advances in Lingwear: A Wearable Linguistic Assistant for Tourists
Fügen, C.; Schultz, T.; Hu, J.-C.; Waibel, A.
2003. Proceedings of the International Conference on Acoustic, Speech, and Signal Processing, ICASSP 2003
Effective Phrase Translation Extraction from Alignment Models
Venugopal, A.; Vogel, S.; Waibel, A.
2003. Proceedings of the 41st Annual Conference of the Association for Computational Linguistics, ACL 2003, Sapporo, Japan, 07-12 July 2003, 319–326, Association for Computational Linguistics (ACL). doi:10.3115/1075096.1075137
The CMU Statistical Machine Translation System
Vogel, S.; Zhang, Y.; Huang, F.; Tribble, A.; Venugopal, A.; Zhao, B.; Waibel, A.
2003. Proceedings of Machine Translation Summit IX, New Orleans, USA, 23-27 September 2003: Papers, Association for Computational Linguistics (ACL)
Integrated Phrase Segmentation and Alignment Algorithm for Statistical Machine Translation
Zhang, Y.; Vogel, S.; Waibel, A.
2003. Proceedings of the International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2003, Beijing, China, 26-29 October 2003, 567–573, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/NLPKE.2003.1275970
Overlapping Phrase-Level Translation Rules in an SMT Engine
Tribble, A.; Vogel, S.; Waibel, A.
2003. Proceedings of the International Conference on Natural Language Processing and Knowledge Engineering, NLP-KE 2003, Beijing, China, 26-29 October 2003, 574–579, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/NLPKE.2003.1275971
Using Articulatory Information for Speaker Adaptation
Metze, F.; Waibel, A.
2003. Proceedings of the Automatic Speech Recognition and Understanding Workshop, ASRU 2003, St. Thomas, USA, 30 November - 04 December 2003, 405–410, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2003.1318475
Flexible parameter tying for conversational speech recognition
Yu, H.; Waibel, A.
2003. Proceedings / ISCA & IEEE Workshop on Spontaneous Speech Processing and Recognition (SSPR 2003), April 13-16, 2003, Tokyo, Japan, International Society for Computers and Their Applications (ISCA)
Automatic extraction of named entity translingual equivalence based on multi-feature cost minimization
Huang, F.; Vogel, S.; Waibel, A.
2003. Proceedings of the 41st Annual Meeting of the Association for Computational Linguistics, Sapporo Convention Center, Sapporo, Japan, July 7-12, 2003, 9–16, Association for Computational Linguistics (ACL). doi:10.3115/1119384.1119386
Minimum variance distortionless response on a warped frequency scale
Wölfel, M.; McDonough, J.; Waibel, A.
2003. Proceedings / Eurospeech 2003 - 8th European Conference on Speech Communication and Technology, September 1 - 4, 2003, Geneva, Switzerland; Vol. 2, 1021–1024, International Society for Computers and Their Applications (ISCA)
Towards multimodal communication with a household robot
Gieselmann, P.; Fügen, C.; Holzapfel, H.; Schaaf, T.; Waibel, A.
2003. Conference documentation: International Conference on Humanoid Robots, HUMANOIDS 2003, October 1-3, 2003, Karlsruhe., IEEE Robotics and Automation Society
Multilingual Articulatory Features. diploma thesis
Stüker, S.
2003. Karlsruher Institut für Technologie (KIT). doi:10.5445/IR/1000009176
Multilingual Articulatory Features
Stüker, S.; Schultz, T.; Metze, F.; Waibel, A.
2003. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , ICASSP 2003, Hong Kong, China, 30. April 2003, IEEEXplore. doi:10.1109/ICASSP.2003.1198737
Comparison of Acoustic Model Adaptation Techniques on Non-native Speech
Wang, Z.; Schultz, T.; Waibel, A.
2003. Proceedings of the International Conference on Acoustic, Speech, and Signal Processing, ICASSP 2003, 540–543, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2003.1198837
SMaRT: The Smart Meeting Room Task at ISL
Waibel, A.; Schultz, T.; Bett, M.; Malkin, R.; Rogina, I.; Stiefelhagen, R.; Yang, J.
2003. 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP ’03). ICASSP 2003, Hong Kong, China, 06th-10th April 2003, IEEEXplore. doi:10.1109/ICASSP.2003.1202752
Speechalator: Two-way Speech-to-Speech Translation in your Hand
Waibel, A.; Badran, A.; Black, A. W.; Frederking, R.; Gates, D.; Lavie, A.; Levin, L.; Lenzo, K.; Tomokiyo, L.-M.; Reichert, J.; Schultz, T.; Wallace, D.; Woszczyna, M.; Zhang, J.
2003. Human Language Technology & North American Chapter of the Association for Computational Linguistics, HLT-NAACL 2003, Edmonton, Alberta, Canada, 01. Juni 2003
Speechalator: two-way speech-to-speech translation on a consumer PDA
Waibel, A.; Badran, A.; Black, A. W.; Frederking, R.; Gates, D.; Lavie, A.; Levin, L.; Lenzo, K.; Tomokiyo, L.-M.; Reichert, J.; Schultz, T.; Wallace, D.; Woszczyna, M.; Zhang, J.
2003. Proceedings of the 8th European Conference on Speech Communication and Technology , Eurospeech 2003, Genf, Schweiz, 1st-4th September 2003, 369–372, International Society for Computers and Their Applications (ISCA)
Integrating Multilingual Articulatory Features into Speech Recognition
Stüker, S.; Metze, F.; Schultz, T.; Waibel, A.
2003. Proceedings of the 8th European Conference on Speech Communication and Technology , Eurospeech 2003, Genf, Schweiz, 1st-4th September 2003, International Society for Computers and Their Applications (ISCA). doi:10.21437/Eurospeech.2003-206
Towards Universal Speech Recognition
Wang, Z.; Topkara, U.; Schultz, T.; Waibel, A.
2003. Proceedings of the International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, PA, 14th-16th Oktober 2002, IEEEXplore. doi:10.1109/ICMI.2002.1167001
Integrating emotional cues into a framework for dialogue management
Holzapfel, H.; Fuegen, C.; Denecke, M.; Waibel, A.
2003. Proceedings. Fourth IEEE International Conference on Multimodal Interface (ICMI), Pittsburgh, 14th-16th October 2002, IEEEXplore
Generische Interaktionsmuster für aufgabenorientierte Dialogsysteme [online]. PhD dissertation
Denecke, M.
2003. Universität Karlsruhe (TH). doi:10.5445/IR/5092003
Minimum variance distortionless respronse on a warped frequency scale
Woelfel, M.; McDonough, J.; Waibel, A.
2003. Eurospeech 2003 proceedings. 8th European Conference on Speech comunication and Technology, Geneva, Switzerland 2003., Geneva
Warping and scaling of the minimum variance distortionless response
Wölfel, M.; McDonough, J.; Waibel, A.
2003. 2003 IEEE Workshop on Automatic Speech Recognition and Understanding. St. Thomas, VI, 30th November - 4th December 2003, ASRU ’03, IEEE Operations Center
2002
The NESPOLE! speech-to-speech translation system
Metze, F.; McDonough, J.; Soltau, H.; Langley, C.; Lavie, A.; Schultz, T.; Waibel, A.; Cattoni, R.; Lazzari, G.; Pianesi, F.
2002. HLT ’02: Proceedings of the second international conference on Human Language Technology Researc, San Diego, San Diego, 24th-27th March 2002, 378–383, Morgan Kaufmann Publishers. doi:10.3115/1289189.1289233
Simultaneous tracking of head poses in a panoramic view
Stiefelhagen, R.; Yang, J.; Waibel, A.
2002. Proceedings of the 15th International Conference on Pattern Recognition, ICPR, Barcelona, 3rd-8th September 2000, In: Proceedings of the International Conference on Pattern Recognition, Barcelona, Spain 2000. S. 726-729., IEEEXplore. doi:10.1109/ICPR.2000.903646
Smart sight: a tourist assistant system
Yang, J.; Yang, W.; Denecke, M.; Waibel, A.
2002. The 3rd International Symposium on Wearable Computers, ISWC 99, San Francisco, Calif., October 18-19, 1999, In: The 3rd International Symposium on Wearable Computers, ISWC 99, San Francisco, Calif. 1999., IEEEXplore. doi:10.1109/ISWC.1999.806662
Translation of conversational speech with Janus-II
Lavie, A.; Waibel, A.; Levin, L.; Gates, D.; Gavalda, M.; Zeppenfeld, T.; Zhan, P.; Glickman, O.
2002. Proceeding of Fourth International Conference on Spoken Language Processing. ICSLP ’96, In: Proceedings. 4th International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. New York : IEEE 1996., IEEEXplore. doi:10.1109/ICSLP.1996.607286
Recognizing emotion in speech
Dellaert, F.; Polzin, T.; Waibel, A.
2002. Proceedings of 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996, In: Proceedings. 4th International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. New York : IEEE 1996., IEEEXplore. doi:10.1109/ICSLP.1996.608022
Toward movement-invariant automatic lip-reading and speech recognition
Duchnowski, P.; Hunke, M.; Buesching, D.; Meier, U.; Waibel, A.
2002. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, 9th-12 May 1995, In: Conference proceedings. The 1995 International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Mich. Vol. 1. Piscataway, NJ 1995. S. 109-113., 109–112, IEEE Press
Improvements in Non-verbal Cue Identification using Multilingual Phone Strings
Schultz, T.; Jin, Q.; Laskowski, K.; Tribble, A.; Waibel, A.
2002. Proceedings of the {ACL}-02 Workshop on Speech-to-Speech Translation: Algorithms and Systems on the 40th Anniversary Meeting of the Association for Computational Linguistics , ACL 2002, Philadelphia, Philadelphia, 6th-12th July 2002, 101–108, ACL Anthology. doi:10.3115/1118656.1118670
Modeling focus of attention for meeting indexing based on multiple cues
Stiefelhagen, R.; Yang, J.; Waibel, A.
2002. IEEE transactions on neural networks, 13 (4), 928–938. doi:10.1109/TNN.2002.1021893
Enhancing the Usability and Performance of NESPOLE!: a Real-World Speech-to-Speech Translation System
Metze, F.; McDonough, J.; Soltau, H.; Lavie, A.; Levin, L.; Langley, C.; Schultz, T.; Waibel, A.; Cattoni, R.; Lazzari, G.; Mana, N.; Pianesi, F.; Pianta, E.
2002. HLT ’02: Proceedings of the second international conference on Human Language Technology Research, San Diego, 24th-27th March, 2002, 269–274, Morgan Kaufmann Publishers
Speaker, Accent, and Language Identification using Multilingual Phone Strings
Jin, Q.; Schultz, T.; Waibel, A.
2002. HLT ’02: Proceedings of the second international conference on Human Language Technology Research, San Diego, 24th-27th March 2002, 125–131, Morgan Kaufmann Publishers
Farbbasierte Segmentierung von Körperregionen. student research project
King, A.
2002. Universität Karlsruhe (TH)
Erkennung gesprochener Flughafencodes. student research project
Dambier, M.
2002. Universität Karlsruhe (TH)
Spezifikation eines Dialogsystems für den Multimediaraum unter Verwendung des Dialogmanagers "ariadne". student research project
Topp, E. A.
2002. Universität Karlsruhe (TH)
Usability of the Multimodal Meeting Browser for Reviewing Meeting Records. bachelor’s thesis
Blank, K.
2002. International University in Germany
Continuous Grasp Recognition using Hidden Markov Models. diploma thesis
Bernardin, K.
2002. Universität Karlsruhe (TH)
Vision-based 3-D Tracking of People in a Smart Room Environment. diploma thesis
Focken, D.
2002. Universität Karlsruhe (TH)
An Adaptive Approach to Named Entity Extraction for Meeting Applications
Huang, F.; Waibel, A.
2002. HLT ’02: Proceedings of the second international conference on Human Language Technology Research. Ed.: M. Marcus, 165–170, Morgan Kaufmann Publishers
Automatic Summarization of English Broadcast News Speech
Hori, C.; Furui, S.; Malkin, R.; Yu, H.; Waibel, A.
2002. HLT ’02: Proceedings of the second international conference on Human Language Technology Research. Ed.: M. Marcus, 241–246, Morgan Kaufmann Publishers
Automatic Detection and Translation of Text from Natural Scenes
Yang, J.; Chen, X.; Zhang, J.; Zhang, Y.; Waibel, A.
2002. Proceedings of the International Conference on Acoustic Speech and Signal Processing, ICASSP 2002, Volume II, Orlando, USA, 13-17 May 2002, 2101–2104, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2002.5745049
Automatic Speech Summarization Applied to English Broadcast News Speech
Hori, C.; Furui, S.; Malkin, R.; Yu, H.; Waibel, A.
2002. Proceedings of the International Conference on Acoustic Speech and Signal Processing, ICASSP 2002, Volume I, Orlando, USA, 13-17 May 2002, I-9 - I-12, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2002.5743641
On Maximum Mutual Information Speaker-Adapted Training
McDonough, J.; Schaaf, T.; Waibel, A.
2002. Proceedings of the International Conference on Acoustic Speech and Signal Processing, ICASSP 2002, Volume I, Orlando, USA, 13-17 May 2002, I-601 - I-604, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2002.5743789
A Robust Approach for Recognition of Text Embedded in Natural Scenes
Zhang, J.; Chen, X.; Hannemann, A.; Yang, J.; Waibel, A.
2002. Proceedings of the 16th International Conference on Pattern Recognition, ICPR 2002, Volume 3, Quebec, Canada, 11-15 August 2002, 204–207, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICPR.2002.1047830
Compensating For Hyperarticulation By Modeling Articulatory Properties
Soltau, H.; Metze, F.; Waibel, A.
2002. ICSLP - 2002: 7th International Conference on Spoken Language Processing, Volume 2. Ed.: J. Hansen, 841–844, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2002-288
Automatic SIGN Translation
Zhang, Y.; Zhao, B.; Yang, J.; Waibel, A.
2002. ICSLP - 2002: 7th International Conference on Spoken Language Processing, Volume 1. Ed.: J. Hansen, 645–648, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2002-213
Flexi-modal And Multi-Machine User Interfaces
Myers, B.; Malkin, R.; Bett, M.; Waibel, A.; Bostwick, B.; Miller, R. C.; Yang, J.; Denecke, M.; Seemann, E.; Zhu, J.; Peck, C. H.; Kong, D.; Nichols, J.; Scherlis, B.
2002. Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, USA, 16 October 2002, 343–348, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICMI.2002.1167019
A PDA-based Sign Translator
Zhang, J.; Chen, X.; Yang, J.; Waibel, A.
2002. Proceedings of the 4th IEEE International Conference on Multimodal Interfaces, ICMI 2002, Pittsburgh, USA, 16 October 2002, 217–222, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICMI.2002.1166996
Automatic Detection of Signs with Affine Transformation
Chen, X.; Yang, J.; Zhang, J.; Waibel, A.
2002. Proceedings of the 6th IEEE Workshop on Applications of Computer Vision, WACV 2002, Orlando, USA, 04 December 2002, 32–36, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ACV.2002.1182151
Korean broadcast news transcription using morpheme-based recognition units
Kwon, O.-W.; Waibel, A.
2002. Han’gug Eumhyang Haghoeji, 21 (1E), 3–11
Speaker Identification using Multilingual Phone Strings
Jin, Q.; Schultz, T.; Waibel, A.
2002. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing , ICASSP 2002, Orlando, Florida, 13th-17th Mai 2002, IEEEXplore. doi:10.1109/ICASSP.2002.5743675
Phonetic Speaker Identification
Jin, Q.; Schultz, T.; Waibel, A.
2002. Proceedings of the International Conference of Spoken Language Processing, ICSLP 2002, Denver, CO, 16th-20th September 2002, International Society for Computers and Their Applications (ISCA)
Strategies for automatic segmentation of audio data
Kemp, T.; Schmidt, M.; Westphal, M.; Waibel, A.
2002. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000, Istanbul, Turkey, 5th-9th June 2000, In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000, Istanbul, Turkey 2000. [CD-ROM]., IEEEXplore
Specialized acoustic models for hyperarticulated speech
Soltau, H.; Waibel, A.
2002. Proceedings of the 25th IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2000, IStanbul. 5th-9th June 2000, In: Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000, Istanbul, Turkey 2000. [CD-ROM]., IEEEXplore
Polyphone decision tree specialization for language adaptation
Schultz, T.; Waibel, A.
2002. Proceedings of the IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2000, Istanbul, Turkey 5th - 9th June 2000, IEEEXplore. doi:10.1109/ICASSP.2000.862080
Interlingua based statistical machine translation
Kauers, M.; Vogel, S.; Fuegen, C.; Waibel, A.
2002. 7th International Conference on Spoken Language Processing (ICSLP 2002) - INTERSPEECH 2002, Denver, Colorado, USA, 16th-20th September 2002, In: Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP, Denver, CO 2002. Ed.: IEEE. [CD-ROM]., 1909–1912, International Society for Computers and Their Applications (ISCA). doi:10.21437/ICSLP.2002-513
A flexible stream architecture for ASR using articulatory features
Metze, F.; Waibel, A.
2002. Proceedings of the 7th International Conference on Spoken Language Processing (ICSLP 2002) - INTERSPEECH 2002, Denver, 16th-20th September 2002. Ed.: IEEE. [CD-ROM]., In: Proceedings of the 7th International Conference on Spoken Language Processing, ICSLP, Denver, CO 2002. Ed.: IEEE. [CD-ROM]., 2133–2136, International Society for Computers and Their Applications (ISCA). doi:10.21437/ICSLP.2002-583
A one-pass decoder based on polymorphic linguistic context assignment
Soltau, H.; Metze, F.; Fuegen, C.; Waibel, A.
2002. ASRU 2001 : 2001 IEEE Workshop on Automatic Speech Recognition and Understanding : conference proceedings : 9-13 December, 2001, Madonna di Campiglio, Italy, In: Proceedings of the 2001 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2001, Madonna di Campiglio, Italy 2001. [CD-ROM]., CD-ROM, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ASRU.2001.1034625
Knowing who to listen to in speech recognition: visually guided beamforming
Bub, U.; Hunke, M.; Waibel, A.
2002. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Michigan, USA, May 9-12, 1995, In: Conference proceedings. The 1995 International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Mich. Vol. 1. Piscataway, NJ 1995. S. 848-851., IEEEXplore. doi:10.1109/ICASSP.1995.479827
Dictionary learning for spontaneous speech recognition
Sloboda, T.; Waibel, A.
2002. Proceedings of the International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA, October 3-6,1996. Vol. 4, 2328–2331, Appl. Science and Eng. Lab. doi:10.1109/ICSLP.1996.607274
Class phrase models for language modeling
Ries, K.; Buoe, F. D.; Waibel, A.
2002. Proceedings of the International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. Vol. 1, In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. Vol. 1. S. 398-401., 398–401, IEEEXplore. doi:10.1109/ICSLP.1996.607138
Word clustering with parallel spoken language corpora
Wang, Y.-Y.; Lafferty, J.; Waibel, A.
2002. Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996, In: Proceedings. 4th International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. New York : IEEE 1996., IEEEXplore
Dialogue processing in a conversational speech translation system
Lavie, A.; Levin, L.; Qu, Y.; Waibel, A.; Gates, D.; Gavalda, M.; Mayfield, L.; Taboada, M.
2002. Proceedings of 4th International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996, In: Proceedings. 4th International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. New York : IEEE 1996., IEEEXplore. doi:10.1109/ICSLP.1996.607177
NPen++: a writer independent, large vocabulary on-line cursive handwriting recognition system
Manke, S.; Finke, M.; Waibel, A.
2002. Proceedings of the 3rd International Conference on Document Analysis and Recognition, ICDAR 1995, Montreal, August 14-16, 1995, In: Proceedings of the International Conference on Document Analysis and Recognition, Montreal, Canada 1995., IEEE Xplore. doi:10.1109/ICDAR.1995.599023
Improved language modeling by unsupervised acquisition of structure
Ries, K.; Buoe, F. D.; Wang, Y.-Y.; Waibel, A.
2002. 1995 International Conference on Acoustics, Speech and Signal Processing; Vol 1: Speech, ICASSP ’95, Detroit, May 9-12, 1995, 193–196, IEEE Service Center
A real-time face tracker
Yang, J.; Waibel, A.
2002. 3rd IEEE Workshop on Applications of Computer Vision, WACV ’96, Sarasota, Fla., 2nd-4th December 1996, In: Proceedings. 3rd IEEE Workshop on Applications of Computer Vision, WACV ’96, Sarasota, Fla. 1996. Los Alamitos, Calif. 1996., IEEEXplore
Multimodal interfaces for multimedia information agents
Waibel, A.; Suhm, B.; Vo, M. T.; Yang, J.
2002. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, München, April 21-24, 1997., In: 1997 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Munich 1997. Los Alamitos, Calif. 1997., 167–170, IEEE Comput. Soc. doi:10.1109/ICASSP.1997.599587
2001
An Automatic Sign Recognition And Translation System
Yang, J.; Gao, J.; Zhang, Y.; Chen, X.; Waibel, A.
2001. Proceedings of the Workshop on Perceptual User Interfaces, PUI 2001, Orlando, USA, 15-16 November 2001, 1–8, Association for Computing Machinery (ACM). doi:10.1145/971478.971490
Estimating focus of attention based on gaze and sound
Stiefelhagen, R.; Yang, J.; Waibel, A.
2001. Proceedings of the Workshop on Perceptive User Interfaces, PUI’01, Orlando, FL, 15th-16th November 2001, In: Proceedings of the Workshop on Perceptive User Interfaces, PUI’01, Orlando, FL 2001. [CD-ROM]., Association for Computing Machinery (ACM). doi:10.1145/971478.97150
Tracking focus of attention for human-robot communication
Stiefelhagen, R.; Yang, J.; Waibel, A.
2001. Humanoids 2001: Proceedings of IEEE-RAS International Conference on Humanoid Robots, November 22 - 24, 2001, Waseda University International Conference Center, In: Proceedings of IEEE-RAS International Conference on Humanoid Robots, Humanoids 2001, Tokyo, Japan 2001. [CD-ROM]., Historical Society of Waseda University
Arabic Speech Recognition. diploma thesis
Alwan, J.-A.; Kroschel, K.; Waibel, A.; Schultz, T.
2001, July. Universität Karlsruhe (TH)
Architecture and Design Considerations in NESPOLE!: A Speech Translation System For E-commerce Applications
Lavie, A.; Langley, C.; Waibel, A.; Pianesi, F.; Lazzari, G.; Coletti, P.; Taddei, L.; Balducci, F.
2001. HLT ’01: Proceedings of the first international conference on Human language technology research, 1–4, Association for Computing Machinery (ACM). doi:10.3115/1072133.1072140
Towards Automatic Sign Translation
Yang, J.; Gao, J.; Zhang, Y.; Waibel, A.
2001. HLT ’01: Proceedings of the first international conference on Human language technology research, 1–6, Association for Computing Machinery (ACM). doi:10.3115/1072133.1072223
Online handwriting recognition: the NPen++ recognizer
Jaeger, S.; Manke, S.; Reichert, J.; Waibel, A.
2001. International Journal on Document Analysis and Recognition, 3 (3), 169–180. doi:10.1007/PL00013559
Multimodal Error Correction for Speech User Interfaces
Suhm, B.; Myers, B.; Waibel, A.
2001. ACM transactions on computer human interaction, 8 (1), 60–98. doi:10.1145/371127.371166
On-line handwriting recognition: the NPen++ recognizer
Jaeger, S.; Manke, S.; Reichert, J.; Waibel, A.
2001. International journal on document analysis and recognition, Int. j. on doc. anal. and recognit. 3 (2000) H. 3 S. 169-181., (3), 169–181
Pen-Based Gesture Recognition. student research project
Seemann, E.
2001. Carnegie Mellon University. doi:10.5445/IR/1000166891
On-line Signature Verification. student research project
Kreckwitz, S.
2001. Universität Karlsruhe (TH). doi:10.5445/IR/1000166889
Recognizing Non-Native Speech: Characterizing and Adapting to Non-Native Usage in LVCSR. PhD dissertation
Mayfield Tomokiyo, L.
2001. Carnegie Mellon University
Detecting Emotions In Speech
Polzin, T. S.; Waibel, A. H.
2001. Cooperative Multimodal Communication, Second International Conference, CMC’98, Tilburg, The Netherlands, January 28-30, 1998
The ISL Evaluation System for Verbmobil-II
Soltau, H.; Schaaf, T.; Metze, F.; Waibel, A.
2001. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2001, Salt Lake City, USA, 07-11 May 2001, 65–68, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.2001.940768
Adaptation Methods For Non-Native Speech
Tomokiyo Mayfield, L.; Waibel, A.
2001. Proceedings of the 7th European Conference on Speech Communication and Technology, 2nd INTERSPEECH Event, EUROSPEECH 2001, Carnegie Mellon University
Domain Portability in Speech-to-Speech Translationng
Schultz, T.; Waibel, A.
2001. Proceedings of the Human Language Technology Meeting, HLT 2001, San Diego, San Diego, 30. März 2001
Experiments on Cross-language Acoustic Modeling
Schultz, T.; Waibel, A.
2001. Proceedings of the 7th European Conference on Speech Communication and Technology, Eurospeech 2001, Aalborg, September 3-7, 2001, 2721, International Society for Computers and Their Applications (ISCA)
The ISL meeting room system
Schultz, T.; Waibel, A.; Bett, M.; Metze, F.; Pan, Y.; Ries, K.; Schaaf, T.; Soltau, H.; Westphal, M.; Yu, H.; Zechner, K.
2001. Proceedings of the Workshop on Hands-Free Speech Communication, HSC 2001, Kyoto, 9th-11th April 2001
Speaker compensation with sine-log all-pass transforms
Mcdonough, J.; Metze, F.; Soltau, H.; Waibel, A.
2001. Proceedings of IEEE Signal Processing Society International Conference on Acoustics, Speech, and Signal Processing 2001, ICASSP 2001, Salt Lake City, UT, 7th-11th May 2001, In: Proceedings of IEEE Signal Processing Society International Conference on Acoustics, Speech, and Signal Processing 2001, ICASSP 2001, Salt Lake City, UT 2001. [CD-ROM]., IEEEXplore. doi:10.1109/ICASSP.2001.940844
Model-combination-based acoustic mapping
Westphal, M.; Waibel, A.
2001. 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221), Salt Lake City, UT, 7th-11th May 2001, In: Proceedings of IEEE Signal Processing Society International Conference on Acoustics, Speech, and Signal Processing 2001, ICASSP 2001, Salt Lake City, UT 2001. [CD-ROM]., IEEEXplore. doi:10.1109/ICASSP.2001.940807
LingWear: a mobile tourist information system
Fuegen, C.; Westphal, M.; Schneider, M.; Schultz, T.; Waibel, A.
2001. Proceedings of the 1st Human Language Technology Conference, HLT 2001, San Diego, USA, 18-21 March 2001, 1–5, Association for Computational Linguistics (ACL). doi:10.3115/1072133.1072200
Advances in meeting recognition
Waibel, A.; Yu, H.; Soltau, H.; Schultz, T.; Schaaf, T.; Pan, Y.; Metze, F.; Bett, M.
2001. Proceedings of the Human Technology Conference, HLT 2001, San Diego, CA, March 18 - 21, 2001, ACL Anthology
Advances in automatic meeting record creation and access
Waibel, A.; Bett, M.; Metze, F.; Ries, K.; Schaaf, T.; Schultz, T.; Soltau, H.; Yu, H.; Zechner, K.
2001. Proceedings of IEEE Signal Processing Society International Conference on Acoustics, Speech, and Signal Processing 2001, ICASSP 2001, Salt Lake City, UT, May 7-11, 2001, International Society for Computers and Their Applications (ISCA)
Activity detection for information access to oral communication
Ries, K.; Waibel, A.
2001. Proceedings of the 1st International Conference on Human Language Technology Research, HLT 2001, San Diego, USA 2001. Ed.: J. Allan, In: Proceedings of the 1st International Conference on Human Language Technology Research, HLT 2001, San Diego, USA 2001. Ed.: J. Allan. San Francisco 2001. [CD-ROM]., Association for Computational Linguistics (ACL)
Robuste kontinuierliche Spracherkennung für mobile Informationssysteme. PhD dissertation
Westphal, M.
2001. Aachen 2001. (Berichte aus der Informatik.) Fak. f. Informatik, Diss. v. 15.6.2000., Universität Karlsruhe (TH)
2000
Beyond HCI: Multimodal Manipulation of Multimedia Information
Yang, Y.; Waibel, A.
2000. Ji suan ji xue bao : yue kan = Chinese journal of computers / Zhong guo ji suan ji xue hui zhu ban. Ji suan ji xue bao bian ji wei yuan hui bian ji, 23 (12), 1245–1252
Multilinguality in speech and spoken language systems
Waibel, A.; Geutner, P.; Mayfield-Tomokiyo, L.; Schultz, T.; Woszczyna, M.
2000. Proceedings of the IEEE, Proc. of the IEEE 88 (2000) Spec. issue "Spoken language processing" S. 1297-1313., 88 (8), 1297–1313
The Janus-III Translation System: Speech-to-Speech Translation in Multiple Domains
Levin, L.; Lavie, A.; Woszczyna, M.; Gates, D.; Gevaldà, M.; Koll, D.; Waibel, A.
2000. Machine Translation, 15 (1-2), 3–25. doi:10.1023/A:1011186420821
Minimizing Word Error Rate In Textual Summaries Of Spoken Language
Zechner, K.; Waibel, A.
2000. NAACL 2000: Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference. Ed.: J. Wiebe, 186–193, Association for Computing Machinery (ACM)
Multimodal Meeting Tracker
Bett, M.; Gross, R.; Yu, H.; Zhu, X.; Pan, Y.; Yang, J.; Waibel, A.
2000. RIAO ’00: Content-Based Multimedia Information Access. Bd.: 1. Ed.: J. Mariani, 32–45, Association for Computing Machinery (ACM)
Growing Semantic Grammars. PhD dissertation
Gavalda, M.
2000. Carnegie Mellon University
Ein auf Flexionsformen basierendes Sprachmodell für deutsche Spracherkennung. student research project
Klein, M.
2000. Universität Karlsruhe (TH)
Learning Models of Speaker Variation. PhD dissertation
Witbrock, M. J.
2000. Carnegie Mellon University
Segmenting Hands Of Arbitrary Color
Zhu, X.; Yang, J.; Waibel, A.
2000. Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), Grenoble, France, 28-30 March 2000, 446–453, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/AFGR.2000.840673
Face Recognition In A Meeting Room
Gross, R.; Yang, J.; Waibel, A.
2000. Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580), Grenoble, France, 28-30 March 2000, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/AFGR.2000.840649
Shallow Discourse Genre Annotation in CallHome Spanish
Ries, K.; Levin, L.; Valle, L.; Lavie, A.; Waibel, A.
2000. Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00). Ed.: M. Gavrilidou, European Language Resources Association (ELRA)
Towards A Multimodal Meeting Record
Gross, R.; Bett, M.; Yu, H.; Zhu, X.; Pan, Y.; Yang, J.; Waibel, A.
2000. Proceedings of the International Conference on Multimedia and Expo, ICME 2000, New York City, USA, 30 July - 02 August 2000, Volume 3, 1593–1596, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICME.2000.871074
DIASUMM: Flexible Summarization of Spontaneous Dialogues in Unrestricted Domains
Zechner, K.; Waibel, A.
2000. COLING ’00: Proceedings of the 18th conference on Computational linguistics - Volume 2. Ed.: M. Kay, 968–974, Association for Computational Linguistics (ACL). doi:10.3115/992730.992786
Growing Gaussian Mixture Models for Pose Invariant Face Recognition
Gross, R.; Yang, J.; Waibel, A.
2000. Proceedings of the 15th International Conference on Pattern Recognition, ICPR 2000, Barcelona, Spain, 03-07 September 2000, Volume 1, 1088–1091, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICPR.2000.905661
NPEN++: An On-line Handwriting Recognition System
Jaeger, S.; Manke, S.; Waibel, A.
2000. Proceedings / 7th International Workshop on Frontiers in Handwriting Recognition : September 11 - 13, 2000, Amsterdam, The Netherlands. Ed.: L. R. B. Schomaker, 249–260
The Effects Of Room Acoustics On MFCC Speech Parameter
Pan, Y.; Waibel, A.
2000. Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Volume 4. Ed.: D. Guan, 129–132, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2000-768
Streamlining the Front End of a Speech Recognizer
Yu, H.; Waibel, A.
2000. Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Volume 1. Ed.: D. Guan, 353–356, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2000-88
New Developments In Automatic Meeting Transcription
Yu, H.; Tomokiyo, T.; Wang, Z.; Waibel, A.
2000. Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Volume 4. Ed.: D. Guan, 310–313, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2000-813
Dialogue Management For Multimodal User Registration
Huang, F.; Yang, J.; Waibel, A.
2000. Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Volume 3. Ed.: D. Guan, 37–40, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2000-472
Application of LDA to Speaker Recognition
Jin, Q.; Waibel, A.
2000. Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Bd.: 2. Ed.: D. Guan, 250–253, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2000-256
A Naive De-Lambing Method For Speaker Identification
Yang, J.; Waibel, A.
2000. Proceedings of the International Conference on Spoken Language Processing, ICSLP 2000, Volume 2. Ed.: D. Guan, 466–469, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2000-308
Phone Dependent Modeling of Hyperarticulated Effects
Soltau, H.; Waibel, A.
2000. Proceedings of the 6th International Conference on Spoken Language Processing, ICSLP 2000, Volume 4. Ed.: D. Guan, 105–108, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.2000-762
Time-Delay Neural Networks and NN/HMM Hybrids: A Family of Connectionist Continuous-Speech Recognition Systems
Fritsch, J.; Hild, H.; Meier, U.; Waibel, A.
2000. Handbook of neural networks for speech processing. Ed.: S. Katagiri, artechhouse
Emotion-sensitive human-computer interfaces
Polzin, T. S.; Waibel, A.
2000. Proceedings of the ISCA Workshop on Speech and Emotion. A Conceptual Framework for Research. Ed.: R. Cowie, 201–206, International Speech Communication Association (ISCA)
Hierarchical connectionist acoustic modeling for domain-adaptive large vocabulary speech recognition. PhD dissertation
Fritsch, J.
2000. Aachen 2000. (Berichte aus der Informatik.) Fak. f. Informatik, Diss. v. 25.10.1999., Universität Karlsruhe (TH)
Ein automatisches Indexierungssystem für Fernsehnachrichtensendungen. PhD dissertation
Kemp, T.
2000. Aachen 2000. (Berichte aus der Informatik.) Fak. f. Informatik, Diss. v. 13.12.1999., Universität Karlsruhe (TH)
Adaptive vocabularies in large vocabulary conversational speech recognition. PhD dissertation
Geutner, P.
2000. Aachen 2000. (Berichte aus der Informatik.) Fak. f. Informatik, Diss. v. 12.2.1999., Universität Karlsruhe (TH)
Towards unrestricted lipreading
Meier, U.; Stiefelhagen, R.; Yang, J.; Waibel, A.
2000. International journal of pattern recognition and artificial intelligence, Int. j. of pattern recognit. and artif. intell. 14 (2000) S. 571-585., 14 (5), 571–585. doi:10.1142/S0218001400000374
Multilingual speech recognition
Waibel, A.; Soltau, H.; Schultz, T.; Schaaf, T.; Metze, F.
2000. Verbmobil: foundations of speech-to-speech translation. Ed.: W. Wahlster, 33–45, Springer-Verlag
Language portability in acoustic modeling
Schultz, T.; Waibel, A.
2000. Proceedings of the Workshop on Multilingual Speech Communication, MSC 2000, Kyoto, Japan, 11th-13th October 2000, 59–64
Acoustic models for hyperarticulated speech
Soltau, H.; Waibel, A.
2000. Proceedings of the International Conference of Spoken Language Processing, ICSLP 2000, Beijing, October 16-20, 2000, In: Proceedings of the International Conference of Spoken Language Processing, ICSLP 2000, Beijing, China 2000. IEEE 2000. [CD-ROM]., International Society for Computers and Their Applications (ISCA)
1999
Modeling focus of attention for meeting indexing
Stiefelhagen, R.; Yang, J.; Waibel, A.
1999. Proceedings of the 7th {ACM} International Conference on Multimedia ’99, Orlando, FL, USA, October 30 - November 5, 1999, Part 1, In: Proceedings of ACM Multimedia ’99, Orlando, Fla. 1999. S. 3-10., 3–10, Association for Computing Machinery (ACM). doi:10.1145/319463.319464
Integrating Knowledge Sources For The Specification Of A Task-Orientated Dialogue System
Denecke, M.; Waibel, A.
1999. Proceedings of the 16th International Joint Conference on Artificial Intelligence, IJCAI 1999, 33–40
Model-based and empirical evaluation of multimodal interactive error correction
Suhm, B.; Myers, B.; Waibel, A.
1999. CHI 99: The CHI is the limit, human factors in computing systems ; Chi 99 Conference Proceedings. Ed.: M.G. Williams. New York 1999, In: CHI 99. The CHI is the limit, human factors in computing systems. Ed.: M.G. Williams. New York 1999., 584–591, Wesley Press. doi:10.1145/302979
Stochastically-based semantic analysis for machine translation
Minker, W.; Gavaldà, M.; Waibel, A.
1999. Computer speech and language, 13 (2), 177–194. doi:10.1006/csla.1999.0119
Modeling people’s focus of attention
Stiefelhagen, R.; Jie Yang; Waibel, A.
1999. Proceedings IEEE International Workshop on Modelling People. MPeople’99, Corfu, 20th September 1999, 79–86, IEEE Comput. Soc. doi:10.1109/PEOPLE.1999.798349
Automatische Segmentierung von Nachrichtensendungen. student research project
Schmidt, M.
1999. Universität Karlsruhe (TH)
Die hocharabische Sprache und Romanisierung ihrer Schrift. student research project
Karboul, O.
1999
Problematik und Techniken bei der Erstellung deutscher Grammatiken. bachelor’s thesis
Müller, D. C.
1999. Universität Karlsruhe (TH)
Evaluation eines alternativen information retrieval Ansatzes. student research project
Weber, M.
1999. Universität Karlsruhe (TH)
Face Translation : An Image-Based Approach to a Multi-Modal Communication Agent. diploma thesis
Ritter, M.
1999. Universität Karlsruhe (TH)
Grundfrequenzverfolgung und deren Anwendung in der Spracherkennung. diploma thesis
Schubert, K.
1999. Universität Karlsruhe (TH)
Integration von situationsabhängigen Modalitäten in kontextbasierte Entscheidungsbäume. diploma thesis
Fügen, C.
1999. Universität Karlsruhe (TH)
Detecting Verbal and Nonverbal Cues in the Communication of Emotions. PhD dissertation
Polzin, T. S.
1999. Carnegie Mellon University
Selection Criteria For Hypothesis Driven Lexical Adaptation
Geutner, P.; Finke, M.; Waibel, A.
1999. 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings, Volume 2. ICASSP99, Phoenix, USA, 15-19 March 1999, 617–620, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1999.759742
Hidden Understanding Models for Machine Translation
Minker, W.; Gavalda, M.; Waibel, A.
1999. Proceedings of the ESCA Tutorial and Research Workshop (ETRW) on Interactive Dialogue in Multi-Modal Systems, Kloster Irsee, Germany, June 22-25, 1999
Face Translation: A Multimodal Translation Agent
Ritter, M.; Meier, U.; Yang, J.; Waibel, A.
1999. Proceedings of the 4th annual Auditory-Visual Speech Processing Conference, AVSP 1999, Santa Cruz, USA, 07-10 August 1999. Ed.: D. Massaro, International Speech Communication Association (ISCA)
Modeling and efficient decoding of large vocabulary conversational speech
Finke, M.; Fritsch, J.; Koll, D.; Waibel, A.
1999. Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, Hungary, 05-09 September 1999. Ed.: G. Gordos, 467–470, International Speech Communication Association (ISCA). doi:10.21437/Eurospeech.1999-120
Multimodal People ID For A Multimedia Meeting Browser
Yang, J.; Zhu, X.; Gross, R.; Kominek, J.; Pan, Y.; Waibel, A.
1999. Proceedings of the 7th ACM International Conference on Multimedia, MULTIMEDIA 1999, Part 1. Ed.: J. Buford, S. Stevens, D. Bulterman, K. Jeffay, H. J. Zhang, 159–168, Association for Computing Machinery (ACM). doi:10.1145/319463.319484
Stochastically-based Semantic Analysis
Minker, W.; Waibel, A.; Mariani, J.
1999. Kluwer Academic Publishers
Selection criteria for hypothesis driven lexical adaption
Geutner, P.; Finke, M.; Waibel, A.
1999. In: Proceedings of the IEEE 1999 International Conference on Acoustics, Speech and Signal Processing, ICASSP, Phoenix, Ariz. 1999
Integrating knowledge sources for the specification of a task-oriented dialogue system
Denecke, M.; Waibel, A.
1999. IJCAI’99: Proceedings of the 16th international joint conference on Artificial intelligence, Stockholm, Sweden, 31st July - 6th August 1999, Vol. 1, In: IJCAI-99. Proceedings of the 16th International Joint Conference on Artificial Intelligence, Stockholm, Sweden 1999. San Francisco, Calif. 1999., Morgan Kaufmann Publishers
Progress in automatic meeting transcription
Yu, H.; Finke, M.; Waibel, A.
1999. Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, 5th-9th September 1999, In: Proceedings of the 6th European Conference on Speech, Communication and Technology, EUROSPEECH, Budapest, Hungary 1999. ESCA 1999. S. 695-698., 695–698, International Society for Computers and Their Applications (ISCA). doi:10.21437/Eurospeech.1999-178x
Modeling peoples focus of attention
Stiefelhagen, R.; Yang, J.; Waibel, A.
1999. In: IEEE International Workshop on Modeling People, Corfu, Greece 1999. S. 79-86
Unsupervised training of a speech recognizer: recent experiments
Kemp, T.; Waibel, A.
1999. Proceedings of the 6th European Conference on Speech, Communication and Technology, EUROSPEECH, Budapest, Hungary 1999. Vol. 6. ESCA 1999, In: Proceedings of the 6th European Conference on Speech, Communication and Technology, EUROSPEECH, Budapest, Hungary 1999. Vol. 6. ESCA 1999. S. 2725-2728., 2725–2728, International Society for Computers and Their Applications (ISCA)
Towards unrestricted lip-reading
Meier, U.; Stiefelhagen, R.; Yang, J.; Waibel, A.
1999. In: Proceedings of the 2nd International Conference on Multimodal Interfaces, ICMI99, Hong Kong, China 1999. Ed.: T. Yang. S. 25-30
Towards spontaneous speech recognition for on-board car navigation and information systems
Westphal, M.; Waibel, A.
1999. Proceedings of the 6th European Conference on Speech Communication and Technology, EUROSPEECH 1999, Budapest, 5th-9th September 1999, In: Proceedings of the 6th European Conference on Speech, Communication and Technology, EUROSPEECH, Budapest, Hungary 1999. ESCA 1999. S. 1955-1958., International Society for Computers and Their Applications (ISCA)
Mandarin large vocabulary speech recognition using the GlobalPhone database
Reichert, J.; Schultz, T.; Waibel, A.
1999. Proceedings of the 6th European Conference on Speech, Communication and Technology, EUROSPEECH, Budapest, Hungary 1999. Vol. 2. ESCA 1999, 815–818, International Society for Computers and Their Applications (ISCA)
Language adaptive LVCSR through polyphone decision tree specialization
Schultz, T.; Waibel, A.
1999. Proceedings of the Workshop on Multi-lingual Interoperability in Speech Technology, MIST 1999, Leusden, The Netherlands, 30. September 1999, 85–90
From gaze to focus of attention
Stiefelhagen, R.; Finke, M.; Yang, J.; Waibel, A.
1999. In: Visual information and information systems. VISUAL ’99. Ed.: D.P. Huijsmans. Berlin 1999. S. 761-768. (Lecture notes in computer science. 1614.)
Experiments towards a multi-language LVCSR interface
Schultz, T.; Waibel, A.
1999. Proceedings of the 2nd International Conference on Multimodal Interfaces, ICMI99, Hong Kong, China 1999. Ed.: T. Yang
Data-driven determination of appropriate dictionary units for Korean LVCSR
Kiecza, D.; Schultz, T.; Waibel, A.
1999. Proceedings of the International Conference on Speech Processing, ICSP’99, Seoul, Korea18th-20th August 1999, 323–327, The Acoustical Society of Korea
From gaze to focus of attention
Stiefelhagen, R.; Finke, M.; Yang, J.; Waibel, A.
1999. Proceedings of the 3rd International Conference On Visual Information Systems, VISUAL 1999, Amsterdam, June 2-4 1999, In: Proceedings of the Workshop on Perceptual User Interfaces, PUI98, San Francisco 1998., 761–768, Springer
Search in a learnable spoken language parser
Buoe, F. D.; Waibel, A.
1999. ECAI 96: 12th European Conference on Artificial Intelligence. Proceedings: August 11-16, 1996, Budapest, Hungary: Ed.: W. Wahlster, In: Proceedings of the ECAI 96, Budapest 1996., John Wiley and Sons
1998
Schnelle adaptive Sprechernormierung für die Spracherkennung. student research project
Schubert, K.
1998, September 30. Universität Karlsruhe (TH)
Entwicklung eines türkischen Spracherkennungssystems für große Vokabulare. diploma thesis
Çarki, K.
1998, September 30. Universität Karlsruhe (TH)
Growing semantic grammars
Gavalda, M.; Waibel, A.
1998. ACL ’98/COLING ’98: Proceedings of the 36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics - Volume 1, In: Proceedings of the COLING/ACL98, Montreal, Quebec, Canada 1998., Association for Computational Linguistics (ACL)
Geräuschreduktion für die Spracherkennung im Auto. student research project
Ritter, M.
1998, January 7. Universität Karlsruhe (TH)
Model-Based Recovery of Dynamic Information from Static Handwriting. diploma thesis
Groß, R.
1998. Universität Karlsruhe (TH)
Spracherkennung im Chinesischen. diploma thesis
Reichert, J.
1998. Universität Karlsruhe (TH)
Grammar Inference And Statistical Machine Translation. PhD dissertation
Wang, Y.-Y.
1998. Carnegie Mellon University
A Framework And Toolkit For The Construction Of Multimodal Learning Interfaces. PhD dissertation
Vo, M. T.
1998. Carnegie Mellon University
Meeting Browser: Tracking And Summarizing Meetings
Waibel, A.; Bett, M.; Finke, M.; Stiefelhagen, R.
1998. Proceedings of the Broadcast News Transcription and Understanding Workshop, February 8-11, 1998, Lansdowne Conference Resort, Lansdowne, Virginia, Morgan Kaufmann Publishers
Clarity: Inferring Discourse Structure from Speech
Finke, M.; Lapata, M.; Lavie, A.; Levin, L.; Tomokiyo, L. M.; Polzin, T.; Ries, K.; Waibel, A.; Zechner, K.
1998. Applying Machine Learning to Discourse Processing : AAAI’98 Spring Symposium Series, March 23-25, 1998, Stanford University, California, 25–32, Association for the Advancement of Artificial Intelligence (AAAI)
Interactive Error Repair for an Online Handwriting Interface
Huerst, W.; Yang, J.; Waibel, A.
1998. CHI ’98: CHI 98 Conference Summary on Human Factors in Computing Systems. Ed.: C.-M Karat, A. Lund, 353–354, Association for Computing Machinery (ACM). doi:10.1145/286498.286818
Error Repair in Human Handwriting - An Intelligent User Interface for Automatic On-Line Handwriting Recognition
Huerst, W.; Yang, J.; Waibel, A.
1998. Proceedings of the IEEE International Joint Symposia on Intelligence and Systems, IJSIS 1998, Rockville, USA, 23 May 1998, 389–395, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/IJSIS.1998.685482
A Modular Approach To Spoken Language Translation For Large Domains
Woszczyna, M.; Broadhead, M.; Gates, D.; Gavalda, M.; Lavie, A.; Levin, L.; Waibel, A.
1998. Machine Translation and the Information Soup: Third Conference of the Association for Machine Translation in the Americas, AMTA’98, Langhorne, PA, USA, October 28–31, 1998 Proceedings. Ed.: D. Farwell, L. Gerber, E. Hovy, 31–40, Springer. doi:10.1007/3-540-49478-2_3
Linear Discriminant - A New Criterion For Speaker Normalization
Westphal, M.; Schultz, T.; Waibel, A.
1998. Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998. Ed.: B. Millar, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.1998-433
An Interlingua Based on Domain Actions for Machine Translation of Task-Oriented Dialogues
Levin, L.; Gates, D.; Lavie, A.; Waibel, A.
1998. Proceedings of the 5th International Conference on Spoken Language Processing, ICSLP 1998. Ed.: B. Millar, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.1998-572
Intelligent Animated Agents for Interactive Language Training
Cole, R.; Carmell, T.; Connors, P.; Macon, M.; Wouters, J.; Villiers, J. De; Tarachow, A.; Massaro, D.; Cohen, M.; Beskow, J.; Yang, J.; Meier, U.; Waibel, A.; Stone, P.; Fortier, G.; Davis, A.; Soland, C.
1998. ACM SIGCAPH computers and the physically handicapped, 61, 5–10. doi:10.1145/288076.288077
Development of Multilingual Acoustic Models in the GlobalPhone Project
Schultz, T.; Waibel, A.
1998. Proceedings of the 1st Workshop on Text, Speech, and Dialogue, TSD 1998, Brno, Czech Republic, 30. September 1998, 311–316
Das Projekt GlobalPhone: Multilinguale Spracherkennung
Schultz, T.; Waibel, A.
1998. Computers, Linguistics, and Phonetics between Language and Speech. Proceedings of the 4th Conference on NLP, Konvens 1998, Bonn, Germany, 30. Oktober 1998, 179–189
Adaptation of Pronunciation Dictionaries for Recognition of Unseen Languages
Schultz, T.; Waibel, A.
1998. Workshop on Speech and Communication, SPECOM 1998, St. Petersburg, Russia, 30. Oktober 1998, 207–210
On-line Erkennung kursiver Handschrift bei großen Vokabularen. PhD dissertation
Manke, S.
1998. Aachen 1998. (Berichte aus der Informatik.) Fak. f. Informatik, Diss. v. 13.2.1998., Universität Karlsruhe (TH)
Speech understanding for spoken language systems. Portability across domains and languages. PhD dissertation
Minker, W.
1998. Egelsbach 1998. (Deutsche Hochschulschriften. 2569.) Fak. f. Informatik, Diss. v. 19.12.1997., Universität Karlsruhe (TH)
Meeting browser: tracking and summarising meetings
Waibel, A.; Bett, M.; Finke, M.
1998. In: Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, Va. 1998
Experiments in automatic meeting transcription using JRTK
Yu, H.; Clark, C.; Malkin, R.; Waibel, A.
1998. In: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Seattle, WA. Piscataway, NJ 1998
Detection emotions in speech
Polzin, T.; Waibel, A.
1998. In: Proceedings of the CMC 1998
Pronunciation variations in emotional speech
Polzin, T.; Waibel, A.
1998. In: ESCA-98, Tutorial and Research Workshop on Modeling Pronunciation Variation for Automatic Speech Recognition, Kerkrade, The Netherlands 1998
Modeling with structures in statistical machine translation
Wang, Y.-Y.; Waibel, A.
1998. In: Proceedings of the COLING/ACL98, Montreal, Quebec, Canada 1998
Using chunk based partial parsing of spontaneous speech in unrestricted domains for reducing word error rate in speech recognition
Zechner, K.; Waibel, A.
1998. In: Proceedings of the COLING/ACL98, Montreal, Quebec, Canada 1998
Language independent and language adaptive large vocabulary speech recognition
Schultz, T.; Waibel, A.
1998. Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia, 30th November - 4th December 1998. Vol. 5, 1819–1822, Causal Production
Unsupervised training of a speech recognizer using TV broadcasts
Kemp, T.; Waibel, A.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
Reducing the OOV rate in broadcast news speech recognition
Kemp, T.; Waibel, A.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
The Interactive Systems Labs view4you video indexing system
Kemp, T.; Geutner, P.; Schmidt, M.; Tomaz, B.; Weber, M.; Westphal, M.; Waibel, A.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
On the influence of hyperarticulated speech on recognition performance
Soltau, H.; Waibel, A.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
Conversational speech systems for on-board car navigation and assistance
Geutner, P.; Denecke, M.; Meier, U.; Westphal, M.; Waibel, A.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
Phonetic-distance-based hypothesis driven lexical adaptation for transcribing multilingual broadcast news
Geutner, P.; Finke, M.; Waibel, A.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
Effective structural adaptation of LVCSR systems to unseen domains using hierarchical connectionist acoustic models
Fritsch, J.; Finke, M.; Waibel, A.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
Fast decoding for statistical machine translation
Wang, Y.-Y.; Waibel, A.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
Probabilistic dialogue act extraction for concept based multilingual translation systems
Fukada, T.; Koll, D.; Waibel, A.; Tanigaki, K.
1998. In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia 1998
Linear discriminant - a new method for speaker normalization
Westphal, M.; Schultz, T.; Waibel, A.
1998. Proceedings of the International Conference on Spoken Language Processing, ICSLP 98, Sydney, Australia, 30th November - 4th December 1998. Vol. 5, Causal Production
An intelligent user interface for automatic on-line handwriting recognition
Huerst, W.; Yang, J.; Waibel, A.
1998. In: Proceedings of the 1998 IEEE Symposia on Intelligence and Systems, Washington, DC 1998
Hierarchies of neural networks for connectionist speech recognition
Fritsch, J.; Waibel, A.
1998. In: Proceedings of European Symposium on Artificial Neural Networks, ESANN 98, Brugges, Belgium 1998
Real-time face and facial feature tracking and applications
Yang, J.; Stiefelhagen, R.; Meier, U.; Waibel, A.
1998. In: Proceedings of Auditory-Visual Speech Processing, AVSP 98, Terrigal, Australia 1998
A visual timing tool for language training of hearing impaired children
Meier, U.; Yang, J.; Waibel, A.
1998. In: Proceedings of the Workshop on Perceptual User Interfaces, PUI98, San Francisco 1998
Interactive repair for an online handwriting interface
Huerst, W.; Yang, J.; Waibel, A.
1998. In: Proceedings of CHI 98, Conference on Human Factors in Computing Systems, Los Angeles, Calif. 1998. S. 353-354
Visual tracking for multimodal human computer interaction
Yang, J.; Stiefelhagen, R.; Meier, U.; Waibel, A.
1998. In: Proceedings of CHI 98, Conference on Human Factors in Computing Systems, Los Angeles, Calif. 1998. S. 140-147
Towards tracking interaction between people
Stiefelhagen, R.; Yang, J.; Waibel, A.
1998. Proceedings of the Intelligent Environments AAAI Spring Symposium, Stanford Univ., Calif. 1998, 123–127
Multilingual and crosslingual speech recognition
Schultz, T.; Waibel, A.
1998. Proceedings of the DARPA Broadcast News TranscrProceedings of the DARPA Broadcast News Transcription and Understanding, DARPA 1998, Lansdowne Virginia, 20. Februar 1998, 259–262
Recognition of music types
Soltau, H.; Schultz, T.; Westphal, M.; Waibel, A.
1998. Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Seattle, WA. Piscataway, NJ 30. Mai 1998
Serbo-Croatian LVCSR on the dictation and broadcast news domain
Scheytt, P.; Geutner, P.; Waibel, A.
1998. In: Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Seattle, WA. Piscataway, NJ 1998
Transcribing multilingual broadcast news using hypothesis driven lexical adaptation
Geutner, P.; Finke, M.; Scheytt, P.; Waibel, A.; Wactlar, H.
1998. In: Proceedings of the DARPA Broadcast News Transcription and Understanding Workshop, Lansdowne, Va. 1998
1997
Implementierung von Echokompensation für ein automatisches Telefonauskunftssystem. bachelor’s thesis
Carki, K.
1997, September 10. Universität Karlsruhe (TH)
Experimente zur automatischen Schätzung von Sprechgeschwindigkeit für Spontansprache. student research project
Philips, P.
1997, July 3. Universität Karlsruhe (TH)
Optimierte Wörterbucherstellung, Spracherkennung in Automobilen. student research project
Kiezca, O.
1997, June 23. Universität Karlsruhe (TH)
Run-On Recognition In An Online Handwriting Recognition System. diploma thesis
Groß, R.
1997, June. Universität Karlsruhe (TH)
Klassifikation von Musikstilen. diploma thesis
Soltau, H.
1997, May 28. Universität Karlsruhe (TH)
Lookahead mit Neuronalen Netzen in JANUS. student research project
Paschen, K.
1997, March 28. Universität Karlsruhe (TH)
Repair in On-Line Handwriting Recognition. diploma thesis
Hürst, W.; Waibel, A.; Yang, J.
1997, March. Carnegie Mellon University
Spracherkennung von gesprochenen und buchstabierten Eigennamen. diploma thesis
Meyer, M.
1997, March. Universität Karlsruhe (TH)
Maschinelle Erkennung von handgeschriebenen, mathematischen Ausdrücken. diploma thesis
Schulz-Heyn, A.
1997. Universität Karlsruhe (TH)
Verbmobil: The Combination of Deep and Shallow Processing for Spontaneous Speech Translation
Bub, T.; Wahlster, W.; Waibel, A.
1997. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, Munich, Germany, 21-24 April 1997, Volume 1, 71–74, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1997.607199
The JanusRTk Switchboard/CallHome Evaluation System
Finke, M.; Fritsch, J.; Geutner, P.; Ries, K.; Zeppenfeld, T.; Waibel, A.
1997. Proceedings of the LVCSR Hub 5-E Workshop
Dialogue Strategies Guiding Users To Their Communicative Goals
Denecke, M.; Waibel, A.
1997. Proceedings of the 5th European Conference on Speech Communication and Technology, EUROSPEECH 1997, Rhodes, Greece, September 22-25, 1997. Ed.: G. Kokkinakis, 1339–1342, International Society for Computers and Their Applications (ISCA). doi:10.21437/eurospeech.1997-375
Janus : A System for Translation of Conversational Speech
Waibel, A.; Lavie, A.; Levin, L.
1997. Künstliche Intelligenz, 97 (4)
Buchstabiererkennung mit neuronalen Netzen in Auskunftssystemen. PhD dissertation
Hild, H.
1997. Aachen 1997. (Berichte aus der Informatik.) Fak. f. Informatik, Diss. v. 28.5.1997., Universität Karlsruhe (TH)
The GlobalPhone Project: Multilingual LVCSR with JANUS-3
Schultz, T.; Westphal, M.; Waibel, A.
1997. Multilingual information retrieval dialogs
Fast bootstrapping of LVCSR systems with multilingual phoneme sets
Schultz, T.; Waibel, A.
1997. In: Proceedings of the Eurospeech’97, Rhodos, Greece 1997. Vol. 1. S. 371-373
Japanese LVCSR on the spontaneous scheduling task with JANUS-3
Schultz, T.; Koll, D.; Waibel, A.
1997. In: Proceedings of the Eurospeech’97, Rhodos, Greece 1997. Vol. 1. S. 367-370
Janus II: towards spontaneous Spanish speech recognition
Zhan, P.; Ries, K.; Gavalda, M.; Gates, D.; Lavie, A.; Waibel, A.
1997. Proceedings 4th International Conference on Spoken Language Processing (ICSLP 1996), In: Proceedings. 4th International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. New York : IEEE 1996., 2285–2288, International Society for Computers and Their Applications (ISCA). doi:10.21437/ICSLP.1996-578
Interactive recovery from speech recognition errors in speech user interfaces
Suhm, B.; Myers, B.; Waibel, A.
1997. 4th International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA, 3rd-6th October 1996, In: Proceedings. 4th International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. New York : IEEE 1996., IEEEXplore. doi:10.1109/ICSLP.1996.607738
Multi-lingual translation of spontaneously spoken language in a limited domain
Lavie, A.; Gates, D.; Gavalda, M.; Mayfield, L.; Waibel, A.; Levin, L.
1997. Proceedings of the 16th International Conference on Computational Linguistics, COLING 1996, Copenhagen, 5th-9th August 1996, In: Proceedings. COLING 96, Copenhagen, Denmark 1996. Copenhagen 1996., 442–447, Association for Computational Linguistics (ACL)
Recognition of conversational telephone speech using the Janus speech engine
Zeppenfeld, T.; Finke, M.; Ries, K.; Westphal, M.; Waibel, A.
1997. In: 1997 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Munich 1997. Los Alamitos, Calif. 1997
Janus III: speech-to-speech translation in multiple languages
Lavie, A.; Waibel, A.; Levin, L.; Finke, M.; Gates, D.; Gavalda, M.; Zeppenfeld, T.; Zhan, P.
1997. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing (ICASSP) Munich, Germany, 21-24 April 1997, Vol.: 1, 99–102, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1997.599557
Decoding algorithm in statistical machine translation
Wang, Y.-Y.; Waibel, A.
1997. In: Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics, 1997
Expanding the domain of a multi-lingual speech-to-speech translation system
Lavie, A.; Levin, L.; Zhan, P.; Taboada, M.; Gates, D.; Lapata, M.; Clark, C.; Broadhead, M.; Waibel, A.
1997. In: Proceedings of the Workshop on Spoken Language Translation, 1997
Statistical analysis of dialogue structure
Wang, Y.-Y.; Waibel, A.
1997. In: Proceedings of Eurospeech 97, 5th European Conference on Speech Communication and Technology, Rhodos, Greece 1997
Speaker normalization and speaker adaptation - a combination for conversational speech recognition
Zhan, P.; Westphal, M.; Finke, M.; Waibel, A.
1997. In: Proceedings of Eurospeech 97, 5th European Conference on Speech Communication and Technology, Rhodos, Greece 1997
Speaking mode dependent pronunciation modeling in large vocabulary conversational speech recognition
Finke, M.; Waibel, A.
1997. In: Proceedings of Eurospeech 97, 5th European Conference on Speech Communication and Technology, Rhodos, Greece 1997
Dialogues strategies guiding users to their communicative goals
Denecke, M.; Waibel, A.
1997. In: Proceedings of Eurospeech 97, 5th European Conference on Speech Communication and Technology, Rhodos, Greece 1997
Exploiting repair context in interactive error recovery
Suhm, B.; Waibel, A.
1997. In: Proceedings of Eurospeech 97, 5th European Conference on Speech Communication and Technology, Rhodos, Greece 1997
Flexible transcription alignment
Finke, M.; Waibel, A.
1997. In: Proceedings of 1997 IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barbara, Calif. 1997
Focus of attention: towards low bitrate video tele-conference
Yang, J.; Wu, L.; Waibel, A.
1997. In: Proceedings. International Conference on Image Processing, Lausanne, Switzerland 1996. Piscataway, NJ : IEEE 1996
Java front-end for web-based multimodal human-computer interaction
Jing, X.; Yang, J.; Vo, M. T.; Waibel, A.
1997. In: Proceedings of PUI’97, Banff, Alberta, Canada 1997
Skin-color modeling and adaptation
Yang, J.; Lu, W.; Waibel, A.
1997. In: Computer Vision - ACCV’98. Ed.: R. Chin. Vol. 2. Berlin 1997. S. 687-694. (Lecture notes in computer science. 1352.)
Context-dependent hybrid HME/HMM speech recognition using polyphone clustering decision trees
Fritsch, J.; Finke, M.; Waibel, A.
1997. Proceedings of the International Conference on Acoustics, Speech, and Signal Processing, ICASSP 1997, München, 21st-24th April 1997, In: 1997 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Munich 1997. Los Alamitos, Calif. 1997., Institute of Electrical and Electronics Engineers (IEEE)
Tracking eyes and monitoring eye gaze
Stiefelhagen, R.; Yang, J.; Waibel, A.
1997. In: Workshop on Perceptual User Interfaces, Banff, Canada
A model-based gaze-tracking system
Stiefelhagen, R.; Yang, J.; Waibel, A.
1997. International Journal of artificial intelligent tools, Int. j. of artif. intell. tools 6 (1997) H. 2 S. 193-209., 6 (2), 193–209
1996
Multimodal interfaces
Waibel, A.; Vo, M. T.; Duchnowski, P.; Manke, S.
1996. Artifivicial Intelligence Review. Ed.: D. Liu, Artif. intell. rev. 10 (1995) Spec. vol. on Integration of natural language and vision processing., 299–319, Springer Nature. doi:10.1007/BF00127684
Recovering From Parser Failures: A Hybrid Statistical/Symbolic Approach
Rosé, C. P.; Waibel, A.
1996. The balancing act : combining symbolic and statistical approaches to language. Ed.: J. Klavans, 104–111, Massachusetts Institute of Technology Press (MIT Press). doi:10.7551/mitpress/1507.003.0010
Interactive translation of conversational speech
Waibel, A.
1996. Computer, Computer, IEEE Comput. Soc. 29 (1996) H. 7 S. 41-48., 29 (7), 41–48. doi:10.1109/2.511967
GLR*: A Robust Grammar-Focused Parser For Spontaneously Spoken Language. PhD dissertation
Lavie, A.
1996, May. Carnegie Mellon University
Gaze tracking for multimodal human-computer interaction. diploma thesis
Stiefelhagen, R.
1996. Universität Karlsruhe (TH). doi:10.5445/IR/1000183881
Modular Neural Networks for Speech Recognition. diploma thesis
Fritsch, J.
1996. Carnegie Mellon University
Kanalkompensation in der Spracherkennung. diploma thesis
Baumgärtner, R.
1996. Universität Karlsruhe (TH)
Klassifizierung und Erkennung von Sprachsegmenten. diploma thesis
Buckow, J.-C.
1996. Universität Karlsruhe (TH)
Vertrauensmaße für die maschinelle Spracherkennung. diploma thesis
Schaaf, T.
1996. Universität Karlsruhe (TH)
Designing Interactive Error Recovery Methods for Speech Interfaces
Suhm, B.; Myers, B.; Waibel, A.
1996. Proceedings of the ACM Conference on Human Factors in Computing Systems (CHI 1996)
JANUS-II - Advances in Spontaneous Speech Translation
Wosczyna, M.; Finke, M.; Kemp, T.; McNair, A.; Lavie, A.; Mayfield, L.; Maier, M.; Rogina, I.; Sloboda, T.; Waibel, A.; Zhan, P.; Zeppenfeld, T.
1996. Proceedings of the IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996, Institute of Electrical and Electronics Engineers (IEEE)
Focus Of Attention: Towards Low Bitrate Video Tele-Conferencing
Yang, J.; Wu, L.; Waibel, A.
1996. Proceedings of the 3rd IEEE International Conference on Image Processing, ICIP 1996, Lausanne, Switzerland, 19-19 September 1996, 97–100, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICIP.1996.560611
System Description JANUS. Multi-lingual Translation of Spontaneous Speech in a Limited Domain
Lavie, A.; Levin, L.; Waibel, A.; Gates, D.; Gavalda, M.; Mayfield, L.
1996. Proceedings of the 2nd Conference of the Association for Machine Translation in the Americas, AMTA 1996, 252–255, Association for Computational Linguistics (ACL)
JANUS: a Multi-lingual Speech-to-speech Translation System for Spontaneously Spoken Language in a Limited Domain
Lavie, A.; Levin, L.; Waibel, A.; Gates, D.; Gavalda, M.; Mayfield, L.
1996. Proceedings of the 2nd Conference of the Association for Machine Translation in the Americas, AMTA 1996
Estimation of Verb Subcategorization Frame Frequencies based on Syntactic and Multidimensional Statistical Analysis
Ushioda, A.; Evans, D. A.; Gibson, T.; Waibel, A.
1996. Recent advances in parsing technology. Ed.: H. Bunt, 241–253, Kluwer Academic Publishers
FeasPar - a feature structure parser learning to parse spontaneous speech. PhD dissertation
Buoe, F. D.
1996. Fak. f. Informatik, Diss. v. 11.7.1996., Universität Karlsruhe (TH)
Detection and transcription of new words
Suhm, B.; Woszczyna, M.; Waibel, A.
1996. 3rd European Conference on Speech, Communication and Technology, EUROSPEECH 93, Berlin, 22nd - 25th September 1993, In: 3rd European Conference on Speech, Communication and Technology, EUROSPEECH 93, Berlin, Germany 1993., 2179–2182, International Society for Computers and Their Applications (ISCA). doi:10.21437/Eurospeech.1993-488
Learning complex output representations in connectionist parsing of spoken language
Buoe, F. D.; Polzin, T.; Waibel, A.
1996. Proceedings of ICASSP ’94. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Adelaide, Australia, 19th - 22nd April 1994, In: 1994 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Adelaide, Australia. Piscataway, NJ 1994., Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1994.389280
Bimodal sensor integration on the example of speech-reading
Bregler, C.; Manke, S.; Hild, H.; Waibel, A.
1996. 1993 IEEE International Conference on Neural Networks, San Francisco, Calif. 1993, In: 1993 IEEE International Conference on Neural Networks, San Francisco, Calif. 1993. Piscataway, NJ 1993., Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICNN.1993.298634
Improving connected letter recognition by lipreading
Bregler, C.; Manke, S.; Hild, H.; Waibel, A.
1996. 1993 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Minneapolis, Mn,, 27th-30th April 1993, In: 1993 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Minneapolis, Minn. 1993. Piscataway, NJ 1993., Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1993.319179
Connectionist architectural learning for high performance character and speech recognition
Bodenhausen, U.; Manke, S.; Waibel, A.
1996. In: 1993 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Minneapolis, Minn. 1993. Piscataway, NJ 1993
Multimodal human-computer interaction
Vo, M. T.; Waibel, A.
1996. In: Proceedings of ISSD’93, Waseda, Japan 1993
Connectionist models in multimodal human-computer interaction
Waibel, A.; Duchnowski, P.
1996. Proceedings of the Government Microcircuit Applications Conference, GOMAC, San Diego, November 7-10, 1994, In: Proceedings of the Government Microcircuit Applications Conference, GOMAC, San Diego 1994
Flexibility through incremental learning: neural networks for text categorization
Geutner, P.; Bodenhausen, U.; Waibel, A.
1996. WCNN’93, Portland: World Congress on Neural Networks, July 11-15, 1993, Oregon Convention Center, Portland, Oregon, Band 1, In: Proceedings of the World Congress on Neural Networks, WCNN, Portland, Or. 1993. S. 24-27., 24–27, Psychology Press
Multi-speaker / speaker-independent architectures for the multi-state time delay neural network
Hild, H.; Waibel, A.
1996. In: 1993 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Minneapolis, Minn. Vol. 2. Piscataway, NJ 1993. S. 255-258
A fast search technique for large vocabulary on-line handwriting recognition
Manke, S.; Finke, M.; Waibel, A.
1996. Proceedings of the 5th International Workshop on Frontiers in Handwriting Recognition, IWFHR 1996, Colchester, September 2-5, 1996, In: International Workshop on Frontiers in Handwriting Recognition, Univ. of Essex, Colchester, England 1996
A modelbased gaze tracking system
Stiefelhagen, R.; Yang, J.; Waibel, A.
1996. In: IEEE International Joint Symposia on Intelligence and Systems - Image, Speech & Natural Language Systems, Maryland, USA 1996
LVCSR-based language identification
Schultz, T.; Rogina, I.; Waibel, A.
1996. IEEE International Conference On Acoustics, Speech And Signal Processing, ICASSP 1996, Atlanta, Georgia, USA, May 7-10, 1996, 781–784, IEEE Service Center
JANUS-II - translation of spontaneous conversational speech
Waibel, A.; Finke, M.; Kemp, T.; Maier, M.; Rogina, I.; Sloboda, T.; Woszczyna, M.; Levin, L.; Lavie, A.; Gates, D.; Gavaldà, M.; Mayfield, L.; McNair, A.; Shima, K.; Zeppenfeld, T.; Zhan, P.
1996. IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings, Atlanta, GA, USA, In: Conference proceedings. The 1996 International Conference on Acoustics, Speech and Signal Processing, ICASSP 96, Atlanta, Ga. 1996. Vol. 1. Piscataway, NJ 1996. S. 409-412., 409–412, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1996.541119
Recognition of spelled names over the telephone
Hild, H.; Waibel, A.
1996. Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, Pennsylvania, USA, October 3-6, 1996, Vol. 1, In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. Vol. 1. S. 346-349., 346–349, International Society for Computers and Their Applications (ISCA)
Learning to parse spontaneous speech
Buoe, F. D.; Waibel, A.
1996. Proceedings of the 4th International Conference on Spoken Language Processing, ICSLP 1996, Philadelphia, 3rd-6th October 1996, In: Proceedings of the International Conference on Spoken Language Processing, ICSLP 96, Philadelphia, PA 1996. Vol. 2. S. 1153-1156., International Society for Computers and Their Applications (ISCA)
Feaspar: a feature structure parser learning to parse spoken language
Buoe, F. D.; Waibel, A.
1996. Proceedings of the 16th International Conference on Computational Linguistics, {COLING}, Copenhagen, Denmark, August 5-9, 1996, Vol. 1, In: Proceedings. COLING-96, Copenhagen, Denmark 1996. Copenhagen 1996., Association for Computational Linguistics (ACL)
Adaptively growing hierarchical mixtures of experts
Fritsch, J.; Finke, M.; Waibel, A.
1996. Advances in neural information processing systems 9. Ed.: M.C. Mozer. Cambridge, MA 1997, In: Advances in neural information processing systems 9. Ed.: M.C. Mozer. Cambridge, MA 1997., 459–465, Massachusetts Institute of Technology Press (MIT Press)
Integrating different learning approaches into a multilingual spoken language translation system
Geutner, P.; Buoe, F. D.; Kemp, T.; Mcnair, A. E.; Rogina, I.; Schultz, T.; Sloboda, T.; Woszczyna, M.; Waibel, A.; et al.
1996. Connectionist, statistical and symbolic approaches to learning for natural language processing. Ed.: S. Wermter. Berlin 1996, 117–131, Springer-Verlag
See me, hear me: integrating automatic speech recognition and lip-reading
Duchnowski, P.; Meier, U.; Waibel, A.
1996. Proceedings of the 3rd International Conference on Spoken Language Processing, ICSLP 1994, Yokohama, 18th-22nd September 1994, In: International Conference on Spoken Language Processing, ICSLP, Yokohama 1994., International Society for Computers and Their Applications (ISCA). doi:10.21437/ICSLP.1994-139
Combining bitmaps with dynamic writing information for on-line handwriting recognition
Manke, S.; Finke, M.; Waibel, A.
1996. Proceedings of the 12th ICPR International Conference on Pattern Recognition, Vol. 3 - Conference C: Signal Processing (Cat. No.94CH3440-5), Jerusalem, Israel 1994, In: Proceedings of the 12th IAPR International Conference on Pattern Recognition, Jerusalem, Israel 1994. Los Alamitos, Calif. 1994., 596–598, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICPR.1994.577051
1995
Generierung eines Sprachmodells mit Hilfe automatischer Klassenbildung. student research project
Langenbach, J.; Waibel, A.; Geutner, P.
1995, December 1. Universität Karlsruhe (TH)
Implementierung eines Cluster-Algorithmus für Codebücher auf der Maspar MP-1. student research project
Jürgens, M.
1995, August 25. Universität Karlsruhe (TH)
Installation einer ISDN-Karte für ein Telefon-Auskunftssystem. master’s thesis
Baumgarten, F.-P.
1995, June 30. Universität Karlsruhe (TH)
PLP und RASTA-PLP : Zwei Verfahren zur Vorverarbeitung von Sprachsignalen. student research project
Bolten, E.
1995, May 31. Universität Karlsruhe (TH)
Using Domain Knowledge to Improve End to End Performance in a Speech Translation System. master’s thesis
Glickman, O.
1995, May 12. Carnegie Mellon University
Sicherheitsmaße für die Erkennung spontaner Sprache. student research project
Kopp, S.
1995, April 20. Universität Karlsruhe (TH)
Multimodal learning interfaces
Vo, M. T.; Houghton, R.; Yang, J.; Bub, U.; Meier, U.; Waibel, A.; Duchnowski, P.
1995. Proceedings of the ARPA Spoken Language Systems Technology Workshop, Austin, TX, January 22-25, 1995., In: Proceedings of the ARPA Spoken Language Systems Technology Workshop, Austin, TX 1995. San Francisco, CA 1995., Morgan Kaufmann Publishers
Robuste Systemarchitekturen für maschinelles Lippenlesen. diploma thesis
Meier, U.
1995, March. Universität Karlsruhe (TH)
Schnelle Vektorquantisierung durch Bucket Voronoi Intersection Suche. student research project
Fritsch, J.
1995, January 19. Universität Karlsruhe (TH)
Wordspotting-Techniken. student research project
Scheytt, P.
1995, January. Universität Karlsruhe (TH)
Erkennung von Sprache in Telephonqualität. diploma thesis
Bird, S.
1995. Universität Karlsruhe (TH)
Gaze Tracking based on Face-Color
Schiele, B.; Waibel, A.
1995. Proceedings of the International Workshop on Automatic Face- and Gesture-Recognition, Zurich, Switzerland 1995. Ed.: M. Bichsel. Zürich 1995
Estimation of the Head Orientation Based on a Face-Color-Intensifier
Schiele, B.; Waibel, A.
1995. Proceedings of the 3rd International Symposium on Intelligent Robotic Systems ’95 : [SIRS ’95]; Pisa, Italy, July 10 - 14, 1995. Ed.: C. Colombo
Using Context in Machine Translation of Spoken Language
Levin, L.; Glickman, O.; Qu, Y.; Rosé, C. P.; Gates, D.; Lavie, A.; Ess-Dykema, C. van; Waibel, A.
1995. Proceedings of the Sixth International Conference on Theoretical and Methodological Issues in Machine Translation, TMI’95, July 5-7, 1995. Bd.: 2, 173–187, Katholieke Universiteit Leuven (KU Leuven)
Integrating different learning approaches into a multilingual spoken language translation system
Geutner, P.; Suhm, B.; Buø, F. D.; Kemp, T.; Mayfield, L.; McNair, A. E.; Rogina, I.; Schultz, T.; Sloboda, T.; Ward, W.; Woszczyna, M.; Waibel, A.
1995. Connectionist, Statistical and Symbolic Approaches to Learning for Natural Language Processing. Ed.: S. Wermter, 117–131, Springer Berlin Heidelberg. doi:10.1007/3-540-60925-3_42
Connectionist F-Structure Transfer
Wang, Y.-Y.; Waibel, A.
1995. Recent Advances in Natural Language Processing : Selected Papers from RANLP ’95. Ed.: R. Mitkov, 393–404, John Benjamins Publishing. doi:10.1075/cilt.136.33wan
Natural Speech Processing in Practice: Experiences with the Verbmobil / Janus-2 System
Waibel, A.; Finke, M.; Gates, D.; Gavalda, M.; Geutner, P.; Kemp, T.; Lavie, A.; McNair, A.; Mayfield, L.; Maier, M.; Rogina, I.; Shima, K.; Sloboda, T.; Woszczyna, M.; Zeppenfeld, T.; Zhan, P.
1995. Machine translation and machine interpretation : proceedings of the VERBMOBIL Workshop at the University of Hamburg, Computer Science Department, October 1995. Ed.: W. Hahn, Universität Hamburg
Phoneme Recognition Using Time-Delay Neural Networks
Waibel, A.; Hanazawa, T.; Hinton, G.; Shikano, K.; Lang, K.
1995. Backpropagation : Theory, Architectures, and Applications. Ed.: Y. Chauvin, 35–61, Erlbaum
Künstliche neuronale Netzwerke zur adaptiven Geräuschreduktion für robuste Spracherkennung. PhD dissertation
Trompf, M.
1995. Fak. f. Elektrotechnik, Diss. v. 2.5.1995., Universität Karlsruhe (TH)
Konstruktive neuronale Lernverfahren auf Parallelrechnern. PhD dissertation
Prechelt, L.
1995. Düsseldorf 1995. (Fortschritt-Berichte VDI. Reihe 10, Nr.367.) Fak. f. Informatik, Diss. v. 15.2.1995., Universität Karlsruhe (TH)
Ermittlung von Verkehrsgeschehen durch Bildfolgenauswertung. PhD dissertation
Kollnig, H.
1995. Sankt Augustin 1995. (DISKI. 88.) Fak. f. Informatik, Diss. v. 15.2.1995., Universität Karlsruhe (TH)
IJCAI-Workshop on "New Approaches to Learning for Natural Language Processing", Montreal, August 21, 1995
Geutner, P.; Suhm, B.; Kemp, T.; Lavie, A.; Mcnair, A.; Rogina, I.; Sloboda, T.; Ward, W.; Woszczyna, M.; Waibel, A.
1995. 1995
Parsing real input in Janus: a concept-based approach to spoken language translation
Mayfield, L.; Gavalda, M.; Seo, Y.-H.; Suhm, B.; Ward, W.; Waibel, A.
1995. Proceedings of the 6th International Conference on Theoretical and Methodological Issues in Machine Translation, TMI 1995, Löwen, 5th-7th July 1995, In: Proceedings of the Theoretical and Methodical Issues in Machine Translation Conference, Leuven, Belgium 1995
Concept-based speech translation
Mayfield, L.; Gavalda, M.; Ward, W.; Waibel, A.
1995. IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, May 9-12, 1995, In: Conference proceedings. The 1995 International Conference on Acoustics, Speech and Signal Processing, ICASSP 1995, Detroit, Mich. Vol. 1. Piscataway, NJ 1995. S. 97-100., IEEEXplore
Integrating spelling into spoken dialogue recognition
Hild, H.; Waibel, A.
1995. Proc. 4th European Conference on Speech Communication and Technology (Eurospeech 1995), Madrid, 18th-21st September 1995, In: Proceedings of the 4th European Conference on Speech, Communication and Technology, EUROSPEECH, Madrid 1995. S. 1977-1980., 1977–1979, International Society for Computers and Their Applications (ISCA). doi:10.21437/Eurospeech.1995-482
The use of dynamic writing information in a connectionist on-line cursive handwriting recognition system
Manke, S.; Finke, M.; Waibel, A.
1995. Advances in neural information processing systems 7: Proceedings of the 1994 Conference. Ed.: G. Tesauro, In: Advances in neural information processing systems 7. Ed.: G. Tesauro. Cambridge, MA 1995., 1093–1100
Speeding up the score computation of HMM speech recognizers with the bucket Voronoi intersection algorithm
Fritsch, J.; Rogina, I.; Sloboda, T.; Waibel, A.
1995. Proceedings of the Eurospeech ’95, 4th European Conference on Speech Communication and Technology, Madrid, Spain, 18 - 21 September 1995. Vol. 2, 1091–1094, International Society for Computers and Their Applications (ISCA). doi:10.21437/Eurospeech.1995-212
Janus - towards multilingual spoken language translation
Suhm, B.; Geutner, P.; Kemp, T.; Rogina, I.; Schultz, T.; Sloboda, T.; Woszczyna, M.; Waibel, A.; et al.
1995. Proceedings of the ARPA Spoken Language Systems Technology Workshop, Austin, TX 1995. Vol. 1. San Francisco, CA 1995, 221–226, Morgan Kaufmann Publishers
The Janus speech recognizer
Rogina, I.; Waibel, A.
1995. Proceedings of the Spoken Language Systems Technology Workshop : January 22-25, 1995, Barton Creek Resort Conference Center, Austin, Texas, In: Proceedings of the ARPA Spoken Language Systems Technology Workshop, Austin, TX 1995. San Francisco, CA 1995., Morgan Kaufmann Publishers
Experiments with LVCSR based language identification
Schultz, T.; Rogina, I.; Waibel, A.
1995. Proceedings of the Speech Research Symposium, SRS XV, Baltimore, Md, January 1995, Baltimore
1994
Lippenlesen: verschiedene Methoden der visuellen Vorverarbeitung und Merkmalsextraktion. student research project
Meier, U.
1994, April 19. Universität Karlsruhe (TH)
Ein größeninvariantes Eingabefenster für das "Time Delay Neural Network". student research project
Morgenstern, T.
1994, March 8. Universität Karlsruhe (TH)
Sprachmodelle für einen Buchstabier-Erkenner. diploma thesis
Betz, M.; Waibel, A.; Hild, H.
1994. Universität Karlsruhe (TH)
Lokalisieren von Gesichtern mit Hilfe von neuronalen Netzen. diploma thesis
Hunke, M.
1994. Universität Karlsruhe (TH)
Learning State-Dependent Stream Weights For Multi-Codebook Hmm Speech Recognition Systems
Rogina, I.; Waibel, A.
1994. ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Adelaide, SA, 19 April - 22 April 1994. Vol.: 1, Part 2, I-217-I-220, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1994.389316
Inferring Linguistic Structure in Spoken Language
Woszczyna, M.; Waibel, A.
1994. 3rd International Conference on Spoken Language Processing (ICSLP 1994), 847–850, International Society for Computers and Their Applications (ISCA). doi:10.21437/ICSLP.1994-226
Towards Better Language Models For Spontaneous Speech
Suhm, B.; Waibel, A.
1994. ICSLP 94 : 1994 International Conference on Spoken Language Processing, September 18-22, 1994, Yokohama, Japan. Bd.: 4, 831–834, International Society for Computers and Their Applications (ISCA)
Hybrid Connectionist and Classical Approaches in JANUS. An Advanced Speech-to-Speech Translation System
Waibel, A.; Polzin, T. S.; Bodenhausen, U.; Buø, F. D.; Coccaro, N.; Hild, H.; Suhm, B.
1994. Proceedings of the the International Conference on Neural Information Processing, ICONIP 1994, Seoul, South Korea, October 17-20, 1994
Structured Connectionist Systems : Introduction
Waibel, A.
1994. Machine learning, 15 (2), 121–123. doi:10.1007/BF00993273
Improving Recognizer Acceptance Through Robust, Natural Speech Repair
McNair, A. E.; Waibel, A.
1994. ICSLP 94 : 1994 International Conference on Spoken Language Processing, September 18-22, 1994, Pacific Convention Plaza Yokohama (PACIFICO), Yokohama, Japan; [proceedings] / [Acoustical Society of Japan].: Vol. 4, 1299–1302, International Society for Computers and Their Applications (ISCA)
Automatic structuring of neural networks for spatio-temporal real-world applications. PhD dissertation
Bodenhausen, U.
1994. Fak. f. Informatik, Diss. v. 13.6.1994., Universität Karlsruhe (TH)
Bereichsbasierte Verfahren zur Straßenerkennung für die autonome Führung von Fahrzeugen. PhD dissertation
Zhang, J.
1994. Düsseldorf 1994. (Fortschritt-Berichte VDI. Reihe 10, Nr.298.) Fak. f. Informatik, Diss. v. 7.2.1994., Universität Karlsruhe (TH)
Speech-language integration in a multi-lingual speech translation system
Suhm, B.; Coccaro, N.; Waibel, A.; et al.
1994. Workshop on Integration of Natural Language and Speech Processing, AAAI-94, 1994. Vol. 1, 92–99
JANUS 93: towards spontaneous speech translation
Woszczyna, M.; Coccaro, N.; Kemp, T.; Rogina, I.; Schultz, T.; Waibel, A.; et al.
1994. 1994 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, Adelaide, Australia. Vol. 1. Piscataway, NJ 1994, I/345 - I/348 vol.1, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1994.389285
Face locating and tracking for human-computer interaction
Hunke, M.; Waibel, A.
1994. Conference Record of the 28th Asilomar Conference on Signals, Systems and Computers, ACSSC 1994, PAcific Grove, October 31 - November 2, 1994, In: Conference record of the 28th Asilomar Conference on Signals, Systems & Computers, Pacific Grove, Calif. 1994. Ed.: A. Singh. Los Alamitos, Calif. 1994., IEEEXplore. doi:10.1109/ACSSC.1994.471664
1993
Erkennung und Transkription neuer Wörter in der Spracherkennung. diploma thesis
Suhm, B.
1993, April 28. Universität Karlsruhe (TH)
Multi-Speaker/Speaker-Independent Architectures for the Multi-State Time Delay Neural Network
Hild, H.; Waibel, A.
1993. IEEE International Conference on Acoustics, Speech, and Signal Processing, Minneapolis, MN, USA, 27-30 April 1993, II-255 – II-258, IEEE Computer Society. doi:10.1109/ICASSP.1993.319284
A Multi-Modal Human-Computer Interface: Combination of Gesture and Speech Recognition
Vo, M. T.; Waibel, A.
1993. CHI ’93: INTERACT ’93 and CHI ’93 Conference Companion on Human Factors in Computing Systems. Ed.: S. Ashlund, 69–70, Association for Computing Machinery (ACM). doi:10.1145/259964.260076
Lippenlesen als Unterstützung zur robusten automatischen Spracherkennung. diploma thesis
Bregler, C.
1993. Universität Karlsruhe (TH)
Application Oriented Automatic Structuring of Time-Delay Neural Networks for High Performance Character and Speech Recognition
Bodenhausen, U.; Waibel, A.
1993. Proceedings of the International Conference on Neural Networks, ICNN 1993, San Francisco, CA, USA, 28 March 1993 - 01 April 1993, 1627–1632, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICNN.1993.298800
Improving the MS-TDNN for Word Spotting
Zeppenfeld, T.; Houghton, R.; Waibel, A.
1993. IEEE International Conference on Acoustics, Speech, and Signal Processing, Minneapolis, MN, USA, 27-30 April 1993, II-475 - II-478, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1993.319344
The Automatic Acquisition of Frequencies of Verb Subcategorization Frames from Tagged Corpora
Ushioda, A.; Evans, D. A.; Gibson, T.; Waibel, A.
1993. Proceedings of the Workshop: Acquisition of Lexical Knowledge from Text? sponsored by the Association for Computational Linguistics (ACL), 95–106, Association for Computational Linguistics (ACL)
Frequency Estimation of Verb Subcategorization Frames Based on Syntactic and Multidimensional Statistical Analysis
Ushioda, A.; Evans, D. A.; Gibson, T.; Waibel, A.
1993. Proceedings of the Third International Workshop on Parsing Technologies, 309–318, Association for Computational Linguistics (ACL)
Speaker-Independent Connected Letter Recognition With A Multi-State Time Delay Neural Network
Hild, H.; Waibel, A.
1993. Proceedings // Eurospeech 93, 3rd European Conference on Speech Communication and Technology, Berlin, Germany, 21 - 23 September 1993.: Vol. 2, 1481–1484, International Society for Computers and Their Applications (ISCA)
Tuning by Doing: Flexibility Through Automatic Structure Optimization
Bodenhausen, U.; Waibel, A.
1993. Proceedings // Eurospeech 93, 3rd European Conference on Speech Communication and Technology, Berlin, Germany, 21 - 23 September 1993. Vol.: 2, 1485–1488
Performance through Consistency: MS-TDNN’s for Large Vocabulary Continuous Speech Recognition
Tebelskis, J.; Waibel, A.
1993. Advances in neural information processing systems 5 : [collected papers presented at the Sixth Annual NIPS Conference (short for Neural Information Processing Systems - Natural and Synthetic), held in Denver, Colorado, from 30 November to 3 December, 1992], 696–703, Morgan Kaufmann Publishers
Recent advances in Janus: a speech translation system
Woszczyna, M.; Coccaro, N.; Eisele, A.; Mcnair, A.; Rogina, I.; Sloboda, T.; Waibel, A.; et al.
1993. Human Language Technology: Proceedings of a Workshop Held at Plainsboro, New Jersey, March 21-24, 1993, In: Proceedings of the 1993 ARPA-HLT Workshop
Connected letter recognition with a multi-state time delay neural network
Hild, H.; Waibel, A.
1993. Advances in neural information processing systems 5 : [collected papers presented at the Sixth Annual NIPS Conference (short for Neural Information Processing Systems - Natural and Synthetic), held in Denver, Colorado, from 30 November to 3 December, 1992]. Ed.: S. Hanson, In: Advances in neural information processing systems 5. Ed.: S.J. Hanson. San Mateo, CA 1993. S. 712-719., 712–719, Morgan Kaufmann Publishers
1992
The Meta-Pi Network: Building Distributed Knowledge Representations for Robust Multi-Source Pattern Recognition
Hampshire, J. B.; Waibel, A.
1992. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14 (7), 751–769
TDNN-Netzwerkarchitekturen zur on-line Einzelzeichenerkennung. master’s thesis
Bräutigam, C.
1992, March 15. Universität Karlsruhe (TH)
Algorithmen der Spracherkennung auf massiv parallelen SIMD-Rechnern. diploma thesis
Sloboda, T.
1992, January 14. Universität Karlsruhe (TH)
Spracherkennung mit Hardware Neuronalen Netzen. diploma thesis
Berthold, M.; Waibel, A.; Hild, H.
1992. Universität Karlsruhe (TH)
Reconnaissance de parole continue avec un système hybride neuronal et markovien. PhD dissertation
Devillers, L.
1992. Université Paris-Saclay
Testing Generality in Janus: A Multi-Lingual Speech Translation System
Osterholtz, L.; Augustine, C.; McNair, A.; Rogina, I.; Saito, H.; Sloboda, T.; Tebelskis, J.; Waibel, A.
1992. ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, CA, USA, 23-26 March 1992, I-209-I-212, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1992.225935
A Hybrid Neural Network, Dynamic Programming Word Spotter
Zeppenfeld, T.; Waibel, A. H.
1992. IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, CA, USA, 23-26 March 1992, II-77-II-80, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1992.226116
PARSEC: A Structured Connectionist Parsing System for Spoken Language
Jain, A. N.; Waibel, A.; Touretzky, D. S.
1992. IEEE International Conference on Acoustics, Speech, and Signal Processing, San Francisco, CA, USA, 23-26 March 1992, I-205 - I-208, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1992.225936
The Tempo 2 Algorithm: Adjusting Time-Delays by Supervised Learning
Bodenhausen, U.; Waibel, A.
1992. Advances in neural information processing systems 4 : [papers presented at the Fifth NIPS Conference (short for "Neural Information Processing Systems - Natural and Synthetic"), held in Denver, Colorado, from 2 - 5 December 1991]. Ed.: J. Moody, 155–161, Kaufmann
Connectionist Large Vocabulary Speech Recognition
Waibel, A.
1992. Speech Recognition and Understanding : Recent Advances, Trends and Applications. Ed.: P. Laface, 259–273, Springer Berlin Heidelberg. doi:10.1007/978-3-642-76626-8_28
JANUS: Speech-to-Speech Translation Using Connectionist and Non-Connectionist Techniques
Waibel, A.; Jain, A.; McNair, A.; Tebelskis, J.; Osterholtz, L.; Saito, H.; Schmidbauer, O.; Sloboda, T.; Wosczcyna, M.
1992. Advances in neural information processing systems 4 : [papers presented at the Fifth NIPS Conference , held in Denver, Colorado, from 2 - 5 December 1991]. Ed.: J. Moody, Morgan Kaufmann Publishers
Multi-State Time Delay Neural Networks for Continuous Speech Recognition
Haffner, P.; Waibel, A.
1992. Advances in neural information processing systems 4 : [papers presented at the Fifth NIPS Conference , held in Denver, Colorado, from 2 - 5 December 1991]. Ed.: J. Moody, 135–142, Morgan Kaufmann Publishers
1991
Continuous Speech Recognition by Linked Predictve Neural Networks
Tebelskis, J.; Waibel, A.; Petek, B.; Schmidbauer, O.
1991. Advances in neural information processing systems 3 : [papers presented at the NIPS Conference (short for "Neural Information Processing Systems - Natural and Synthetic"), held in Denver, Colorado, from 26 - 29 November 1990]. Ed.: R. Lippmann, 199–205, Morgan Kaufmann Publishers
Connectionist Approaches to Large Vocabulary Continuous Speech Recognition
Sawai, H.; Minami, Y.; Miyatake, M.; Waibel, A.; Shikano, K.
1991. Transactions of the IEICE, E 74 (7), 1834–1844
Integrated phoneme-function word architecture of hidden control neural networks for continuous speech recognition
Petek, B.; Waibel, A. H.; Tebelskis, J. M.
1991. 2nd European Conference on Speech Communication and Technology (Eurospeech 1991), Genua, 24th-26th September 1991, 1407–1410, ISCA. doi:10.21437/Eurospeech.1991-142
A New Fuzzy Training Method for Phoneme Identification Neural Networks
Komori, Y.; Sagayama, S.; Waibel, A.
1991. Proceedings of the Spring Meeting of the Acoustical Society of Japan
A Connectionist Model for Dialog Processing
Wang, Y.-Y.; Waibel, A.
1991. ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing, Toronto, ON, Canada, 14-17 April 1991, 785–788, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1991.150090
Learning the Architecture of Neural Networks for Speech Recognition
Bodenhausen, U.; Waibel, A.
1991. ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing, Toronto, ON, Canada,14-17 April 1991, 117–120, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1991.150292
Continuous Speech Recognition Using Linked Predictive Neural Networks
Tebelskis, J.; Waibel, A.; Petek, B.; Schmidbauer, O.
1991. ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing, Toronto, ON, Canada, 14-17 April 1991. Bd.: 1, 61–64, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1991.150278
Integrating Time Alignment and Neural Networks for High Performance Continuous Speech Recognition
Haffner, P.; Franzini, M.; Waibel, A.
1991. ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing, Toronto, ON, Canada,14-17 April 1991, 105–108, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1991.150289
JANUS: A Speech-to-Speech Translation System Using Connectionist and Symbolic Processing Strategies
Waibel, A.; Jain, A. N.; McNair, A. E.; Saito, H.; Hauptmann, A. G.; Tebelskis, J.
1991. ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing, Toronto, ON, Canada,14-17 April 1991, 793–796, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1991.150456
Review of TDNN (Time-Delay Neural Network) Architectures for Speech Recognition
Sugiyama, M.; Sawai, H.; Waibel, A. H.
1991. IEEE International Symposium on Circuits and Systems (ISCAS), Singapore, 11-14 June 1991, 582–585, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ISCAS.1991.176402
Evaluation Speaker-Independent Phoneme Recognition on TIMIT Database Using TDNNs
Hataoka, N.; Waibel, A. H.
1991. Eurospeech 91 : 2nd European Conference on Speech Communication and Technology, Genova, Italy, 24 - 26 September 1991. Ed.: G. Modena, general chairman]. Vol.: 1, 105–108, Istituto Internazionale Delle Comunicazioni
Speaker-Independent Phoneme Recognition on TIMIT Database Using Integrated Time-Delay Neural Networks (TDNNS)
Hataoka, N.; Waibel, A. H.
1991. IJCNN International Joint Conference on Neural Networks, San Diego, CA, USA, 17-21 June 1990, 57–62, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/IJCNN.1990.137544
Recent Work in Continuous Speech Recognition using the Connectionist Viterbi Training Procedure
Franzini, M. A.; Waibel, A. H.; Lee, K.-F.
1991. Proceedings 2nd European Conference on Speech Communication and Technology (Eurospeech 1991), 1213–1216, International Speech Communication Association (ISCA). doi:10.21437/Eurospeech.1991-177
Time-Delay Neural Networks Embedding Time Alignment: A Performance Analysis
Haffner, P.; Waibel, A.
1991. Proceedings of the 2nd European Conference on Speech Communication and Technology, EUROSPEECH 1991, 1415–1418, International Speech Communication Association (ISCA). doi:10.21437/Eurospeech.1991-144
Effectiveness of the Neural Fuzzy Training Method for Continuous Speech Recognition
Fukuzawa, K.; Komori, Y.; Sugiyama, M.; Sagayama, S.; Waibel, A.
1991. Proceedings of the Fall Meeting of the Acoustical Society of Japan, Japan, 1991
Connectionist Speaker Normalization and its Applications to Speech Recognition
Huang, X. D.; Lee, K. F.; Waibel, A.
1991. Neural Networks for Signal Processing Proceedings of the 1991 IEEE Workshop, 357–366, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/NNSP.1991.239506
Continuous Speech Recognition with the Connectionist Viterbi Training Procedure: A Summary of Recent Work
Franzini, M.; Waibel, A.; Lee, K.-F.
1991. Artificial Neural Networks : International Workshop IWANN ’91, Granada, Spain, September 17-19, 1991. Proceedings, 355–360, Springer Berlin Heidelberg
Parsing with Connectionist Networks
Jain, A. N.; Waibel, A. H.
1991. Current Issues in Parsing Technology. Ed.: M. Tomita, 243–260, Springer. doi:10.1007/978-1-4615-3986-5_16
Neural Networks Approaches for Speech Recognition
Waibel, A.
1991. Advances in speech signal processing. Ed.: S. Furui, 555–595, Dekker
1990
Readings in Speech Recognition
Lee, K.-F.; Waibel, A. (Eds.)
1990. Morgan Kaufmann Publishers
A Novel Objective Function for Improved Phoneme Recognition Using Time-Delay Neural Networks
Hampshire, J. B.; Waibel, A. H.
1990. IEEE transactions on neural networks, 1 (2), 216–228. doi:10.1109/72.80233
Incremental Parsing by Modular Recurrent Connectionist Networks
Jain, A. N.; Waibel, A. H.
1990. Advances in neural information processing systems. Ed.: D. Touretzky, 364–371, Morgan Kaufmann Publishers
Learned Phonetic Discrimination using Connectionist Networks
Watrous, R. L.; Shastri, L.; Waibel, A.
1990. Readings in Speech Recognition. Ed.: A. Waibel, 409–412, Morgan Kaufmann Publishers. doi:10.1016/B978-0-08-051584-7.50039-5
Connectionist Architectures for Multi Speaker Phoneme Recognition
Hampshire II, J. B.; Waibel, A.
1990. Advances in neural information processing systems 2 : [collected papers of the 1989 IEEE Conference on Neural Information Processing Systems - Natural and Synthetic, held November 27 - 30, 1989, in Denver, Colorado]. Ed.: D. Touretzky, Morgan Kaufmann Publishers
Robust Connectionist Parsing of Spoken Language
Jain, A. N.; Waibel, A. H.
1990. International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, USA, 03-06 April 1990, 593–596, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1990.115782
Connectionist Viterbi Training: A new Hybrid Method for Continuous Speech Recognition
Franzini, M.; Lee, K.-F.; Waibel, A.
1990. International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, USA, 03-06 April 1990, 425–428, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1990.115733
The Meta-Pi Network: Connectionist Rapid Adaptation for High-Performance Multi-Speaker Phoneme Recognition
Hampshire, J. B.; Waibel, A. H.
1990. International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, USA, 03-06 April 1990, 165–168, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1990.115564
Large Vocabulary Recognition Using Linked Predictive Neural Networks
Tebelskis, J.; Waibel, A.
1990. International Conference on Acoustics, Speech, and Signal Processing, Albuquerque, NM, USA, 03-06 April 1990, 437–440, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1990.115742
Phoneme-Based Word Recognition by Neural Network - A Step Toward Large Vocabulary Recognition
Hirai, A.; Waibel, A.
1990. 1990 IJCNN International Joint Conference on Neural Networks, San Diego, CA, USA, 17-21 June 1990, 671–676, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/IJCNN.1990.137915
Speech Recognition Using Sub-Phoneme Recognition Neural Network
Aikawa, K.; Waibel, A. H.
1990. Proceedings of the 1st International Conference on Spoken Language Processing, ICSLP 1990, 685–688, International Speech Communication Association (ISCA). doi:10.21437/ICSLP.1990-195
Machine Learning
Mitchell, T.; Buchanan, B.; DeJong, G.; Diettrich, T.; Rosenblum, P.; Waibel, A.; Mitchell, T.
1990. Annual review of Computer Science, 417–433
A Time-Delay Neural Network Architecture for Isolated Word Recognition
Lang, K. J.; Waibel, A. H.; Hinton, G. E.
1990. Neural Networks, 3 (1), 23–43
Connectionist Architectures for Multi-Speaker Phoneme Recognition
Hampshire, J. B.; Waibel, A.
1990. Advances in neural information processing systems 2 : [collected papers of the 1989 IEEE Conference on Neural Information Processing Systems - Natural and Synthetic, held November 27 - 30, 1989, in Denver, Colorado]. Ed.: D. Touretzky, 203–210, Morgan Kaufmann Publishers
1989
Modularity and Scaling in Large Phonemic Neural Networks
Waibel, A.; Sawai, H.; Shikano, K.
1989. IEEE transactions on acoustics, speech, and signal processing, 37 (12), 1888–1898. doi:10.1109/29.45535
A Connectionist Parser Aimed at Spoken Language
Jain, A.; Waibel, A.
1989. Proceedings of the 1st International Workshop on Parsing Technologies, IWPT 1989, Pittsburgh, Pennsylvania, USA. Ed.: M. Tomita, 221–229, Carnegie Mellon University
Building Blocks for Speech
Waibel, A.; Hampshire, J.
1989. Byte, 14 (8), 235–242
Modular Construction of Time-Delay Neural Networks for Speech Recognition
Waibel, A.
1989. Neural computation, 1 (1), 39–46
Phoneme recognition using time-delay neural networks
Waibel, A.; Hanazawa, T.; Hinton, G.; Shikano, K.; Lang, K. J.
1989. IEEE transactions on acoustics, speech, and signal processing, 37 (3), 328–339. doi:10.1109/29.21701
Spotting Japanese CV-Syllables and Phonemes Using Time-Delay Neural Networks
Sawai, H.; Waibel, A.; Miyatake, M.; Shikano, K.
1989. International Conference on Acoustics, Speech, and Signal Processing, Glasgow, UK, 23-26 May 1989, 25–28, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1989.266354
Fast Back-Propagation Learning Methods for Large Phonemic Neural Networks
Haffner, P.; Waibel, A.; Sawai, H.; Shikano, K.
1989. First European Conference on Speech Communication and Technology, 2553–2556, International Speech Communication Association (ISCA). doi:10.21437/Eurospeech.1989-95
Speech Recognition by Neural Networks
Waibel, A.; Hampshire, J.
1989. The 1989 Geno Summer School
1988
Fast Back-Propagation Learning Methods for Neural Networks in Speech
Haffner, P.; Sawai, H.; Waibel, A.; Shikano, K.
1988. First European Conference on Speech Communication and Technology, Eurospeech 1989, 2553–2556, International Society for Computers and Their Applications (ISCA)
A Preliminary Study on Spotting Japanese CV-Syllables by Time-Delay Neural Networks
Sawai, H.; Waibel, A.; Shikano, K.
1988. Proceedings of the Fall Meeting of the Acoustical Society of Japan, 223–224, Carnegie Mellon University
Noise Reduction by Neural Networks
Tamura, S.; Waibel, A.
1988. Meeting of the Institute of Electrical, Information and Communication Engineers (IEICE), Osaka, Japan, January 1988
Noise Reduction through Waveform Input and Output Using Neural Networks
Tamura, S.; Waibel, A.
1988. Spring Meeting of the Acoustical Society of Japan 1988, 426–427
Phoneme Recognition: Neural Networks vs. Hidden Markov Models
Waibel, A.; Hanazawa, T.; Hinton, G.; Shikano, K.; Lang, K.
1988. ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing, New York, NY, USA, 11-14 April 1988, 107–110, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1988.196523
Noise Reduction Using Connectionist Models
Tamura, S.; Waibel, A.
1988. ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing, New York, NY, USA, 11-14 April 1988, 553–556, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1988.196643
Connectionist Glue: Modular Design of Neural Speech Systems
Waibel, A.
1988. Proceedings of the 1988 Connectionist Models Summer School, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, June 17-26, 1988, 417–425
Phoneme Recognition by Modular Construction of Time-Delay Neural Networks
Waibel, A.; Sawai, H.; Shikano, K.
1988. Proceedings of the Fall Meeting of the Acoustical Society of Japan, October 1988, 225–226
Phoneme Recognition by Scaling up Modular Time-Delay Neural Networks
Sawai, H.; Waibel, A.; Miyatake, M.; Shikano, K.
1988. Meeting of the Institute of Electrical, Information and Communication Engineers (IEICE), 73–80
Prosody and Speech Recognition
Waibel, A.
1988. Morgan Kaufmann Publishers
1987
Phoneme Recognition Using Time-Delay Neural Networks
Waibel, A.; Hanazawa, T.; Hinton, G.; Shikano, K.; Lang, K.
1987. doi:10.5445/IR/1000181405
Phoneme Recognition Using Time-Delay Neural Networks
Waibel, A.
1987. Meeting of the Institute of Electrical, Information and Communication Engineers (IEICE), Tokyo, Japan, December 1987, 19–24
1986
Recognition of Lexical Stress in a Continuous Speech Understanding System - A Pattern Recognition Approach
Waibel, A.
1986. IEEE International Conference on Acoustics, Speech, and Signal Processing, Tokyo, Japan, 07-11 April 1986 ICASSP, 2287–2290, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1986.1168788
Suprasegmentals in Very large Vocabulary Word Recognition
Waibel, A.
1986. Pattern Recognition by Humans and Machines: Vol. 1: Speech Perception. Ed. E. Schwab, 159–184, Academic Press
1985
A Coarse Phonetic Knowledge Source for Template Independent Large Vocabulary Word Recognition
Lagger, H.; Waibel, A.
1985. IEEE International Conference on Acoustics, Speech and Signal Processing, Tampa, FL, USA, 26-29 April 1985 (ICASSP), 862–865, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1985.1168314
1984
Suprasegmentals in Very Large Vocabulary Isolated Word Recognition
Waibel, A.
1984. IEEE International Conference on Acoustics, Speech, and Signal Processing, San Diego, CA, USA, 19-21 March 1984 (ICASSP), 26.3.1–26.3.4, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1984.1172524
1983
Comparative Study of Nonlinear Time Warping Techniques in Isolated Word Speech Recognition Systems
Waibel, A.; Yegnanarayana, B.
1983. IEEE Transactions on Acoustics, Speech and Signal Processing, 31 (6), 1582–1586. doi:10.1109/TASSP.1983.1164241
1982
Very large vocabulary recognition (VLVR): using prosodic and spectral filters
Waibel, A.
1982. The Journal of the Acoustical Society of America, 72 (S1), S32. doi:10.1121/1.2019828
Performance Trade-Offs in Search Techniques for Isolated Word Speech Recognition
Bisiani, R.; Waibel, A.
1982. IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP ’82, Paris, France, May 3-5, 1982, 570–573, Institute of Electrical and Electronics Engineers (IEEE). doi:10.1109/ICASSP.1982.1171635

On these pages we have listed all our publications, including articles and contribution to conference. Furthermore you will find all PhD Thesis, Diploma/Masterthesis and Studien/Bachelorthesis compiled under our supervision. 

In case you need further information or have questions around our publications, just send an e-mail to Margit Rödder.