Bibliography of Denisov, Pavel
- 2023
- Lux, Florian; Koch, Julia; Meyer, Sarina; Bott, Thomas; Schauffler, Nadja; Denisov, Pavel; Schweitzer, Antje; Vu, Ngoc Thang
The IMS Toucan system for the Blizzard Challenge 2023
in Blizzard Challenge 2023.
- Meyer, Sarina; Lux, Florian; Koch, Julia; Denisov, Pavel; Tilli, Pascal; Vu, Ngoc Thang
Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning
in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 1-5.
- Meyer, Sarina; Tilli, Pascal; Denisov, Pavel; Lux, Florian; Koch, Julia; Vu, Ngoc Thang
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
in Proc. IEEE Spoken Language Technology Workshop (SLT) 2022 pp. 912-919 Doha, Qatar.
- 2022
- Arora, Siddhant; Dalmia, Siddharth; Denisov, Pavel; Chang, Xuankai; Ueda, Yushi; Peng, Yifan; Zhang, Yuekai; Kumar, Sujay; Ganesan, Karthik; Yan, Brian; Vu, Ngoc Thang; Black, Alan W; Watanabe, Shinji
ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet
.
- Hamed, Injy; Denisov, Pavel; Li, Chia-Yu; Elmahdy, Mohamed; Abdennadher, Slim; Vu, Ngoc Thang
Investigations on speech recognition systems for low-resource dialectal Arabic-English code-switching speech
Computer Speech & Language 72.
- Meyer, Sarina; Lux, Florian; Denisov, Pavel; Koch, Julia; Tilli, Pascal; Vu, Ngoc Thang
Speaker Anonymization with Phonetic Intermediate Representations
in Proc. Interspeech 2022 pp. 4925-4929 Incheon, Korea.
- Meyer, Sarina; Tilli, Pascal; Lux, Florian; Denisov, Pavel; Koch, Julia; Vu, Ngoc Thang
Cascade of Phonetic Speech Recognition, Speaker Embeddings GAN and Multispeaker Speech Synthesis for the VoicePrivacy 2022 Challenge
in Proc. 2nd Symposium on Security and Privacy in Speech Communication Incheon, Korea.
- 2021
- Denisov, Pavel; Mager, Manuel; Vu, Ngoc Thang
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021) pp. 175-181.
- Raj, Desh; Denisov, Pavel; Chen, Zhuo; Erdogan, Hakan; Huang, Zili; He, Maokui; Watanabe, Shinji; Du, Jun; Yoshioka, Takuya; Luo, Yi; Kanda, Naoyuki; Li, Jinyu; Wisdom, Scott; Hershey, John R.
Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis
2021 IEEE Spoken Language Technology Workshop (SLT) pp. 897-904.
- 2020
- Denisov, Pavel; Vu, Ngoc Thang
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
Proceedings of Interspeech 2020 pp. 881-885.
- Li, Chia-Yu; Ortega, Daniel; Väth, Dirk; Lux, Florian; Vanderlyn, Lindsey; Schmidt, Maximilian; Neumann, Michael; Völkel, Moritz; Denisov, Pavel; Jenne, Sabrina; Kacarevic, Zorica; Vu, Ngoc Thang
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations pp. 279-286.
- 2019
- Denisov, Pavel; Vu, Ngoc Thang
End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning
Proceedings of Interspeech 2019 pp. 4425-4429.
- Denisov, Pavel; Vu, Ngoc Thang
IMS-speech: A speech to text tool
Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2019 pp. 170-177.
- Ortega, Daniel; Li, Chia-Yu; Vallejo, Gisela; Denisov, Pavel; Vu, Ngoc Thang
Context-aware neural-based dialog act classification on automatically generated transcriptions
in 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 7265-7269 IEEE.
- 2018
- Denisov, Pavel; Vu, Ngoc Thang; Font, Marc Ferras
Unsupervised domain adaptation by adversarial learning for robust speech recognition
in Speech Communication; 13th ITG-Symposium.