Bibliography of Denisov, Pavel

Lux, Florian; Koch, Julia; Meyer, Sarina; Bott, Thomas; Schauffler, Nadja; Denisov, Pavel; Schweitzer, Antje; Vu, Ngoc Thang The IMS Toucan system for the Blizzard Challenge 2023 in Blizzard Challenge 2023.
Meyer, Sarina; Lux, Florian; Koch, Julia; Denisov, Pavel; Tilli, Pascal; Vu, Ngoc Thang Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning in ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 1-5.
Meyer, Sarina; Tilli, Pascal; Denisov, Pavel; Lux, Florian; Koch, Julia; Vu, Ngoc Thang Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy in Proc. IEEE Spoken Language Technology Workshop (SLT) 2022 pp. 912-919 Doha, Qatar.

Arora, Siddhant; Dalmia, Siddharth; Denisov, Pavel; Chang, Xuankai; Ueda, Yushi; Peng, Yifan; Zhang, Yuekai; Kumar, Sujay; Ganesan, Karthik; Yan, Brian; Vu, Ngoc Thang; Black, Alan W; Watanabe, Shinji ESPnet-SLU: Advancing Spoken Language Understanding through ESPnet .
Hamed, Injy; Denisov, Pavel; Li, Chia-Yu; Elmahdy, Mohamed; Abdennadher, Slim; Vu, Ngoc Thang Investigations on speech recognition systems for low-resource dialectal Arabic-English code-switching speech Computer Speech & Language 72.
Meyer, Sarina; Lux, Florian; Denisov, Pavel; Koch, Julia; Tilli, Pascal; Vu, Ngoc Thang Speaker Anonymization with Phonetic Intermediate Representations in Proc. Interspeech 2022 pp. 4925-4929 Incheon, Korea.
Meyer, Sarina; Tilli, Pascal; Lux, Florian; Denisov, Pavel; Koch, Julia; Vu, Ngoc Thang Cascade of Phonetic Speech Recognition, Speaker Embeddings GAN and Multispeaker Speech Synthesis for the VoicePrivacy 2022 Challenge in Proc. 2nd Symposium on Security and Privacy in Speech Communication Incheon, Korea.

Denisov, Pavel; Mager, Manuel; Vu, Ngoc Thang IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task Proceedings of the 18th International Conference on Spoken Language Translation (IWSLT 2021) pp. 175-181.
Raj, Desh; Denisov, Pavel; Chen, Zhuo; Erdogan, Hakan; Huang, Zili; He, Maokui; Watanabe, Shinji; Du, Jun; Yoshioka, Takuya; Luo, Yi; Kanda, Naoyuki; Li, Jinyu; Wisdom, Scott; Hershey, John R. Integration of Speech Separation, Diarization, and Recognition for Multi-Speaker Meetings: System Description, Comparison, and Analysis 2021 IEEE Spoken Language Technology Workshop (SLT) pp. 897-904.

Denisov, Pavel; Vu, Ngoc Thang Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning Proceedings of Interspeech 2020 pp. 881-885.
Li, Chia-Yu; Ortega, Daniel; Väth, Dirk; Lux, Florian; Vanderlyn, Lindsey; Schmidt, Maximilian; Neumann, Michael; Völkel, Moritz; Denisov, Pavel; Jenne, Sabrina; Kacarevic, Zorica; Vu, Ngoc Thang ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations pp. 279-286.

Denisov, Pavel; Vu, Ngoc Thang End-to-End Multi-Speaker Speech Recognition Using Speaker Embeddings and Transfer Learning Proceedings of Interspeech 2019 pp. 4425-4429.
Denisov, Pavel; Vu, Ngoc Thang IMS-speech: A speech to text tool Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung 2019 pp. 170-177.
Ortega, Daniel; Li, Chia-Yu; Vallejo, Gisela; Denisov, Pavel; Vu, Ngoc Thang Context-aware neural-based dialog act classification on automatically generated transcriptions in 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) pp. 7265-7269 IEEE.

Denisov, Pavel; Vu, Ngoc Thang; Font, Marc Ferras Unsupervised domain adaptation by adversarial learning for robust speech recognition in Speech Communication; 13th ITG-Symposium.