Följ
Pavel Denisov
Pavel Denisov
Verifierad e-postadress på ims.uni-stuttgart.de - Startsida
Titel
Citeras av
Citeras av
År
Integration of speech separation, diarization, and recognition for multi-speaker meetings: System description, comparison, and analysis
D Raj, P Denisov, Z Chen, H Erdogan, Z Huang, M He, S Watanabe, J Du, ...
2021 IEEE Spoken Language Technology Workshop (SLT), 897-904, 2021
812021
ESPnet-SLU: Advancing Spoken Language Understanding Through ESPnet
S Arora, S Dalmia, P Denisov, X Chang, Y Ueda, Y Peng, Y Zhang, ...
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
602022
Investigations on speech recognition systems for low-resource dialectal Arabic–English code-switching speech
I Hamed, P Denisov, CY Li, M Elmahdy, S Abdennadher, NT Vu
Computer Speech & Language 72, 101278, 2022
402022
Pretrained Semantic Speech Embeddings for End-to-End Spoken Language Understanding via Cross-Modal Teacher-Student Learning
P Denisov, NT Vu
Interspeech 2020, 881-885, 2020
312020
End-to-End Multi-Speaker Speech Recognition using Speaker Embeddings and Transfer Learning
P Denisov, NT Vu
Interspeech 2019, 4425-4429, 2019
242019
Unsupervised domain adaptation by adversarial learning for robust speech recognition
P Denisov, NT Vu, MF Font
Speech Communication; 13th ITG-Symposium, 1-5, 2018
202018
Speaker Anonymization with Phonetic Intermediate Representations
S Meyer, F Lux, P Denisov, J Koch, P Tilli, NT Vu
Interspeech 2022, 4925-4929, 2022
182022
IMS-speech: A speech to text tool
P Denisov, NT Vu
Studientexte zur Sprachkommunikation: Elektronische Sprachsignalverarbeitung …, 2019
152019
Context-aware Neural-based Dialog Act Classification on Automatically Generated Transcriptions
D Ortega, CY Li, G Vallejo, P Denisov, NT Vu
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
132019
Anonymizing Speech with Generative Adversarial Networks to Preserve Speaker Privacy
S Meyer, P Tilli, P Denisov, F Lux, J Koch, NT Vu
2022 IEEE Spoken Language Technology Workshop (SLT), 912-919, 2023
112023
The IMS Toucan System for the Blizzard Challenge 2023
F Lux, J Koch, S Meyer, T Bott, N Schauffler, P Denisov, A Schweitzer, ...
18th Blizzard Challenge Workshop, 2023
92023
ADVISER: A Toolkit for Developing Multi-modal, Multi-domain and Socially-engaged Conversational Agents
CY Li, D Ortega, D Väth, F Lux, L Vanderlyn, M Schmidt, M Neumann, ...
arXiv preprint arXiv:2005.01777, 2020
82020
Exploring Speech Recognition, Translation, and Understanding with Discrete Speech Units: A Comparative Study
X Chang, B Yan, K Choi, J Jung, Y Lu, S Maiti, R Sharma, J Shi, J Tian, ...
arXiv preprint arXiv:2309.15800, 2023
62023
IMS' Systems for the IWSLT 2021 Low-Resource Speech Translation Task
P Denisov, M Mager, NT Vu
2021 International Conference on Spoken Language Translation (IWSLT), 175-181, 2021
32021
Prosody Is Not Identity: A Speaker Anonymization Approach Using Prosody Cloning
S Meyer, F Lux, J Koch, P Denisov, P Tilli, NT Vu
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
22023
Cascade of Phonetic Speech Recognition, Speaker Embeddings GAN and Multispeaker Speech Synthesis for the VoicePrivacy 2022 Challenge
S Meyer, P Tilli, F Lux, P Denisov, J Koch, NT Vu
2nd Symposium on Security and Privacy in Speech Communication, 2022
22022
Leveraging Multilingual Self-Supervised Pretrained Models for Sequence-to-Sequence End-to-End Spoken Language Understanding
P Denisov, NT Vu
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-8, 2023
2023
Findings of the Second AmericasNLP Competition on Speech-to-Text Translation
A Ebrahimi, M Mager, A Wiemerslage, P Denisov, A Oncevay, D Liu, ...
NeurIPS 2022 Competition Track 220, 217-232, 2022
2022
Systemet kan inte utföra åtgärden just nu. Försök igen senare.
Artiklar 1–18