Kartik Audhkhasi
Verified email at google.com
Title
Cited by
Year
English conversational telephone speech recognition by humans and machines
G Saon, G Kurata, T Sercu, K Audhkhasi, S Thomas, D Dimitriadis, X Cui, ...
arXiv preprint arXiv:1703.02136, 2017
Cited by: 463; Year: 2017
Applying machine learning to facilitate autism diagnostics: pitfalls and promises
D Bone, MS Goodwin, MP Black, CC Lee, K Audhkhasi, S Narayanan
Journal of autism and developmental disorders 45, 1121-1136, 2015
Cited by: 273; Year: 2015
Direct acoustics-to-word models for English conversational speech recognition
K Audhkhasi, B Ramabhadran, G Saon, M Picheny, D Nahamoo
arXiv preprint arXiv:1703.07754, 2017
Cited by: 166; Year: 2017
Building competitive direct acoustics-to-word models for English conversational speech recognition
K Audhkhasi, B Kingsbury, B Ramabhadran, G Saon, M Picheny
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 136; Year: 2018
AVLnet: Learning audio-visual language representations from instructional videos
A Rouditchenko, A Boggust, D Harwath, B Chen, D Joshi, S Thomas, ...
arXiv preprint arXiv:2006.09199, 2020
Cited by: 130; Year: 2020
End-to-End ASR-free Keyword Search from Speech
K Audhkhasi, A Rosenberg, A Sethy, B Ramabhadran, B Kingsbury
arXiv preprint arXiv:1701.04313, 2017
Cited by: 125; Year: 2017
Noise-enhanced convolutional neural networks
K Audhkhasi, O Osoba, B Kosko
Neural Networks 78, 15-23, 2016
Cited by: 118; Year: 2016
Multilingual representations for low resource speech recognition and keyword search
J Cui, B Kingsbury, B Ramabhadran, A Sethy, K Audhkhasi, Z Tüske, ...
Proc. ASRU, 2015
Cited by: 109; Year: 2015
Joint modeling of accents and acoustics for multi-accent speech recognition
X Yang, K Audhkhasi, A Rosenberg, S Thomas, B Ramabhadran, ...
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
Cited by: 82; Year: 2018
Invariant representations for noisy speech recognition
D Serdyuk, K Audhkhasi, P Brakel, B Ramabhadran, S Thomas, Y Bengio
arXiv preprint arXiv:1612.01928, 2016
Cited by: 79; Year: 2016
Formant-based technique for automatic filled-pause detection in spontaneous spoken English
K Audhkhasi, K Kandhway, OD Deshmukh, A Verma
2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009
Cited by: 79; Year: 2009
Single headed attention based sequence-to-sequence model for state-of-the-art results on Switchboard
Z Tüske, G Saon, K Audhkhasi, B Kingsbury
arXiv preprint arXiv:2001.07263, 2020
Cited by: 78; Year: 2020
Which ASR should I choose for my dialogue system?
F Morbini, K Audhkhasi, K Sagae, R Artstein, D Can, P Georgiou, ...
SIGDIAL 2013, 2013
Cited by: 73; Year: 2013
Knowledge distillation across ensembles of multilingual models for low-resource languages
J Cui, B Kingsbury, B Ramabhadran, G Saon, T Sercu, K Audhkhasi, ...
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 71; Year: 2017
End-to-end speech recognition and keyword search on low-resource languages
A Rosenberg, K Audhkhasi, A Sethy, B Ramabhadran, M Picheny
2017 IEEE International Conference on Acoustics, Speech and Signal …, 2017
Cited by: 66; Year: 2017
Leveraging unpaired text data for training end-to-end speech-to-intent systems
Y Huang, HK Kuo, S Thomas, Z Kons, K Audhkhasi, B Kingsbury, R Hoory, ...
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 63; Year: 2020
Noise-enhanced convolutional neural networks
K Audhkhasi, B Kosko, O Osoba
US Patent 11,256,982, 2022
Cited by: 57; Year: 2022
Alignment-length synchronous decoding for RNN transducer
G Saon, Z Tüske, K Audhkhasi
ICASSP 2020-2020 IEEE International Conference on Acoustics, Speech and …, 2020
Cited by: 55; Year: 2020
External word embedding neural network language models
K Audhkhasi, B Ramabhadran, A Sethy
US Patent 10,019,438, 2018
Cited by: 53; Year: 2018
Automatic evaluation of spoken fluency
K Audhkhasi, OD Deshmukh, K Kandhway, A Verma
US Patent 8,457,967, 2013
Cited by: 53; Year: 2013
Articles 1–20