John Hershey

Cited by

	All	Since 2019
Citations	16842	12185
h-index	58	47
i10-index	142	104

Citations per year

Year	Citations
2004	42
2005	42
2006	55
2007	99
2008	85
2009	144
2010	212
2011	185
2012	273
2013	271
2014	290
2015	365
2016	411
2017	804
2018	1250
2019	1754
2020	2242
2021	2499
2022	2455
2023	2515
2024	717

Public access

6 articles available
0 articles not available
Based on funding mandates

Co-authors

Jonathan Le Roux	MERL	Verified email at merl.com
Shinji Watanabe	Carnegie Mellon University	Verified email at cmu.edu
Hakan Erdogan	Google	Verified email at google.com
Scott Wisdom	Google Research	Verified email at google.com
Takaaki Hori	Apple	Verified email at apple.com
Peder A Olsen	Microsoft Research (formerly IBM Research)	Verified email at microsoft.com
Zhuo Chen	Bytedance (formerly Microsoft, Columbia University)	Verified email at columbia.edu
Steven J. Rennie	Pryon Inc. (Formerly Fusemachines Inc, IBM Research, University of Toronto)	Verified email at pryoninc.com
Felix Weninger	Microsoft	Verified email at microsoft.com
Kevin Wilson	Google	Verified email at google.com
Trausti T Kristjansson	Amazon Lab126, Adjoint Professor at University of Reykjavik (formerly Google, IBM, MSR)	Verified email at amazon.com
Javier Movellan	Research Professor, University of California San Diego	Verified email at mplab.ucsd.edu
Chiori Hori	MERL	Verified email at merl.com
Tim K. Marks	Principal Research Scientist, Mitsubishi Electric Research Labs (MERL)	Verified email at merl.com
Efthymios Tzinis	Research Scientist at Google | Ex. UIUC, MERL, Meta	Verified email at google.com
Zhong-Qiu Wang	Postdoc, Carnegie Mellon University	Verified email at andrew.cmu.edu
Ron J Weiss	Google	Verified email at google.com
Yuuki Tachioka	Denso IT Laboratory	Verified email at d-itlab.co.jp
Björn Schuller	Professor, Technische Universität München (TUM) / Imperial College London & CSO, audEERING	Verified email at tum.de
Joshua M Susskind	Apple AI Research	Verified email at apple.com

John Hershey

Google (formerly MERL, IBM, MSR, UCSD)

Verified email at google.com

machine learning, sound separation, speech recognition, audio-visual perception


Title	Authors	Venue	Cited by	Year
Deep clustering: Discriminative embeddings for segmentation and separation	JR Hershey, Z Chen, J Le Roux, S Watanabe	2016 IEEE international conference on acoustics, speech and signal …, 2016	1463	2016
Approximating the Kullback Leibler divergence between Gaussian mixture models	JR Hershey, PA Olsen	2007 IEEE International Conference on Acoustics, Speech and Signal …, 2007	1349	2007
SDR–half-baked or well done?	J Le Roux, S Wisdom, H Erdogan, JR Hershey	ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019	1053	2019
Hybrid CTC/attention architecture for end-to-end speech recognition	S Watanabe, T Hori, S Kim, JR Hershey, T Hayashi	IEEE Journal of Selected Topics in Signal Processing 11 (8), 1240-1253, 2017	819	2017
Phase-sensitive and recognition-boosted speech separation using deep recurrent neural networks	H Erdogan, JR Hershey, S Watanabe, J Le Roux	2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015	734	2015
Speech enhancement with LSTM recurrent neural networks and its application to noise-robust ASR	F Weninger, H Erdogan, S Watanabe, E Vincent, J Le Roux, JR Hershey, ...	Latent Variable Analysis and Signal Separation: 12th International …, 2015	668	2015
Deep unfolding: Model-based inspiration of novel deep architectures	JR Hershey, JL Roux, F Weninger	arXiv preprint arXiv:1409.2574, 2014	483	2014
Single-channel multi-speaker separation using deep clustering	Y Isik, JL Roux, Z Chen, S Watanabe, JR Hershey	arXiv preprint arXiv:1607.02173, 2016	476	2016
Attention-based multimodal fusion for video description	C Hori, T Hori, TY Lee, Z Zhang, B Harsham, JR Hershey, TK Marks, ...	Proceedings of the IEEE international conference on computer vision, 4193-4202, 2017	401	2017
Voicefilter: Targeted voice separation by speaker-conditioned spectrogram masking	Q Wang, H Muckenhirn, K Wilson, P Sridhar, Z Wu, J Hershey, ...	arXiv preprint arXiv:1810.04826, 2018	388	2018
Audio vision: Using audio-visual synchrony to locate sounds	J Hershey, J Movellan	Advances in neural information processing systems 12, 1999	370	1999
Discriminatively trained recurrent neural networks for single-channel speech separation	F Weninger, JR Hershey, J Le Roux, B Schuller	2014 IEEE global conference on signal and information processing (GlobalSIP …, 2014	352	2014
Improved MVDR beamforming using single-channel mask prediction networks.	H Erdogan, JR Hershey, S Watanabe, MI Mandel, J Le Roux	Interspeech, 1981-1985, 2016	347	2016
Full-capacity unitary recurrent neural networks	S Wisdom, T Powers, J Hershey, J Le Roux, L Atlas	Advances in Neural Information Processing Systems, 4880-4888, 2016	342	2016
Multi-channel deep clustering: Discriminative spectral and spatial embeddings for speaker-independent speech separation	ZQ Wang, J Le Roux, JR Hershey	2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	253	2018
Monaural speech separation and recognition challenge	M Cooke, JR Hershey, SJ Rennie	Computer Speech & Language 24 (1), 1-15, 2010	247	2010
Alternative objective functions for deep clustering	ZQ Wang, J Le Roux, JR Hershey	2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018	212	2018
Deep beamforming networks for multi-channel speech recognition	X Xiao, S Watanabe, H Erdogan, L Lu, J Hershey, ML Seltzer, G Chen, ...	2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016	211	2016
Super-human multi-talker speech recognition: A graphical modeling approach	JR Hershey, SJ Rennie, PA Olsen, TT Kristjansson	Computer Speech & Language 24 (1), 45-66, 2010	211	2010
Universal sound separation	I Kavalerov, S Wisdom, H Erdogan, B Patton, K Wilson, J Le Roux, ...	2019 IEEE Workshop on Applications of Signal Processing to Audio and …, 2019	200	2019
