Follow
Wei Chu
Title
Cited by
Cited by
Year
Reducing f0 frame error of f0 tracking algorithms under noisy conditions with an unvoiced/voiced classification frontend
W Chu, A Alwan
2009 IEEE International Conference on Acoustics, Speech and Signal …, 2009
1192009
Noise robust bird song detection using syllable pattern-based hidden Markov models
W Chu, DT Blumstein
2011 IEEE International Conference on Acoustics, Speech and Signal …, 2011
702011
SAFE: A Statistical Approach to F0 Estimation Under Clean and Noisy Conditions
W Chu, A Alwan
Audio, Speech, and Language Processing, IEEE Transactions on, 1-1, 2010
682010
Hybrid CTC-attention based end-to-end speech recognition using subword units
Z Xiao, Z Ou, W Chu, H Lin
2018 11th International Symposium on Chinese Spoken Language Processing …, 2018
412018
CASS-NAT: CTC Alignment-based Single Step Non-autoregressive Transformer for Speech Recognition
R Fan, W Chu, P Chang, J Xiao
ICASSP 2021, 2020
392020
Singing voice conversion with non-parallel data
X Chen, W Chu, J Guo, N Xu
2019 IEEE Conference on Multimedia Information Processing and Retrieval …, 2019
372019
Joint audio-video facial animation system
C Cao, X Chen, W Chu, Z Xue
US Patent 10,586,368, 2020
232020
An improved single step non-autoregressive transformer for automatic speech recognition
R Fan, W Chu, P Chang, J Xiao, A Alwan
arXiv preprint arXiv:2106.09885, 2021
182021
BandNet: A Neural Network-based, Multi-Instrument Beatles-Style MIDI Music Composition Machine
Y Zhou, W Chu, S Young, X Chen
20th annual conference of the International Society for Music Information …, 2018
172018
FBEM: A filter bank EM algorithm for the joint optimization of features and acoustic model parameters in bird call classification
W Chu, A Alwan
2012 IEEE International Conference on Acoustics, Speech and Signal …, 2012
142012
A Correlation-Maximization Denoising Filter Used as An Enhancement Frontend for Noise Robust Bird Call Classification
W Chu, A Alwan
Tenth Annual Conference of the International Speech Communication Association, 2009
102009
Low Resource German ASR with Untranscribed Data Spoken by Non-native Children--INTERSPEECH 2021 Shared Task SPAPL System
J Wang, Y Zhu, R Fan, W Chu, A Alwan
arXiv preprint arXiv:2106.09963, 2021
92021
Pictorial symbol prediction
W Brendel, F Barbieri, X Chen, W Chu, VSP Karuturi, LCDS Marujo, ...
US Patent 10,788,900, 2020
92020
A ctc alignment-based non-autoregressive transformer for end-to-end automatic speech recognition
R Fan, W Chu, P Chang, A Alwan
IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1436-1448, 2023
72023
Recognize Mispronunciations to Improve Non-Native Acoustic Modeling Through a Phone Decoder Built from One Edit Distance Finite State Automaton.
W Chu, Y Liu, J Zhou
INTERSPEECH, 3062-3066, 2020
52020
Joint audio-video driven facial animation
X Chen, C Cao, Z Xue, W Chu
2018 IEEE International Conference on Acoustics, Speech and Signal …, 2018
52018
Speech characteristic recognition and conversion
W Chu
US Patent 10,818,308, 2020
32020
Speaker Cluster-Based Speaker Adaptive Training for Deep Neural Network Acoustic Modeling
W Chu, R Chen
ICASSP, 2016
32016
Noise Robust Signal Processing for Human Pitch Tracking and Bird Song Classification and Detection
W Chu
University of California, Los Angeles, 2012
32012
SAFE: a statistical algorithm for F0 estimation for both clean and noisy speech
W Chu, A Alwan
Eleventh Annual Conference of the International Speech Communication Association, 2010
32010
The system can't perform the operation now. Try again later.
Articles 1–20