Qiantong Xu

Qiantong is a Research Engineer at Facebook AI Research (FAIR), focusing on acoustic modeling and language modeling in end-to-end speech recognition. He earned a B.S. in artificial intelligence from Peking University and an M.S. in computer science from Cornell University.

Qiantong's Publications

April 07, 2020

RESEARCH

COMPUTER VISION

Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions

We propose a fully convolutional sequence-to-sequence encoder architecture with a simple and efficient decoder. Our model improves WER on LibriSpeech while being an order of magnitude more efficient than a strong RNN baseline. Key to our…

Awni Hannun, Ann Lee, Qiantong Xu, Ronan Collobert

April 07, 2020

RESEARCH

NLP

Scaling up online speech recognition using ConvNets

We design an online end-to-end speech recognition system based on Time-Depth Separable (TDS) convolutions and Connectionist Temporal Classification (CTC). The system has almost three times the throughput of a well-tuned hybrid ASR baseline…

Vineel Pratap, Qiantong Xu, Jacob Kahn, Gilad Avidov, Tatiana Likhomanenko, Awni Hannun, Vitaliy Liptchinsky, Gabriel Synnaeve, Ronan Collobert
