Qiantong Xu

Qiantong is a Research Engineer at Facebook AI Research (FAIR), focusing on acoustic modeling and language modeling in end-to-end speech recognition. He earned a B.S. in artificial intelligence from Peking University, and an M.S. in computer science from Cornell University.

Qiantong's Publications

RESEARCH

COMPUTER VISION

Sequence-to-Sequence Speech Recognition with Time-Depth Separable Convolutions

We propose a fully convolutional sequence-to-sequence encoder architecture with a simple and efficient decoder. Our model improves WER on LibriSpeech while being an order of magnitude more efficient than a strong RNN baseline. Key to our…

Awni Hannun, Ann Lee, Qiantong Xu, Ronan Collobert,

RESEARCH

NLP

Scaling up online speech recognition using ConvNets

We design an online end-to-end speech recognition system based on Time-Depth Separable (TDS) convolutions and Connectionist Temporal Classification (CTC). The system has almost three times the throughput of a well tuned hybrid ASR baseline…

Vineel Pratap, Qiantong Xu, Jacob Kahn, Gilad Avidov, Tatiana Likhomanenko, Awni Hannun, Vitaliy Liptchinsky, Gabriel Synnaeve, Ronan Collobert,