Holger Schwenk

Holger Schwenk is a research scientist at Facebook Artificial Intelligence Research, Paris. He received his PhD in computer science from the University of Paris 6 in 1996. He then spent one year at the University of Montreal working with Y. Bengio and one year at the International Computer Science Institute in Berkeley. From 1998 to 2007, Holger held an assistant professor position at the University of Paris 11/LIMSI. Prior to joining Facebook in 2015, he was a professor of computer science at the University of Le Mans, where he led a large group on statistical machine translation. In 2013, Holger was named a senior member of the Institut Universitaire de France.

Holger's Publications

November 03, 2019

RESEARCH

ML APPLICATIONS

Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond

We introduce an architecture to learn joint multilingual sentence representations for 93 languages, belonging to more than 30 different families and written in 28 different scripts. Our system uses a single BiLSTM encoder with a shared…

Mikel Artetxe, Holger Schwenk

August 02, 2019

RESEARCH

ML APPLICATIONS

Low-Resource Corpus Filtering using Multilingual Sentence Embeddings

In this paper, we describe our submission to the WMT19 low-resource parallel corpus filtering shared task. Our main approach is based on the LASER toolkit (Language-Agnostic SEntence Representations), which uses an encoder-decoder architecture…

Vishrav Chaudhary, Yuqing Tang, Francisco (Paco) Guzmán, Holger Schwenk, Philipp Koehn

July 27, 2019

RESEARCH

ML APPLICATIONS

Margin-based Parallel Corpus Mining with Multilingual Sentence Embeddings

Machine translation is highly sensitive to the size and quality of the training data, which has led to an increasing interest in collecting and filtering large parallel corpora. In this paper, we propose a new method for this task based on…

Mikel Artetxe, Holger Schwenk
