August 22, 2020
In this paper, we present GrokNet, a deployed image recognition system for commerce applications. GrokNet leverages a multi-task learning approach to train a single computer vision trunk. We achieve a 2.1x improvement in exact product match accuracy when compared to the previous state-of-the-art Facebook product recognition system. We achieve this by training on 7 datasets across several commerce verticals, using 80 categorical loss functions and 3 embedding losses. We share our experience of combining diverse sources with wide-ranging label semantics and image statistics, including learning from human annotations, user-generated tags, and noisy search engine interaction data. GrokNet has demonstrated gains in production applications and operates at Facebook scale.
Written by
Sean BellYiqun Liu
Sami Alsheikh
Yina Tang
Ed Pizzi
M. Henning
Karun Singh
Omkar Parkhi
Fedor Borisyuk
Publisher
KDD
February 01, 2021
Inference in deep neural networks can be computationally expensive, and networks capable of anytime inference are important in scenarios where the amount of compute or quantity of input data varies over time.…
Adria Ruiz, Jakob Verbeek
February 01, 2021
January 09, 2021
Super-resolution aims at increasing the resolution and level of detail within an image.…
Baptiste Roziere, Nathanaël Carraz Rakotonirina, Vlad Hosu, Andry Rasoanaivo, Hanhe Lin, Camille Couprie, Olivier Teytaud
January 09, 2021
January 01, 2021
We consider a multi-agent framework for distributed optimization where each agent has access to a local smooth strongly convex function, and the collective goal is to achieve consensus on the parameters that minimize the sum of the agents’…
Mahmoud Assran, Michael Rabbat
January 01, 2021
December 12, 2020
Bridging logical and algorithmic reasoning with modern machine learning techniques is a fundamental challenge with potentially transformative impact…
Anselm Paulus, Michal Rolinek, Vit Musil, Brandon Amos, Georg Martius
December 12, 2020