GrokNet: Unified Computer Vision Model Trunk and Embeddings For Commerce

August 22, 2020


In this paper, we present GrokNet, a deployed image recognition system for commerce applications. GrokNet leverages a multi-task learning approach to train a single computer vision trunk. We achieve a 2.1x improvement in exact product match accuracy when compared to the previous state-of-the-art Facebook product recognition system. We achieve this by training on 7 datasets across several commerce verticals, using 80 categorical loss functions and 3 embedding losses. We share our experience of combining diverse sources with wide-ranging label semantics and image statistics, including learning from human annotations, user-generated tags, and noisy search engine interaction data. GrokNet has demonstrated gains in production applications and operates at Facebook scale.

Download the Paper


Written by

Sean Bell

Yiqun Liu

Sami Alsheikh

Yina Tang

Ed Pizzi

M. Henning

Karun Singh

Omkar Parkhi

Fedor Borisyuk



Recent Publications

January 01, 2021

Asynchronous Gradient-Push | Facebook AI Research

We consider a multi-agent framework for distributed optimization where each agent has access to a local smooth strongly convex function, and the collective goal is to achieve consensus on the parameters that minimize the sum of the agents’…

Mahmoud Assran, Michael Rabbat

January 01, 2021

November 16, 2020


An Imitation Game for Learning Semantic Parsers from User Interaction

Despite the widely successful applications, building a semantic parser is still a tedious process in practice with challenges from costly data annotation and privacy risks.…

Ziyu Yao, Yiqi Tang, Wen-tau Yih, Huan Sun, Yu Su

November 16, 2020

November 16, 2020


Dense Passage Retrieval for Open-Domain Question Answering

Open-domain question answering relies on efficient passage retrieval to select candidate contexts, where traditional sparse vector space models, such as TF-IDF or BM25, are the de facto method.…

Vladimir Karpukhin, Barlas Oguz, Sewon Min, Patrick Lewis, Ledell Wu, Sergey Edunov, Danqi Chen, Wen-tau Yih

November 16, 2020

November 16, 2020


Measuring the Similarity of Grammatical Gender Systems by Comparing Partitions

A grammatical gender system divides a lexicon into a small number of relatively fixed grammatical categories. How similar are these gender systems across languages? To quantify the similarity…

Arya D. McCarthy, Adina Williams, Shijia Liu, David Yarowsky, Ryan Cotterell

November 16, 2020

Help Us Pioneer The Future of AI

We share our open source frameworks, tools, libraries, and models for everything from research exploration to large-scale production deployment.