TOOLS

StarSpace

StarSpace is a general purpose neural embedding model that can be applied to many areas including text classification.

A multi-purpose learning model

StarSpace learns to represent objects of different types into a common vectorial embedding space in order to compare them against each other. This makes it well suited for a variety of problems, including:

  • Learning word, sentence or document level embeddings.

  • Information retrieval - ranking of sets of entities / documents or objects, e.g., ranking web documents.

  • Text classification, or any other labeling task.

  • Metric / similarity learning, e.g., learning sentence or document similarity.

  • Content-based or Collaborative Filtering-based Recommendation, e.g., recommending music or videos.

  • Embedding graphs, e.g., multi-relational graphs such as Freebase.

Additional details on StarSpace can be found in the research paper.

Get Started

1

Install the Boost library and specify the path of the Boost library in makefile in order to run StarSpace.

1
2
3
$ wget https://dl.bintray.com/boostorg/release/1.63.0/source/boost_1_63_0.zip
$ unzip boost_1_63_0.zip
$ sudo mv boost_1_63_0 /usr/local/bin
          

Optional: To run unit tests in src directory, Google Test is required, and its path needs to be specified in 'TEST_INCLUDES' in the makefile.

2

Build StarSpace

1
2
3
git clone https://github.com/facebookresearch/Starspace.git
cd Starspace
make
          

3

Review documentation to familiarize yourself with the StarSpace file format, example use cases, parameters, and functions.

4

Prepare datasets, train, and evaluate models with StarSpace.

More Tools