Need advice about which tool to choose?Ask the StackShare community!

Gensim

64
81
+ 1
0
SpaCy

211
283
+ 1
14
Add tool

Gensim vs SpaCy: What are the differences?

What is Gensim? A python library for Topic Modelling. It is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. Target audience is the natural language processing (NLP) and information retrieval (IR) community.

What is SpaCy? Industrial-Strength Natural Language Processing in Python. It is a library for advanced Natural Language Processing in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products. It comes with pre-trained statistical models and word vectors, and currently supports tokenization for 49+ languages.

Gensim and SpaCy belong to "NLP / Sentiment Analysis" category of the tech stack.

Gensim is an open source tool with 9.65K GitHub stars and 3.52K GitHub forks. Here's a link to Gensim's open source repository on GitHub.

According to the StackShare community, SpaCy has a broader approval, being mentioned in 14 company stacks & 11 developers stacks; compared to Gensim, which is listed in 3 company stacks and 5 developer stacks.

Get Advice from developers at your company using StackShare Enterprise. Sign up for StackShare Enterprise.
Learn More
Pros of Gensim
Pros of SpaCy
    Be the first to leave a pro
    • 12
      Speed
    • 2
      No vendor lock-in

    Sign up to add or upvote prosMake informed product decisions

    Cons of Gensim
    Cons of SpaCy
      Be the first to leave a con
      • 1
        Requires creating a training set and managing training

      Sign up to add or upvote consMake informed product decisions

      What is Gensim?

      It is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. Target audience is the natural language processing (NLP) and information retrieval (IR) community.

      What is SpaCy?

      It is a library for advanced Natural Language Processing in Python and Cython. It's built on the very latest research, and was designed from day one to be used in real products. It comes with pre-trained statistical models and word vectors, and currently supports tokenization for 49+ languages.

      Need advice about which tool to choose?Ask the StackShare community!

      Jobs that mention Gensim and SpaCy as a desired skillset
      What companies use Gensim?
      What companies use SpaCy?
      See which teams inside your own company are using Gensim or SpaCy.
      Sign up for StackShare EnterpriseLearn More

      Sign up to get full access to all the companiesMake informed product decisions

      What tools integrate with Gensim?
      What tools integrate with SpaCy?
      What are some alternatives to Gensim and SpaCy?
      NLTK
      It is a suite of libraries and programs for symbolic and statistical natural language processing for English written in the Python programming language.
      Keras
      Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on TensorFlow or Theano. https://keras.io/
      FastText
      It is an open-source, free, lightweight library that allows users to learn text representations and text classifiers. It works on standard, generic hardware. Models can later be reduced in size to even fit on mobile devices.
      TensorFlow
      TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them. The flexible architecture allows you to deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device with a single API.
      Transformers
      It provides general-purpose architectures (BERT, GPT-2, RoBERTa, XLM, DistilBert, XLNet…) for Natural Language Understanding (NLU) and Natural Language Generation (NLG) with over 32+ pretrained models in 100+ languages and deep interoperability between TensorFlow 2.0 and PyTorch.
      See all alternatives