Help developers discover the tools you use. Get visibility for your team's tech choices and contribute to the community's knowledge.
It is a scalable graph database optimized for storing and querying graphs containing hundreds of billions of vertices and edges distributed across a multi-machine cluster. It is a transactional database that can support thousands of concurrent users executing complex graph traversals in real time. | It brings organization and collaboration to data science projects. All the experiement-related objects are backed-up and organized ready to be analyzed, reproduced and shared with others. Works with all common technologies and integrates with other tools. |
Elastic and linear scalability for a growing data and user base;
Data distribution and replication for performance and fault tolerance;
Multi-datacenter high availability and hot backups;
Support for ACID and eventual consistency;
Support for various storage backends: HBase, Cassandra, Bigtable, DynamoDB, BerkeleyDB;
Support for global graph data analytics, reporting, and ETL through integration with big data platforms: Spark, Giraph, Hadoop;
Support for geo, numeric range, and full-text search via:
ElasticSearch, Solr, Lucene;
Native integration with the Apache TinkerPop graph stack;
Open source under the Apache 2 license | Experiment tracking; Experiment versioning; Experiment comparison; Experiment monitoring; Experiment sharing; Notebook versioning |
Statistics | |
Stacks 44 | Stacks 16 |
Followers 96 | Followers 38 |
Votes 0 | Votes 2 |
Pros & Cons | |
No community feedback yet | Pros
Cons
|
Integrations | |

Neo4j stores data in nodes connected by directed, typed relationships with properties on both, also known as a Property Graph. It is a high performance graph store with all the features expected of a mature and robust database, like a friendly query language and ACID transactions.

TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them. The flexible architecture allows you to deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device with a single API.

scikit-learn is a Python module for machine learning built on top of SciPy and distributed under the 3-Clause BSD license.

PyTorch is not a Python binding into a monolothic C++ framework. It is built to be deeply integrated into Python. You can use it naturally like you would use numpy / scipy / scikit-learn etc.

Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on TensorFlow or Theano. https://keras.io/

The Kubeflow project is dedicated to making Machine Learning on Kubernetes easy, portable and scalable by providing a straightforward way for spinning up best of breed OSS solutions.

Use flexible and intuitive APIs to build and train models from scratch using the low-level JavaScript linear algebra library or the high-level layers API

An enterprise-grade open source platform for building, training, and monitoring large scale deep learning applications.

It is the app framework specifically for Machine Learning and Data Science teams. You can rapidly build the tools you need. Build apps in a dozen lines of Python with a simple API.

MLflow is an open source platform for managing the end-to-end machine learning lifecycle.