Microsoft Distributed Machine Learning Tookit

What is DMTK?

DMTK provides a parameter server based framework for training machine learning models on big data with numbers of machines. It is currently a standard C++ library and provides a series of friendly programming interfaces.

DMTK is a tool in the Machine Learning Tools category of a tech stack.

DMTK is an open source tool with 2.65K Github Stars and 598 Github Forks. Here’s a link to DMTK's open source repository on Github

Who Uses DMTK?

Why people like DMTK

Here’s a list of reasons why companies and developers use DMTK.

Add a one-liner

DMTK's Features

  • DMTK Framework: a flexible framework that supports unified interface for data parallelization, hybrid data structure for big model storage, model scheduling for big model training, and automatic pipelining for high training efficiency.
  • LightLDA, an extremely fast and scalable topic model algorithm, with a O(1) Gibbs sampler and an efficient distributed implementation.
  • Distributed (Multisense) Word Embedding, a distributed version of (multi-sense) word embedding algorithm.

DMTK's alternatives

  • TensorFlow - Open Source Software Library for Machine Intelligence
  • scikit-learn - Easy-to-use and general-purpose machine learning in Python
  • Keras - Deep Learning library for Theano and TensorFlow
  • PyTorch - A deep learning framework that puts Python first
  • ML Kit - Machine learning for mobile developers (by Google)

See all alternatives to DMTK