Microsoft Distributed Machine Learning Tookit
Companies using DMTK
Why people like DMTK
Add a one-liner



DMTK provides a parameter server based framework for training machine learning models on big data with numbers of machines. It is currently a standard C++ library and provides a series of friendly programming interfaces.

DMTK's Features
  • DMTK Framework: a flexible framework that supports unified interface for data parallelization, hybrid data structure for big model storage, model scheduling for big model training, and automatic pipelining for high training efficiency.
  • LightLDA, an extremely fast and scalable topic model algorithm, with a O(1) Gibbs sampler and an efficient distributed implementation.
  • Distributed (Multisense) Word Embedding, a distributed version of (multi-sense) word embedding algorithm.