Need advice about which tool to choose?Ask the StackShare community!

DMTK

4
18
+ 1
0
PyTorch

1.5K
1.5K
+ 1
43
Add tool

DMTK vs PyTorch: What are the differences?

DMTK: Microsoft Distributed Machine Learning Tookit. DMTK provides a parameter server based framework for training machine learning models on big data with numbers of machines. It is currently a standard C++ library and provides a series of friendly programming interfaces; PyTorch: A deep learning framework that puts Python first. PyTorch is not a Python binding into a monolothic C++ framework. It is built to be deeply integrated into Python. You can use it naturally like you would use numpy / scipy / scikit-learn etc.

DMTK and PyTorch belong to "Machine Learning Tools" category of the tech stack.

DMTK and PyTorch are both open source tools. It seems that PyTorch with 29.6K GitHub stars and 7.18K forks on GitHub has more adoption than DMTK with 2.69K GitHub stars and 595 GitHub forks.

Decisions about DMTK and PyTorch

Pytorch is a famous tool in the realm of machine learning and it has already set up its own ecosystem. Tutorial documentation is really detailed on the official website. It can help us to create our deep learning model and allowed us to use GPU as the hardware support.

I have plenty of projects based on Pytorch and I am familiar with building deep learning models with this tool. I have used TensorFlow too but it is not dynamic. Tensorflow works on a static graph concept that means the user first has to define the computation graph of the model and then run the ML model, whereas PyTorch believes in a dynamic graph that allows defining/manipulating the graph on the go. PyTorch offers an advantage with its dynamic nature of creating graphs.

See more
Fabian Ulmer
Software Developer at Hestia · | 3 upvotes · 49.2K views

For my company, we may need to classify image data. Keras provides a high-level Machine Learning framework to achieve this. Specifically, CNN models can be compactly created with little code. Furthermore, already well-proven classifiers are available in Keras, which could be used as Transfer Learning for our use case.

We chose Keras over PyTorch, another Machine Learning framework, as our preliminary research showed that Keras is more compatible with .js. You can also convert a PyTorch model into TensorFlow.js, but it seems that Keras needs to be a middle step in between, which makes Keras a better choice.

See more
Xi Huang
Developer at University of Toronto · | 8 upvotes · 90.9K views

For data analysis, we choose a Python-based framework because of Python's simplicity as well as its large community and available supporting tools. We choose PyTorch over TensorFlow for our machine learning library because it has a flatter learning curve and it is easy to debug, in addition to the fact that our team has some existing experience with PyTorch. Numpy is used for data processing because of its user-friendliness, efficiency, and integration with other tools we have chosen. Finally, we decide to include Anaconda in our dev process because of its simple setup process to provide sufficient data science environment for our purposes. The trained model then gets deployed to the back end as a pickle.

See more

A large part of our product is training and using a machine learning model. As such, we chose one of the best coding languages, Python, for machine learning. This coding language has many packages which help build and integrate ML models. For the main portion of the machine learning, we chose PyTorch as it is one of the highest quality ML packages for Python. PyTorch allows for extreme creativity with your models while not being too complex. Also, we chose to include scikit-learn as it contains many useful functions and models which can be quickly deployed. Scikit-learn is perfect for testing models, but it does not have as much flexibility as PyTorch. We also include NumPy and Pandas as these are wonderful Python packages for data manipulation. Also for testing models and depicting data, we have chosen to use Matplotlib and seaborn, a package which creates very good looking plots. Matplotlib is the standard for displaying data in Python and ML. Whereas, seaborn is a package built on top of Matplotlib which creates very visually pleasing plots.

See more
Get Advice from developers at your company using StackShare Enterprise. Sign up for StackShare Enterprise.
Learn More
Pros of DMTK
Pros of PyTorch
    Be the first to leave a pro
    • 15
      Easy to use
    • 11
      Developer Friendly
    • 10
      Easy to debug
    • 7
      Sometimes faster than TensorFlow

    Sign up to add or upvote prosMake informed product decisions

    Cons of DMTK
    Cons of PyTorch
      Be the first to leave a con
      • 3
        Lots of code
      • 1
        It eats poop

      Sign up to add or upvote consMake informed product decisions

      What is DMTK?

      DMTK provides a parameter server based framework for training machine learning models on big data with numbers of machines. It is currently a standard C++ library and provides a series of friendly programming interfaces.

      What is PyTorch?

      PyTorch is not a Python binding into a monolothic C++ framework. It is built to be deeply integrated into Python. You can use it naturally like you would use numpy / scipy / scikit-learn etc.

      Need advice about which tool to choose?Ask the StackShare community!

      What companies use DMTK?
      What companies use PyTorch?
      See which teams inside your own company are using DMTK or PyTorch.
      Sign up for StackShare EnterpriseLearn More

      Sign up to get full access to all the companiesMake informed product decisions

      What tools integrate with DMTK?
      What tools integrate with PyTorch?
        No integrations found

        Sign up to get full access to all the tool integrationsMake informed product decisions

        Blog Posts

        PythonDockerKubernetes+14
        12
        2603
        Dec 4 2019 at 8:01PM

        Pinterest

        KubernetesJenkinsTensorFlow+4
        5
        3274
        What are some alternatives to DMTK and PyTorch?
        TensorFlow
        TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them. The flexible architecture allows you to deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device with a single API.
        scikit-learn
        scikit-learn is a Python module for machine learning built on top of SciPy and distributed under the 3-Clause BSD license.
        Keras
        Deep Learning library for Python. Convnets, recurrent neural networks, and more. Runs on TensorFlow or Theano. https://keras.io/
        CUDA
        A parallel computing platform and application programming interface model,it enables developers to speed up compute-intensive applications by harnessing the power of GPUs for the parallelizable part of the computation.
        Streamlit
        It is the app framework specifically for Machine Learning and Data Science teams. You can rapidly build the tools you need. Build apps in a dozen lines of Python with a simple API.
        See all alternatives