Alternatives to MLflow logo

Alternatives to MLflow

Kubeflow, Airflow, TensorFlow, DVC, and Seldon are the most popular alternatives and competitors to MLflow.
206
9

What is MLflow and what are its top alternatives?

MLflow is an open source platform for managing the end-to-end machine learning lifecycle.
MLflow is a tool in the Machine Learning Tools category of a tech stack.
MLflow is an open source tool with 18.8K GitHub stars and 4.2K GitHub forks. Here’s a link to MLflow's open source repository on GitHub

Top Alternatives to MLflow

  • Kubeflow
    Kubeflow

    The Kubeflow project is dedicated to making Machine Learning on Kubernetes easy, portable and scalable by providing a straightforward way for spinning up best of breed OSS solutions. ...

  • Airflow
    Airflow

    Use Airflow to author workflows as directed acyclic graphs (DAGs) of tasks. The Airflow scheduler executes your tasks on an array of workers while following the specified dependencies. Rich command lines utilities makes performing complex surgeries on DAGs a snap. The rich user interface makes it easy to visualize pipelines running in production, monitor progress and troubleshoot issues when needed. ...

  • TensorFlow
    TensorFlow

    TensorFlow is an open source software library for numerical computation using data flow graphs. Nodes in the graph represent mathematical operations, while the graph edges represent the multidimensional data arrays (tensors) communicated between them. The flexible architecture allows you to deploy computation to one or more CPUs or GPUs in a desktop, server, or mobile device with a single API. ...

  • DVC
    DVC

    It is an open-source Version Control System for data science and machine learning projects. It is designed to handle large files, data sets, machine learning models, and metrics as well as code. ...

  • Seldon
    Seldon

    Seldon is an Open Predictive Platform that currently allows recommendations to be generated based on structured historical data. It has a variety of algorithms to produce these recommendations and can report a variety of statistics. ...

  • Metaflow
    Metaflow

    It is a human-friendly Python library that helps scientists and engineers build and manage real-life data science projects. It was originally developed at Netflix to boost productivity of data scientists who work on a wide variety of projects from classical statistics to state-of-the-art deep learning. ...

  • Postman
    Postman

    It is the only complete API development environment, used by nearly five million developers and more than 100,000 companies worldwide. ...

  • Postman
    Postman

    It is the only complete API development environment, used by nearly five million developers and more than 100,000 companies worldwide. ...

MLflow alternatives & related posts

Kubeflow logo

Kubeflow

200
580
18
Machine Learning Toolkit for Kubernetes
200
580
+ 1
18
PROS OF KUBEFLOW
  • 9
    System designer
  • 3
    Google backed
  • 3
    Customisation
  • 3
    Kfp dsl
  • 0
    Azure
CONS OF KUBEFLOW
    Be the first to leave a con

    related Kubeflow posts

    Biswajit Pathak
    Project Manager at Sony · | 6 upvotes · 853.2K views

    Can you please advise which one to choose FastText Or Gensim, in terms of:

    1. Operability with ML Ops tools such as MLflow, Kubeflow, etc.
    2. Performance
    3. Customization of Intermediate steps
    4. FastText and Gensim both have the same underlying libraries
    5. Use cases each one tries to solve
    6. Unsupervised Vs Supervised dimensions
    7. Ease of Use.

    Please mention any other points that I may have missed here.

    See more
    Shared insights
    on
    KubeflowKubeflowKubernetesKubernetesMLflowMLflow

    We are trying to standardise DevOps across both ML (model selection and deployment) and regular software. Want to minimise the number of tools we have to learn. Also want a scalable solution which is easy enough to start small - eg. on a powerful laptop and eventually be deployed at scale. MLflow vs Kubernetes (Kubeflow)?

    See more
    Airflow logo

    Airflow

    1.7K
    2.7K
    128
    A platform to programmaticaly author, schedule and monitor data pipelines, by Airbnb
    1.7K
    2.7K
    + 1
    128
    PROS OF AIRFLOW
    • 53
      Features
    • 14
      Task Dependency Management
    • 12
      Beautiful UI
    • 12
      Cluster of workers
    • 10
      Extensibility
    • 6
      Open source
    • 5
      Complex workflows
    • 5
      Python
    • 3
      Good api
    • 3
      Apache project
    • 3
      Custom operators
    • 2
      Dashboard
    CONS OF AIRFLOW
    • 2
      Observability is not great when the DAGs exceed 250
    • 2
      Running it on kubernetes cluster relatively complex
    • 2
      Open source - provides minimum or no support
    • 1
      Logical separation of DAGs is not straight forward

    related Airflow posts

    Data science and engineering teams at Lyft maintain several big data pipelines that serve as the foundation for various types of analysis throughout the business.

    Apache Airflow sits at the center of this big data infrastructure, allowing users to “programmatically author, schedule, and monitor data pipelines.” Airflow is an open source tool, and “Lyft is the very first Airflow adopter in production since the project was open sourced around three years ago.”

    There are several key components of the architecture. A web UI allows users to view the status of their queries, along with an audit trail of any modifications the query. A metadata database stores things like job status and task instance status. A multi-process scheduler handles job requests, and triggers the executor to execute those tasks.

    Airflow supports several executors, though Lyft uses CeleryExecutor to scale task execution in production. Airflow is deployed to three Amazon Auto Scaling Groups, with each associated with a celery queue.

    Audit logs supplied to the web UI are powered by the existing Airflow audit logs as well as Flask signal.

    Datadog, Statsd, Grafana, and PagerDuty are all used to monitor the Airflow system.

    See more

    We are a young start-up with 2 developers and a team in India looking to choose our next ETL tool. We have a few processes in Azure Data Factory but are looking to switch to a better platform. We were debating Trifacta and Airflow. Or even staying with Azure Data Factory. The use case will be to feed data to front-end APIs.

    See more
    TensorFlow logo

    TensorFlow

    3.8K
    3.5K
    106
    Open Source Software Library for Machine Intelligence
    3.8K
    3.5K
    + 1
    106
    PROS OF TENSORFLOW
    • 32
      High Performance
    • 19
      Connect Research and Production
    • 16
      Deep Flexibility
    • 12
      Auto-Differentiation
    • 11
      True Portability
    • 6
      Easy to use
    • 5
      High level abstraction
    • 5
      Powerful
    CONS OF TENSORFLOW
    • 9
      Hard
    • 6
      Hard to debug
    • 2
      Documentation not very helpful

    related TensorFlow posts

    Tom Klein

    Google Analytics is a great tool to analyze your traffic. To debug our software and ask questions, we love to use Postman and Stack Overflow. Google Drive helps our team to share documents. We're able to build our great products through the APIs by Google Maps, CloudFlare, Stripe, PayPal, Twilio, Let's Encrypt, and TensorFlow.

    See more
    Conor Myhrvold
    Tech Brand Mgr, Office of CTO at Uber · | 8 upvotes · 2.8M views

    Why we built an open source, distributed training framework for TensorFlow , Keras , and PyTorch:

    At Uber, we apply deep learning across our business; from self-driving research to trip forecasting and fraud prevention, deep learning enables our engineers and data scientists to create better experiences for our users.

    TensorFlow has become a preferred deep learning library at Uber for a variety of reasons. To start, the framework is one of the most widely used open source frameworks for deep learning, which makes it easy to onboard new users. It also combines high performance with an ability to tinker with low-level model details—for instance, we can use both high-level APIs, such as Keras, and implement our own custom operators using NVIDIA’s CUDA toolkit.

    Uber has introduced Michelangelo (https://eng.uber.com/michelangelo/), an internal ML-as-a-service platform that democratizes machine learning and makes it easy to build and deploy these systems at scale. In this article, we pull back the curtain on Horovod, an open source component of Michelangelo’s deep learning toolkit which makes it easier to start—and speed up—distributed deep learning projects with TensorFlow:

    https://eng.uber.com/horovod/

    (Direct GitHub repo: https://github.com/uber/horovod)

    See more
    DVC logo

    DVC

    57
    91
    2
    Open-source Version Control System for Machine Learning Projects
    57
    91
    + 1
    2
    PROS OF DVC
    • 2
      Full reproducibility
    CONS OF DVC
    • 1
      Coupling between orchestration and version control
    • 1
      Requires working locally with the data
    • 1
      Doesn't scale for big data

    related DVC posts

    Shared insights
    on
    MLflowMLflowDVCDVC

    I already use DVC to keep track and store my datasets in my machine learning pipeline. I have also started to use MLflow to keep track of my experiments. However, I still don't know whether to use DVC for my model files or I use the MLflow artifact store for this purpose. Or maybe these two serve different purposes, and it may be good to do both! Can anyone help, please?

    See more
    Seldon logo

    Seldon

    13
    46
    0
    Open-source predictive analytics and recommendation engine
    13
    46
    + 1
    0
    PROS OF SELDON
      Be the first to leave a pro
      CONS OF SELDON
        Be the first to leave a con

        related Seldon posts

        Metaflow logo

        Metaflow

        16
        50
        0
        Build and manage real-life data science projects with ease (by Netflix)
        16
        50
        + 1
        0
        PROS OF METAFLOW
          Be the first to leave a pro
          CONS OF METAFLOW
            Be the first to leave a con

            related Metaflow posts

            Postman logo

            Postman

            94.4K
            80.9K
            1.8K
            Only complete API development environment
            94.4K
            80.9K
            + 1
            1.8K
            PROS OF POSTMAN
            • 490
              Easy to use
            • 369
              Great tool
            • 276
              Makes developing rest api's easy peasy
            • 156
              Easy setup, looks good
            • 144
              The best api workflow out there
            • 53
              It's the best
            • 53
              History feature
            • 44
              Adds real value to my workflow
            • 43
              Great interface that magically predicts your needs
            • 35
              The best in class app
            • 12
              Can save and share script
            • 10
              Fully featured without looking cluttered
            • 8
              Collections
            • 8
              Option to run scrips
            • 8
              Global/Environment Variables
            • 7
              Shareable Collections
            • 7
              Dead simple and useful. Excellent
            • 7
              Dark theme easy on the eyes
            • 6
              Awesome customer support
            • 6
              Great integration with newman
            • 5
              Documentation
            • 5
              Simple
            • 5
              The test script is useful
            • 4
              Saves responses
            • 4
              This has simplified my testing significantly
            • 4
              Makes testing API's as easy as 1,2,3
            • 4
              Easy as pie
            • 3
              API-network
            • 3
              I'd recommend it to everyone who works with apis
            • 3
              Mocking API calls with predefined response
            • 2
              Now supports GraphQL
            • 2
              Postman Runner CI Integration
            • 2
              Easy to setup, test and provides test storage
            • 2
              Continuous integration using newman
            • 2
              Pre-request Script and Test attributes are invaluable
            • 2
              Runner
            • 2
              Graph
            • 1
              <a href="http://fixbit.com/">useful tool</a>
            CONS OF POSTMAN
            • 10
              Stores credentials in HTTP
            • 9
              Bloated features and UI
            • 8
              Cumbersome to switch authentication tokens
            • 7
              Poor GraphQL support
            • 5
              Expensive
            • 3
              Not free after 5 users
            • 3
              Can't prompt for per-request variables
            • 1
              Import swagger
            • 1
              Support websocket
            • 1
              Import curl

            related Postman posts

            Noah Zoschke
            Engineering Manager at Segment · | 30 upvotes · 2.9M views

            We just launched the Segment Config API (try it out for yourself here) — a set of public REST APIs that enable you to manage your Segment configuration. A public API is only as good as its #documentation. For the API reference doc we are using Postman.

            Postman is an “API development environment”. You download the desktop app, and build API requests by URL and payload. Over time you can build up a set of requests and organize them into a “Postman Collection”. You can generalize a collection with “collection variables”. This allows you to parameterize things like username, password and workspace_name so a user can fill their own values in before making an API call. This makes it possible to use Postman for one-off API tasks instead of writing code.

            Then you can add Markdown content to the entire collection, a folder of related methods, and/or every API method to explain how the APIs work. You can publish a collection and easily share it with a URL.

            This turns Postman from a personal #API utility to full-blown public interactive API documentation. The result is a great looking web page with all the API calls, docs and sample requests and responses in one place. Check out the results here.

            Postman’s powers don’t end here. You can automate Postman with “test scripts” and have it periodically run a collection scripts as “monitors”. We now have #QA around all the APIs in public docs to make sure they are always correct

            Along the way we tried other techniques for documenting APIs like ReadMe.io or Swagger UI. These required a lot of effort to customize.

            Writing and maintaining a Postman collection takes some work, but the resulting documentation site, interactivity and API testing tools are well worth it.

            See more
            Simon Reymann
            Senior Fullstack Developer at QUANTUSflow Software GmbH · | 27 upvotes · 5.1M views

            Our whole Node.js backend stack consists of the following tools:

            • Lerna as a tool for multi package and multi repository management
            • npm as package manager
            • NestJS as Node.js framework
            • TypeScript as programming language
            • ExpressJS as web server
            • Swagger UI for visualizing and interacting with the API’s resources
            • Postman as a tool for API development
            • TypeORM as object relational mapping layer
            • JSON Web Token for access token management

            The main reason we have chosen Node.js over PHP is related to the following artifacts:

            • Made for the web and widely in use: Node.js is a software platform for developing server-side network services. Well-known projects that rely on Node.js include the blogging software Ghost, the project management tool Trello and the operating system WebOS. Node.js requires the JavaScript runtime environment V8, which was specially developed by Google for the popular Chrome browser. This guarantees a very resource-saving architecture, which qualifies Node.js especially for the operation of a web server. Ryan Dahl, the developer of Node.js, released the first stable version on May 27, 2009. He developed Node.js out of dissatisfaction with the possibilities that JavaScript offered at the time. The basic functionality of Node.js has been mapped with JavaScript since the first version, which can be expanded with a large number of different modules. The current package managers (npm or Yarn) for Node.js know more than 1,000,000 of these modules.
            • Fast server-side solutions: Node.js adopts the JavaScript "event-loop" to create non-blocking I/O applications that conveniently serve simultaneous events. With the standard available asynchronous processing within JavaScript/TypeScript, highly scalable, server-side solutions can be realized. The efficient use of the CPU and the RAM is maximized and more simultaneous requests can be processed than with conventional multi-thread servers.
            • A language along the entire stack: Widely used frameworks such as React or AngularJS or Vue.js, which we prefer, are written in JavaScript/TypeScript. If Node.js is now used on the server side, you can use all the advantages of a uniform script language throughout the entire application development. The same language in the back- and frontend simplifies the maintenance of the application and also the coordination within the development team.
            • Flexibility: Node.js sets very few strict dependencies, rules and guidelines and thus grants a high degree of flexibility in application development. There are no strict conventions so that the appropriate architecture, design structures, modules and features can be freely selected for the development.
            See more
            Postman logo

            Postman

            94.4K
            80.9K
            1.8K
            Only complete API development environment
            94.4K
            80.9K
            + 1
            1.8K
            PROS OF POSTMAN
            • 490
              Easy to use
            • 369
              Great tool
            • 276
              Makes developing rest api's easy peasy
            • 156
              Easy setup, looks good
            • 144
              The best api workflow out there
            • 53
              It's the best
            • 53
              History feature
            • 44
              Adds real value to my workflow
            • 43
              Great interface that magically predicts your needs
            • 35
              The best in class app
            • 12
              Can save and share script
            • 10
              Fully featured without looking cluttered
            • 8
              Collections
            • 8
              Option to run scrips
            • 8
              Global/Environment Variables
            • 7
              Shareable Collections
            • 7
              Dead simple and useful. Excellent
            • 7
              Dark theme easy on the eyes
            • 6
              Awesome customer support
            • 6
              Great integration with newman
            • 5
              Documentation
            • 5
              Simple
            • 5
              The test script is useful
            • 4
              Saves responses
            • 4
              This has simplified my testing significantly
            • 4
              Makes testing API's as easy as 1,2,3
            • 4
              Easy as pie
            • 3
              API-network
            • 3
              I'd recommend it to everyone who works with apis
            • 3
              Mocking API calls with predefined response
            • 2
              Now supports GraphQL
            • 2
              Postman Runner CI Integration
            • 2
              Easy to setup, test and provides test storage
            • 2
              Continuous integration using newman
            • 2
              Pre-request Script and Test attributes are invaluable
            • 2
              Runner
            • 2
              Graph
            • 1
              <a href="http://fixbit.com/">useful tool</a>
            CONS OF POSTMAN
            • 10
              Stores credentials in HTTP
            • 9
              Bloated features and UI
            • 8
              Cumbersome to switch authentication tokens
            • 7
              Poor GraphQL support
            • 5
              Expensive
            • 3
              Not free after 5 users
            • 3
              Can't prompt for per-request variables
            • 1
              Import swagger
            • 1
              Support websocket
            • 1
              Import curl

            related Postman posts

            Noah Zoschke
            Engineering Manager at Segment · | 30 upvotes · 2.9M views

            We just launched the Segment Config API (try it out for yourself here) — a set of public REST APIs that enable you to manage your Segment configuration. A public API is only as good as its #documentation. For the API reference doc we are using Postman.

            Postman is an “API development environment”. You download the desktop app, and build API requests by URL and payload. Over time you can build up a set of requests and organize them into a “Postman Collection”. You can generalize a collection with “collection variables”. This allows you to parameterize things like username, password and workspace_name so a user can fill their own values in before making an API call. This makes it possible to use Postman for one-off API tasks instead of writing code.

            Then you can add Markdown content to the entire collection, a folder of related methods, and/or every API method to explain how the APIs work. You can publish a collection and easily share it with a URL.

            This turns Postman from a personal #API utility to full-blown public interactive API documentation. The result is a great looking web page with all the API calls, docs and sample requests and responses in one place. Check out the results here.

            Postman’s powers don’t end here. You can automate Postman with “test scripts” and have it periodically run a collection scripts as “monitors”. We now have #QA around all the APIs in public docs to make sure they are always correct

            Along the way we tried other techniques for documenting APIs like ReadMe.io or Swagger UI. These required a lot of effort to customize.

            Writing and maintaining a Postman collection takes some work, but the resulting documentation site, interactivity and API testing tools are well worth it.

            See more
            Simon Reymann
            Senior Fullstack Developer at QUANTUSflow Software GmbH · | 27 upvotes · 5.1M views

            Our whole Node.js backend stack consists of the following tools:

            • Lerna as a tool for multi package and multi repository management
            • npm as package manager
            • NestJS as Node.js framework
            • TypeScript as programming language
            • ExpressJS as web server
            • Swagger UI for visualizing and interacting with the API’s resources
            • Postman as a tool for API development
            • TypeORM as object relational mapping layer
            • JSON Web Token for access token management

            The main reason we have chosen Node.js over PHP is related to the following artifacts:

            • Made for the web and widely in use: Node.js is a software platform for developing server-side network services. Well-known projects that rely on Node.js include the blogging software Ghost, the project management tool Trello and the operating system WebOS. Node.js requires the JavaScript runtime environment V8, which was specially developed by Google for the popular Chrome browser. This guarantees a very resource-saving architecture, which qualifies Node.js especially for the operation of a web server. Ryan Dahl, the developer of Node.js, released the first stable version on May 27, 2009. He developed Node.js out of dissatisfaction with the possibilities that JavaScript offered at the time. The basic functionality of Node.js has been mapped with JavaScript since the first version, which can be expanded with a large number of different modules. The current package managers (npm or Yarn) for Node.js know more than 1,000,000 of these modules.
            • Fast server-side solutions: Node.js adopts the JavaScript "event-loop" to create non-blocking I/O applications that conveniently serve simultaneous events. With the standard available asynchronous processing within JavaScript/TypeScript, highly scalable, server-side solutions can be realized. The efficient use of the CPU and the RAM is maximized and more simultaneous requests can be processed than with conventional multi-thread servers.
            • A language along the entire stack: Widely used frameworks such as React or AngularJS or Vue.js, which we prefer, are written in JavaScript/TypeScript. If Node.js is now used on the server side, you can use all the advantages of a uniform script language throughout the entire application development. The same language in the back- and frontend simplifies the maintenance of the application and also the coordination within the development team.
            • Flexibility: Node.js sets very few strict dependencies, rules and guidelines and thus grants a high degree of flexibility in application development. There are no strict conventions so that the appropriate architecture, design structures, modules and features can be freely selected for the development.
            See more