MLflow

Utilities / Application Utilities / Machine Learning Tools

Murali Nagaraj

Needs advice

and

We are trying to standardise DevOps across both ML (model selection and deployment) and regular software. Want to minimise the number of tools we have to learn. Also want a scalable solution which is easy enough to start small - eg. on a powerful laptop and eventually be deployed at scale. MLflow vs Kubernetes (Kubeflow)?

5 upvotes·53.4K views

Replies (1)

Amir Mehler

DevOps Manager at AI21 Labs·Mar 2, 2024

Recommends

Argo

We do MLOps with Argo Workflows. But generally use Kubernetes for everything, even for the jobs Argo Workflows are running. Just adding text so the red bubble leaves me alone, more text, text, yes text.

3 upvotes·2.7K views

Biswajit Pathak

Project Manager at Sony·Sep 13, 2021

Needs advice

FastText

and

Gensim

Can you please advise which one to choose FastText Or Gensim, in terms of:

Operability with ML Ops tools such as MLflow, Kubeflow, etc.
Performance
Customization of Intermediate steps
FastText and Gensim both have the same underlying libraries
Use cases each one tries to solve
Unsupervised Vs Supervised dimensions
Ease of Use.

Please mention any other points that I may have missed here.

6 upvotes·854.4K views

Needs advice

and

I already use DVC to keep track and store my datasets in my machine learning pipeline. I have also started to use MLflow to keep track of my experiments. However, I still don't know whether to use DVC for my model files or I use the MLflow artifact store for this purpose. Or maybe these two serve different purposes, and it may be good to do both! Can anyone help, please?

7 upvotes·263K views

Replies (2)

Simon Lousky

Jun 24, 2021

Recommends

DVC

MLflow

I personally think that MLflow does a great job at experiment tracking, but If you've already set dvc and you're already using it, it makes more sense to me to keep data, code and model in the context of the same commit, under the same roof, than having some dangling files in another system that requires you to track down a commit on the ui, and then get a link to the model manually. Using artifact logging is very useful if you need to see for example generated photos in real time, and stop training in the middle, or if you don't already have a data versioning system set up. By the way DAGsHub let's you combine both very easily.

7 upvotes·1.7K views

Mike Moynihan

AE at Iterative·Apr 13, 2022

Recommends

DVC

Hey Hamid - I'm on the DVC team and I'm glad I randomly came across this.

We actually just released an experiment tracking tool that integrates seamlessly with DVC. If you want to see how it works please reach out! mikem@iterative.ai

1 upvote·175 views