lakeFS logo

lakeFS

Open source data version control system for data lakes
2
3
+ 1
37

What is lakeFS?

It is an open-source data version control system for data lakes. It provides a “Git for data” platform enabling you to implement best practices from software engineering on your data lake, including branching and merging, CI/CD, and production-like dev/test environments.
lakeFS is a tool in the Big Data Tools category of a tech stack.
lakeFS is an open source tool with 4.2K GitHub stars and 339 GitHub forks. Here’s a link to lakeFS's open source repository on GitHub

Who uses lakeFS?

lakeFS Integrations

Python, Amazon S3, Kafka, Airflow, and Presto are some of the popular tools that integrate with lakeFS. Here's a list of all 18 tools that integrate with lakeFS.
Pros of lakeFS
2
Full reproducibility
2
Easy integration with other tools
2
Cloud agnostic
2
Scalability
2
Open Source
2
Format agnostic
2
Highly Scalable
2
Inexpensive
2
Available On prem
2
Doesn't require local copies of the data
2
Easy to use
2
Big Data Scale
2
Strong Team
2
Scales to big data
2
Cloud agnostics
2
Supports unstructured data
2
SaaS
1
Highly performant
1
Great Git integration
1
Supports both data engineering and data science

lakeFS's Features

  • Zero copy version management
  • Any data formats: structured, unstructured, open table, etc
  • Scales to Petabytes and millions of objects with negligible performance impact
  • Seamless integration with all your data stack

lakeFS Alternatives & Comparisons

What are some alternatives to lakeFS?
JavaScript
JavaScript is most known as the scripting language for Web pages, but used in many non-browser environments as well such as node.js or Apache CouchDB. It is a prototype-based, multi-paradigm scripting language that is dynamic,and supports object-oriented, imperative, and functional programming styles.
Git
Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency.
GitHub
GitHub is the best place to share code with friends, co-workers, classmates, and complete strangers. Over three million people use GitHub to build amazing things together.
Python
Python is a general purpose programming language created by Guido Van Rossum. Python is most praised for its elegant syntax and readable code, if you are just beginning your programming career python suits you best.
jQuery
jQuery is a cross-platform JavaScript library designed to simplify the client-side scripting of HTML.
See all alternatives
Related Comparisons
No related comparisons found

lakeFS's Followers
3 developers follow lakeFS to keep up with related blogs and decisions.