Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency. | It is an open-source data integration platform that syncs data from applications, APIs, and databases to data warehouses, data lakes, and databases. |
| - | Scheduled updates; Manual full refresh; Real-time monitoring; Debugging autonomy; Optional normalized schemas; Full control over the data; Benefit from the long tail of connectors, and adapt them to your needs; Build connectors in the language of your choice, as they run in Docker containers |
Statistics | |
GitHub Stars 57.1K | GitHub Stars 20.0K |
GitHub Forks 26.9K | GitHub Forks 4.9K |
Stacks 343.7K | Stacks 105 |
Followers 184.2K | Followers 112 |
Votes 6.6K | Votes 5 |

Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google's infrastructure. Load data with ease: bulk load your data using Google Cloud Storage or stream it in. Access BigQuery through a browser tool, a command-line tool, or calls to the BigQuery REST API, with client libraries available for languages such as Java, PHP, and Python.

It is optimized for data sets ranging from a few hundred gigabytes to a petabyte or more and costs less than $1,000 per terabyte per year, a tenth the cost of most traditional data warehousing solutions.

Mercurial is dedicated to speed and efficiency with a sane user interface. It is written in Python. Mercurial's implementation and data structures are designed to be fast. You can generate diffs between revisions, or jump back in time within seconds.

Qubole is a cloud-based service that makes big data easy for analysts and data engineers.

It is used in a variety of applications, including log analysis, data warehousing, machine learning, financial analysis, scientific simulation, and bioinformatics.

Subversion exists to be universally recognized and adopted as an open-source, centralized version control system characterized by its reliability as a safe haven for valuable data; the simplicity of its model and usage; and its ability to support the needs of a wide variety of users and projects, from individuals to large-scale enterprise operations.

We run Apache Hadoop for you. We not only deploy Hadoop, we monitor, manage, fix, and update it for you. Then we take it a step further: we monitor your jobs, notify you when something’s wrong with them, and can help with tuning.

Snowflake eliminates the administration and management demands of traditional data warehouses and big data platforms. Snowflake is a true data warehouse as a service running on Amazon Web Services (AWS), with no infrastructure to manage and no knobs to turn.

Plastic SCM is a distributed version control system designed for big projects. It excels at branching and merging, offers graphical user interfaces, and can also handle large files and even file locking (great for game developers). It includes "semantic" features such as refactor detection to ease diffing of complex refactors.

Pijul is a free and open source (AGPL 3) distributed version control system. Its distinctive feature is that it is based on a sound theory of patches, which makes it easy to learn and use, and truly distributed.