What is Embulk?
It is an open-source bulk data loader that helps data transfer between various databases, storages, file formats, and cloud services.
Embulk is a tool in the API Tools category of a tech stack.
Embulk is an open source tool with 1.7K GitHub stars and 200 GitHub forks. Here’s a link to Embulk's open source repository on GitHub
Who uses Embulk?
Companies
8 companies reportedly use Embulk in their tech stacks, including Repro, Radiotalk, and SPACEMARKET.
Developers
17 developers on StackShare have stated that they use Embulk.
Embulk's Features
- Automatic guessing of input file formats
- Parallel & distributed execution to deal with big data sets
- Transaction control to guarantee All-or-Nothing
- Resuming
- Plugins released on RubyGems.org
Embulk Alternatives & Comparisons
What are some alternatives to Embulk?
Fluentd
Fluentd collects events from various data sources and writes them to files, RDBMS, NoSQL, IaaS, SaaS, Hadoop and so on. Fluentd helps you unify your logging infrastructure.
Sqoop
It is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastores such as relational databases of The Apache Software Foundation
Logstash
Logstash is a tool for managing events and logs. You can use it to collect logs, parse them, and store them for later use (like, for searching). If you store them in Elasticsearch, you can view and analyze them with Kibana.
JavaScript
JavaScript is most known as the scripting language for Web pages, but used in many non-browser environments as well such as node.js or Apache CouchDB. It is a prototype-based, multi-paradigm scripting language that is dynamic,and supports object-oriented, imperative, and functional programming styles.
Git
Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency.