Decision at Uber Technologies about Kafka Manager, Kafka, GitHub, Apache Spark, Hadoop

Avatar of conor
Tech Brand Mgr, Office of CTO at Uber ·

Why we built Marmaray, an open source generic data ingestion and dispersal framework and library for Apache Hadoop :

Built and designed by our Hadoop Platform team, Marmaray is a plug-in-based framework built on top of the Hadoop ecosystem. Users can add support to ingest data from any source and disperse to any sink leveraging the use of Apache Spark . The name, Marmaray, comes from a tunnel in Turkey connecting Europe and Asia. Similarly, we envisioned Marmaray within Uber as a pipeline connecting data from any source to any sink depending on customer preference:

https://eng.uber.com/marmaray-hadoop-ingestion-open-source/

(Direct GitHub repo: https://github.com/uber/marmaray Kafka Kafka Manager )

3 upvotes·74.5K views
Avatar of Conor Myhrvold

Conor Myhrvold

Tech Brand Mgr, Office of CTO at Uber