Tech Brand Mgr, Office of CTO at Uber·

Why we built Marmaray, an open source generic data ingestion and dispersal framework and library for Apache Hadoop :

Built and designed by our Hadoop Platform team, Marmaray is a plug-in-based framework built on top of the Hadoop ecosystem. Users can add support to ingest data from any source and disperse to any sink leveraging the use of Apache Spark . The name, Marmaray, comes from a tunnel in Turkey connecting Europe and Asia. Similarly, we envisioned Marmaray within Uber as a pipeline connecting data from any source to any sink depending on customer preference:

https://eng.uber.com/marmaray-hadoop-ingestion-open-source/

(Direct GitHub repo: https://github.com/uber/marmaray Kafka Kafka Manager )

READ LESS
Marmaray: An Open Source Generic Data Ingestion and Dispersal Framework and Library for Apache Hadoop | Uber Engineering Blog (eng.uber.com)
7 upvotes·2.9M views
Avatar of Conor Myhrvold

Conor Myhrvold

Tech Brand Mgr, Office of CTO at Uber