PySpark

PySpark

Application and Data / Libraries / Data Science Tools
Data Engineer at Tata Consultancy Services·

I have to collect different data from multiple sources and store them in a single cloud location. Then perform cleaning and transforming using PySpark, and push the end results to other applications like reporting tools, etc. What would be the best solution? I can only think of Azure Data Factory + Databricks. Are there any alternatives to #AWS services + Databricks?

READ MORE
4 upvotes·243.6K views