Azure Data Factory vs Azure Databricks

Advice on Azure Data Factory and Azure Databricks
Vamshi Krishna
Data Engineer at Tata Consultancy Services · 4 upvotes · 169.7K views

I need to collect data from multiple sources and store it in a single cloud location, then clean and transform it using PySpark, and finally push the results to other applications such as reporting tools. What would be the best solution? I can only think of Azure Data Factory + Databricks. Are there any alternatives to #AWS services + Databricks?


What is Azure Data Factory?

Azure Data Factory is a service that lets developers integrate disparate data sources. It is somewhat like SSIS in the cloud: a platform for managing data that lives both on-premises and in the cloud.

What is Azure Databricks?

Accelerate big data analytics and artificial intelligence (AI) solutions with Azure Databricks, a fast, easy and collaborative Apache Spark–based analytics service.

What are some alternatives to Azure Data Factory and Azure Databricks?
Databricks
Databricks Unified Analytics Platform, from the original creators of Apache Spark™, unifies data science and engineering across the Machine Learning lifecycle from data preparation to experimentation and deployment of ML applications.
Azure Machine Learning
Azure Machine Learning is a fully-managed cloud service that enables data scientists and developers to efficiently embed predictive analytics into their applications, helping organizations use massive data sets and bring all the benefits of the cloud to machine learning.
Azure HDInsight
It is a cloud-based service from Microsoft for big data analytics that helps organizations process large amounts of streaming or historical data.
Apache Spark
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters through YARN or Spark's standalone mode, and it can process data in HDFS, HBase, Cassandra, Hive, and any Hadoop InputFormat. It is designed to perform both batch processing (similar to MapReduce) and new workloads like streaming, interactive queries, and machine learning.
Snowflake
Snowflake eliminates the administration and management demands of traditional data warehouses and big data platforms. Snowflake is a true data warehouse as a service running on Amazon Web Services (AWS)—no infrastructure to manage and no knobs to turn.