Jim Nguyen
jimnguyen
3 points
Tools jimnguyen is Following
Amazon S3
aws.amazon.com/s3
Amazon Simple Storage Service provides a fully redundant data storage infrastructure for storing and retrie...
Amazon EMR
aws.amazon.com/elasticmapre...
It is used in a variety of applications, including log analysis, data warehousing, machine learning, financ...
Apache Spark
spark.apache.org
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters th...
Amazon Athena
aws.amazon.com/athena
Amazon Athena is an interactive query service that makes it easy to analyze data in Amazon S3 using standar...
AWS Glue
aws.amazon.com/glue
A fully managed extract, transform, and load (ETL) service that makes it easy for customers to prepare and ...
Databricks
databricks.com
Databricks Unified Analytics Platform, from the original creators of Apache Spark™, unifies data science an...
Delta Lake
delta.io
An open-source storage layer that brings ACID transactions to Apache Spark™ and big data workloads.
AWS Lake Formation
docs.aws.amazon.com/lake-fo...
It is a fully managed service that makes it easier for you to build, secure, and manage data lakes. It simp...