Sourav Aikat
souravaikat
3 points
Tools souravaikat is Following
Apache Impala
impala.io
Impala is a modern, open source, MPP SQL query engine for Apache Hadoop. Impala is shipped by Cloudera, Map...
Oracle
oracle.com/us/products/data...
Oracle Database is an RDBMS. An RDBMS that implements object-oriented features such as user-defined types, ...
Hadoop
hadoop.apache.org
The Apache Hadoop software library is a framework that allows for the distributed processing of large data ...
Hue
gethue.com
It is open source and lets regular users import their big data, query it, search it, visualize it and build...
Apache Hive
hive.apache.org
Hive facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. S...
Apache Spark
spark.apache.org
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters th...
Shell
en.wikipedia.org/wiki/Shell...
A shell is a text-based terminal, used for manipulating programs and files. Shell scripts typically manage ...
PySpark
spark.apache.org/docs/2.2.0...
It is the collaboration of Apache Spark and Python. It is a Python API for Spark that lets you harness the ...
pyspark
github.com/apache/spark/tre...
Apache Spark Python API.