Ramesh Borukati
datastacks
3 points
Tools datastacks is Following
Amazon CloudFront
aws.amazon.com/cloudfront
Amazon CloudFront can be used to deliver your entire website, including dynamic, static, streaming, and int...
Amazon EC2
aws.amazon.com/ec2
It is a web service that provides resizable compute capacity in the cloud. It is designed to make web-scale...
Amazon S3
aws.amazon.com/s3
Amazon Simple Storage Service provides a fully redundant data storage infrastructure for storing and retrie...
Amazon Route 53
aws.amazon.com/route53
Amazon Route 53 is designed to give developers and businesses an extremely reliable and cost-effective way ...
Amazon RDS
aws.amazon.com/rds
Amazon RDS gives you access to the capabilities of a familiar MySQL, Oracle or Microsoft SQL Server databas...
Jenkins
jenkins-ci.org
In a nutshell, Jenkins CI is the leading open-source continuous integration server. Built with Java, it prov...
Python
python.org
Python is a general-purpose programming language created by Guido van Rossum. Python is most praised for it...
Scala
scala-lang.org
Scala is short for “Scalable Language”. This means that Scala grows with you. You can play with it by ...
MongoDB
mongodb.com
MongoDB stores data in JSON-like documents that can vary in structure, offering a dynamic, flexible schema....
Hadoop
hadoop.apache.org
The Apache Hadoop software library is a framework that allows for the distributed processing of large data ...
Kafka
kafka.apache.org
Kafka is a distributed, partitioned, replicated commit log service. It provides the functionality of a mess...
PyCharm
jetbrains.com/pycharm
PyCharm’s smart code editor provides first-class support for Python, JavaScript, CoffeeScript, TypeScript, ...
AWS Lambda
aws.amazon.com/lambda
AWS Lambda is a compute service that runs your code in response to events and automatically manages the und...
Apache Hive
hive.apache.org
Hive facilitates reading, writing, and managing large datasets residing in distributed storage using SQL. S...
Pandas
pandas.pydata.org
Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures si...
Apache Spark
spark.apache.org
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters th...
YARN Hadoop
hadoop.apache.org/docs/curr...
Its fundamental idea is to split up the functionalities of resource management and job scheduling/monitorin...
Apache Flume
flume.apache.org
It is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving la...
Sqoop
sqoop.apache.org
It is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastore...
Apache NiFi
nifi.apache.org
An easy to use, powerful, and reliable system to process and distribute data. It supports powerful and scal...