Sangwoo Shim
be4rpooh02
Data Engineer
|
6 points
Tools be4rpooh02 is Following
Amazon S3
aws.amazon.com/s3
Amazon Simple Storage Service provides a fully redundant data storage infrastructure for storing and retrie...
GitHub
github.com
GitHub is the best place to share code with friends, co-workers, classmates, and complete strangers. Over t...
Bitbucket
bitbucket.org
Bitbucket gives teams one place to plan projects, collaborate on code, test and deploy, all with free priva...
Jira
atlassian.com/software/jira
Jira's secret sauce is the way it simplifies the complexities of software development into manageable units...
Amazon EMR
aws.amazon.com/elasticmapre...
It is used in a variety of applications, including log analysis, data warehousing, machine learning, financ...
Docker
docker.com
The Docker Platform is the industry-leading container platform for continuous, high-velocity innovation, en...
Notepad++
notepad-plus-plus.org
Notepad++ is a free (as in "free speech" and also as in "free beer") source code editor and Notepad replace...
Jenkins
jenkins-ci.org
In a nutshell Jenkins CI is the leading open-source continuous integration server. Built with Java, it prov...
Confluence
atlassian.com/software/conf...
Capture the knowledge that's too often lost in email inboxes and shared network drives in Confluence instea...
VirtualBox
virtualbox.org
VirtualBox is a powerful x86 and AMD64/Intel64 virtualization product for enterprise as well as home use. N...
GitLab
about.gitlab.com
GitLab offers git repository management, code reviews, issue tracking, activity feeds and wikis. Enterprise...
Python
python.org
Python is a general purpose programming language created by Guido Van Rossum. Python is most praised for it...
Scala
scala-lang.org
Scala is an acronym for “Scalable Language”. This means that Scala grows with you. You can play with it by ...
PostgreSQL
postgresql.org
PostgreSQL is an advanced object-relational database management system that supports an extended subset of...
Hadoop
hadoop.apache.org
The Apache Hadoop software library is a framework that allows for the distributed processing of large data ...
Git
git-scm.com
Git is a free and open source distributed version control system designed to handle everything from small t...
IntelliJ IDEA
jetbrains.com/idea
Out of the box, IntelliJ IDEA provides a comprehensive feature set including tools and integrations with th...
PyCharm
jetbrains.com/pycharm
PyCharm’s smart code editor provides first-class support for Python, JavaScript, CoffeeScript, TypeScript, ...
Bamboo
atlassian.com/software/bamboo
Focus on coding and count on Bamboo as your CI and build server! Create multi-stage build plans, set up tri...
Sonatype Nexus
sonatype.com/nexus-reposito...
It is an open source repository that supports many artifact formats, including Docker, Java™ and npm. With ...
Apache Spark
spark.apache.org
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters th...
SBT
scala-sbt.org
It is similar to Java's Maven and Ant. Its main features are: Native support for compiling Scala code and i...
Airflow
airbnb.io/projects/airflow
Use Airflow to author workflows as directed acyclic graphs (DAGs) of tasks. The Airflow scheduler executes ...
Ubuntu
ubuntu.com
Ubuntu is an ancient African word meaning ‘humanity to others’. It also means ‘I am what I am because of wh...
Apache Zeppelin
zeppelin.incubator.apache.org
A web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interacti...
Jupyter
jupyter.org
The Jupyter Notebook is a web-based interactive computing platform. The notebook combines live code, equati...
Visual Studio Code
code.visualstudio.com
Build and debug modern web and cloud applications. Code is free and available on your favorite platform - L...
Minio
minio.io
Minio is an object storage server compatible with Amazon S3 and licensed under Apache 2.0 License
JFrog Artifactory
jfrog.com/artifactory
It integrates with your existing ecosystem supporting end-to-end binary management that overcomes the compl...
Superset
airbnb.io/projects/superset
Superset's main goal is to make it easy to slice, dice and visualize data. It empowers users to perform ana...
Ambari
ambari.apache.org
This project is aimed at making Hadoop management simpler by developing software for provisioning, managing...
PySpark
spark.apache.org/docs/2.2.0...
It is the collaboration of Apache Spark and Python. it is a Python API for Spark that lets you harness the ...
Linux
kernel.org
A clone of the operating system Unix, written from scratch by Linus Torvalds with assistance from a loosely...
pandas
pandas.pydata.org
Powerful data structures for data analysis, time series, and statistics.
selenium
github.com/SeleniumHQ/selenium
Python bindings for Selenium.
BeautifulSoup
crummy.com/software/Beautif...
Screen-scraping library.