x x
sklanncz
Recent Tech Decisions
6 points
Tools sklanncz is Following
MemCachier
memcachier.com
MemCachier provides an easy and powerful managed caching solution for all your performance and scalability ...
Amazon DynamoDB
aws.amazon.com/dynamodb
With it , you can offload the administrative burden of operating and scaling a highly available distributed...
Google BigQuery
cloud.google.com/bigquery/w...
Run super-fast, SQL-like queries against terabytes of data in seconds, using the processing power of Google...
Apache Impala
impala.io
Impala is a modern, open source, MPP SQL query engine for Apache Hadoop. Impala is shipped by Cloudera, Map...
Redis
redis.io
Redis is an open source (BSD licensed), in-memory data structure store, used as a database, cache, and mess...
Cassandra
cassandra.apache.org
Partitioning means that Cassandra can distribute your data across multiple machines in an application-trans...
CouchDB
couchdb.apache.org
Apache CouchDB is a database that uses JSON for documents, JavaScript for MapReduce indexes, and regular HT...
Hadoop
hadoop.apache.org
The Apache Hadoop software library is a framework that allows for the distributed processing of large data ...
RabbitMQ
rabbitmq.com
RabbitMQ gives your applications a common platform to send and receive messages, and your messages a safe p...
Kafka
kafka.apache.org
Kafka is a distributed, partitioned, replicated commit log service. It provides the functionality of a mess...
Couchbase
couchbase.com
Developed as an alternative to traditionally inflexible SQL databases, the Couchbase NoSQL database is buil...
Mongoose
mongoosejs.com
Let's face it, writing MongoDB validation, casting and business logic boilerplate is a drag. That's why we ...
InfluxDB
influxdb.com
InfluxDB is a scalable datastore for metrics, events, and real-time analytics. It has a built-in HTTP API s...
Sematext
sematext.com
Sematext pulls together performance monitoring, logs, user experience and synthetic monitoring that tools o...
Apache Storm
storm.apache.org
Apache Storm is a free and open source distributed realtime computation system. Storm makes it easy to reli...
Solr
solr.apache.org
Solr is the popular, blazing fast open source enterprise search platform from the Apache Lucene project. It...
MariaDB
mariadb.com
Started by core members of the original MySQL team, MariaDB actively works with outside developers to deliv...
Kibana
elastic.co/kibana
Kibana is an open source (Apache Licensed), browser based analytics and search dashboard for Elasticsearch....
Apache Spark
spark.apache.org
Spark is a fast and general processing engine compatible with Hadoop data. It can run in Hadoop clusters th...
Presto
prestodb.io
Distributed SQL Query Engine for Big Data
Citus
citusdata.com
It's an extension to Postgres that distributes data and queries in a cluster of multiple machines. Its quer...
Apache Flink
flink.apache.org
Apache Flink is an open source system for fast and versatile data analytics in clusters. Flink supports bat...
Google Cloud Big...
cloud.google.com/bigtable
Google Cloud Bigtable offers you a fast, fully managed, massively scalable NoSQL database service that's id...
Snowplow
snowplowanalytics.com
Snowplow is a real-time event data pipeline that lets you track, contextualize, validate and model your cus...
Airflow
airbnb.io/projects/airflow
Use Airflow to author workflows as directed acyclic graphs (DAGs) of tasks. The Airflow scheduler executes ...
Apache Zeppelin
zeppelin.incubator.apache.org
A web-based notebook that enables interactive data analytics. You can make beautiful data-driven, interacti...
Apache Kudu
kudu.apache.org
A new addition to the open source Apache Hadoop ecosystem, Kudu completes Hadoop's storage layer to enable ...
Druid
druid.io
Druid is a distributed, column-oriented, real-time analytics data store that is commonly used to power expl...
Apache Parquet
parquet.apache.org
It is a columnar storage format available to any project in the Hadoop ecosystem, regardless of the choice ...
Apache Flume
flume.apache.org
It is a distributed, reliable, and available service for efficiently collecting, aggregating, and moving la...
SQLdep
sqldep.com
SQLdep is a cloud service generating data-lineage from SQL code or stored procedures. By transforming SQL c...
Sqoop
sqoop.apache.org
It is a tool designed for efficiently transferring bulk data between Apache Hadoop and structured datastore...
Apache RocketMQ
rocketmq.incubator.apache.org
Apache RocketMQ is a distributed messaging and streaming platform with low latency, high performance and re...
Apache Beam
beam.incubator.apache.org
It implements batch and streaming data processing jobs that run on any execution engine. It executes pipeli...
Apache Pulsar
pulsar.apache.org
Apache Pulsar is a distributed messaging solution developed and released to open source at Yahoo. Pulsar su...
Snowflake
snowflake.net
Snowflake eliminates the administration and management demands of traditional data warehouses and big data ...
Apache Ignite
ignite.apache.org
It is a memory-centric distributed database, caching, and processing platform for transactional, analytical...
Azure HDInsight
azure.microsoft.com/en-us/s...
It is a cloud-based service from Microsoft for big data analytics that helps organizations process large am...
TimescaleDB
timescale.com
TimescaleDB: An open-source database built for analyzing time-series data with the power and convenience ...
ScyllaDB
scylladb.com
ScyllaDB is the database for data-intensive apps that require high performance and low latency. It enables ...
JanusGraph
janusgraph.org
It is a scalable graph database optimized for storing and querying graphs containing hundreds of billions o...
RedisGraph
redisgraph.io
RedisGraph is a graph database developed from scratch on top of Redis, using the new Redis Modules API to e...
Apache Kylin
kylin.apache.org
Apache Kylin™ is an open source Distributed Analytics Engine designed to provide SQL interface and multi-di...
Dremio
dremio.com
Dremio—the data lake engine, operationalizes your data lake storage and speeds your analytics processes wit...
XGBoost
xgboost.ai
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scal...
YugabyteDB
yugabyte.com
An open-source, high-performance, distributed SQL database built for resilience and scale. Re-uses the uppe...
Fauna
fauna.com
Escape the boundaries imposed by legacy databases with a data API that is simple to adopt, highly productiv...
Knowage
knowage-suite.com/site/home
It is composed of several modules, each one conceived for a specific analytical domain. They can be used in...
Apache Pinot
pinot.apache.org
Apache Pinot is a fast, scalable real-time analytics database. It is a column-oriented distributed Online ...
cnvrg.io
cnvrg.io
It is an AI OS, transforming the way enterprises manage, scale and accelerate AI and data science developme...
KNIME
knime.com
It is a free and open-source data analytics, reporting and integration platform. KNIME integrates various c...
IBM Db2 Big SQL
ibm.com/products/db2-big-sql
It is a hybrid SQL-on-Hadoop engine delivering advanced, security-rich data query across enterprise big dat...
TileDB
tiledb.com
TileDB offers a data engine that makes data management and compute fast, easy and universal. Manage, store,...
Kestra
kestra.io
It is an infinitely scalable orchestration and scheduling platform, creating, running, scheduling, and moni...
Apache Iceberg
iceberg.apache.org
It is a high-performance format for huge analytic tables. It brings the reliability and simplicity of SQL t...
Flyte
flyte.org
It is an open-source, Kubernetes-native workflow orchestrator implemented in Go. It enables highly concurre...