Need advice about which tool to choose?Ask the StackShare community!
Kubernetes vs Yarn: What are the differences?
Introduction
This markdown provides a comparison between Kubernetes and Yarn, highlighting the key differences between them.
Scalability and Cluster Management: Kubernetes is an open-source container orchestration platform that focuses on automating deployment, scaling, and management of containerized applications. It provides efficient cluster management capabilities, allowing seamless scaling of applications and allocation of resources based on demand. On the other hand, Yarn is a resource management framework that efficiently manages resources in a distributed computing cluster. It is primarily used for managing and scheduling resources for Apache Hadoop applications.
Container Orchestration vs. Resource Management: Kubernetes primarily focuses on container orchestration, providing features like automated scaling, rolling deployments, service discovery, and load balancing. It offers comprehensive support for deploying and managing complex containerized applications across multiple nodes. In contrast, Yarn primarily focuses on resource management in a distributed cluster. It efficiently schedules resources and manages the allocation, tracking, and monitoring of applications running on Hadoop.
Supported Workloads: Kubernetes is designed to support a wide range of workloads, including stateless and stateful applications, batch processing, data processing, and machine learning. It provides a flexible and extensible platform for deploying diverse types of applications. Yarn, on the other hand, is specifically optimized for running big data workloads on Hadoop. It provides fine-grained resource allocation and isolation for execution engines like MapReduce, Apache Spark, and Apache Hive.
Community and Ecosystem: Kubernetes has a vast and vibrant community with a rich ecosystem of tools, plugins, and integrations. It is widely adopted and supported by major cloud providers, making it a popular choice for container orchestration. Yarn, being an integral part of the Apache Hadoop ecosystem, also benefits from a large community and a wide range of supported tools. However, its ecosystem is more focused on big data processing and analytics.
Architecture and Design: Kubernetes follows a distributed architecture where each node has its own Kubernetes components (e.g., API server, scheduler, controller-manager), and the control plane components communicate with each other to manage the cluster. Yarn, on the other hand, follows a master-slave architecture where the ResourceManager manages the cluster's resources and the NodeManagers run tasks on individual nodes. The ResourceManager and NodeManagers communicate using the Hadoop Distributed File System (HDFS).
Flexibility and Extensibility: Kubernetes provides a highly modular and extensible architecture, allowing users to customize and extend the platform according to their specific requirements. It supports a wide range of plugins and extensions for networking, storage, security, and monitoring. Yarn also offers some level of flexibility and extensibility, but it is more tightly integrated with the Apache Hadoop ecosystem, limiting the options for customization and extension.
In summary, Kubernetes is a container orchestration platform that focuses on automating deployment and management of containerized applications with support for various workloads, while Yarn is a resource management framework optimized for running big data workloads on Hadoop. Kubernetes provides scalable cluster management capabilities and a flexible architecture, while Yarn is more tightly integrated with the Apache Hadoop ecosystem and offers fine-grained resource allocation for big data processing.
From a StackShare Community member: “I’m a freelance web developer (I mostly use Node.js) and for future projects I’m debating between npm or Yarn as my default package manager. I’m a minimalist so I hate installing software if I don’t need to- in this case that would be Yarn. For those who made the switch from npm to Yarn, what benefits have you noticed? For those who stuck with npm, are you happy you with it?"
We use Yarn because it allows us to more simply manage our node_modules. It also simplifies commands and increases speed when installing modules. Our teams module download time was cut in half after switching from NPM to Yarn. We now require all employees to use Yarn (to prevent errors with package-lock.json and yarn.lock).
I use npm since new version is pretty fast as well (Yarn may be still faster a bit but the difference isn't huge). No need for other dependency and mainly Yarn sometimes do not work. Sometimes when I want to install project dependencies I got error using Yarn but with npm everything is installed correctly.
p.s.
I am not sure about the performance of the latest version of npm, whether it is different from my understanding of it below. Because I use npm very rarely when I had the following knowledge.
------⏬
I use Yarn because, first, yarn is the first tool to lock the version. Second, although npm also supports the lock version, when you use npm to lock the version, and then use package-lock.json on other systems, package-lock.json Will be modified. You understand what I mean, when you deploy projects based on Git...
I use npm because I also mainly use React and TypeScript. Since several typings (from DefinitelyTyped) depend on the React typings, Yarn tends to mess up which leads to duplicate libraries present (different versions of the same type definition), which hinders the Typescript compiler. Npm always resolves to a single version per transitive dependency. At least that's my experience with both.
As far as I know Yarn is a super module of NPM. But it still needs npm to run.
Yarn was developed by Facebook's guys to fix some npm issues and performance.
If you use the last version of npm most of this problem does not exist anymore.
You can choose the option which makes you more confortable. I like using yarn because I'm used to it.
In the end the packages will be the same. Just try both and choose the one you feel more confortable. :)
I am a minimalist too. I once had issues with installing Nuxt.js using NPM so I had to install Yarn but I also found that the Dev experience was much better
I use npm because its packaged with node installation and handles npm tokens in CI/CD tools for private packages/libraries.
We tend to stick to npm, yarn is only a fancy alternative, not 10x better. Using a self -hosted private repository (via sinopia/npm-mirror) make package locking (mostly) pointless.
Yarn made it painless for the team to sync on versions of packages that we use on the project <3
I use Yarn because it outputs nice progress messages with cute emoji and installs packages quickly if the package is cached. Also, Yarn creates yarn.lock
file which makes the developer use the consistent environment.
I use npm because its the official package manager for Node. It's reliability, security and speed has increased over time so the battle is over!
I use Yarn because it process my dependencies way faster, predictable deps resolution order, upgrade-interactive is very handy + some Yarn specific features (workspaces, Plug’n’Play alternative installation strategy) ...
I use npm because it has a lot of community support and the performance difference with alternative tool is not so significant for me.
You should use whichever had the best DX (developer experience) for your team. If you are doing a massive front-end project, consider yarn if not only because it makes it a snap to go from zero to ready. What some people say about npm
being more stable or easier for smaller projects is highly true as well. (not to mention, you sometimes have to install yarn) But, note that official NodeJS Docker images ship with both npm and yarn. If you want to use yarn, put package-lock=false
and optionally save-exact=true
in your project's .npmrc
file. Compare whether you prefer the ergonomics of yarn global add
over npm install -g
or see fewer meaningless warnings for the specific set of dependencies you leverage.
Our whole DevOps stack consists of the following tools:
- GitHub (incl. GitHub Pages/Markdown for Documentation, GettingStarted and HowTo's) for collaborative review and code management tool
- Respectively Git as revision control system
- SourceTree as Git GUI
- Visual Studio Code as IDE
- CircleCI for continuous integration (automatize development process)
- Prettier / TSLint / ESLint as code linter
- SonarQube as quality gate
- Docker as container management (incl. Docker Compose for multi-container application management)
- VirtualBox for operating system simulation tests
- Kubernetes as cluster management for docker containers
- Heroku for deploying in test environments
- nginx as web server (preferably used as facade server in production environment)
- SSLMate (using OpenSSL) for certificate management
- Amazon EC2 (incl. Amazon S3) for deploying in stage (production-like) and production environments
- PostgreSQL as preferred database system
- Redis as preferred in-memory database/store (great for caching)
The main reason we have chosen Kubernetes over Docker Swarm is related to the following artifacts:
- Key features: Easy and flexible installation, Clear dashboard, Great scaling operations, Monitoring is an integral part, Great load balancing concepts, Monitors the condition and ensures compensation in the event of failure.
- Applications: An application can be deployed using a combination of pods, deployments, and services (or micro-services).
- Functionality: Kubernetes as a complex installation and setup process, but it not as limited as Docker Swarm.
- Monitoring: It supports multiple versions of logging and monitoring when the services are deployed within the cluster (Elasticsearch/Kibana (ELK), Heapster/Grafana, Sysdig cloud integration).
- Scalability: All-in-one framework for distributed systems.
- Other Benefits: Kubernetes is backed by the Cloud Native Computing Foundation (CNCF), huge community among container orchestration tools, it is an open source and modular tool that works with any OS.
As we have to build the application for many different TV platforms we want to split the application logic from the device/platform specific code. Previously we had different repositories and it was very hard to keep the development process when changes were done in multiple repositories, as we had to synchronize code reviews as well as merging and then updating the dependencies of projects. This issues would be even more critical when building the project from scratch what we did at Joyn. Therefor to keep all code in one place, at the same time keeping in separated in different modules we decided to give a try to monorepo. First we tried out lerna which was fine at the beginning, but later along the way we had issues with adding new dependencies which came out of the blue and were not easy to fix. Next round of evolution was yarn workspaces, we are still using it and are pretty happy with dev experience it provides. And one more advantage we got when switched to yarn workspaces that we also switched from npm to yarn what improved the state of the lock file a lot, because with npm package-lock file was updated every time you run npm install
, frequent updates of package-lock file were causing very often merge conflicts. So right now we not just having faster dependencies installation time but also no conflicts coming from lock file.
This was no real choice - we switched the moment Yarn was available, and never looked back. Yarn is the only reasonable frontend package manager that's actually being developed. They even aim to heal the node_modules madness with v2! Npm is just copying its ideas on top of introducing massive bugs with every change.
Pros of Kubernetes
- Leading docker container management solution166
- Simple and powerful129
- Open source107
- Backed by google76
- The right abstractions58
- Scale services25
- Replication controller20
- Permission managment11
- Supports autoscaling9
- Simple8
- Cheap8
- Self-healing6
- Open, powerful, stable5
- Reliable5
- No cloud platform lock-in5
- Promotes modern/good infrascture practice5
- Scalable4
- Quick cloud setup4
- Custom and extensibility3
- Captain of Container Ship3
- Cloud Agnostic3
- Backed by Red Hat3
- Runs on azure3
- A self healing environment with rich metadata3
- Everything of CaaS2
- Gke2
- Golang2
- Easy setup2
- Expandable2
- Sfg2
Pros of Yarn
- Incredibly fast85
- Easy to use22
- Open Source13
- Can install any npm package11
- Works where npm fails8
- Workspaces7
- Incomplete to run tasks3
- Fast2
Sign up to add or upvote prosMake informed product decisions
Cons of Kubernetes
- Steep learning curve16
- Poor workflow for development15
- Orchestrates only infrastructure8
- High resource requirements for on-prem clusters4
- Too heavy for simple systems2
- Additional vendor lock-in (Docker)1
- More moving parts to secure1
- Additional Technology Overhead1
Cons of Yarn
- 16
- Sends data to facebook7
- Should be installed separately4
- Cannot publish to registry other than npm3