Redwood City, CA

Data Infrastructure Site Reliability Engineer


WePay is the payments partner to the platform economy. At a time when commerce increasingly flows through online platforms, we partner closely with these platforms to provide fully integrated payments and risk services… so they can deliver the end-to-end user experiences they want without taking on the overhead they don’t want. We process billions annually for platforms including GoFundMe, Meetup, Care.com, FreshBooks, and Constant Contact.

We’re well funded and growing so much that we were recently named to the 2015 Inc. 500 list as the 62nd fastest growing private company in America and the 5th fastest growing company in the Silicon Valley. We're also proud to have cultivated an open, supportive culture that cares deeply about customers, employees, and technology. We now seek an extraordinary Site Reliability Engineers to help us reach new heights.

What will one work on?

While on the data infrastructure team you’ll work closely with DevOps & DI to manage the data pipeline, using MySQL, Airflow, Kafka, BigQuery, and Ansible, all hosted in the Google Cloud. Your work will be focused on designing, building and maintaining a high-performance, scalable, reliable infrastructure.

What impact will one have?

WePay’s data infrastructure is used by our entire company, as well as our customers, every day to power both internal and external products. You will help architect the entire operations architecture for WePay’s data pipeline and related data infrastructure. This includes how to configure, deploy, measure, and monitor the systems.

What should be expected in the interview?

We want to see your problem-solving and analytical skills. Be prepared to write good, clean, scalable code. You don’t need to know our entire stack, but we’re looking for practical experience, someone who can solve infrastructure and production problems in the cloud.

You may be a good fit if you have:

Experience debugging complex problems across the whole stack
Experience designing, building, and operating large-scale distributed systems
Experience with cloud services (Google Cloud, AWS, Azure, etc.)
Experience with Airflow, Azkaban, Luigi, Oozie, etc.
Experience with stream processing systems (Storm, Dataflow, Kafka streams, etc.)
Experience with Python, Java, Scala or Go
Experience with open-source databases (MySQL, Postgres, Cassandra, Redis, etc.)
Deployment using Ansible, Chef, Puppet, Salt, etc.

Our hiring process includes:

One phone interview with a team member
One technical phone interview
An on-site interview

Work with this stack