Site Reliability Engineer

New York, NY 10018

Industry: Dev-Ops/SRE
Job Number: 4377

Location: New York

Who We’re Looking For:


We are looking for versatile, motivated people who are passionate about software/data engineering, automation, and building solutions.

Our Stack:
  • Docker & Kubernetes - kops, EKS
  • Python
  • Linux
  • Terraform
  • AWS - EC2, ELB, S3, ECS, RDS, Redshift
  • CircleCI
  • Postgres
  • Airflow
  • Monitoring (ELK, Prometheus, Grafana, New Relic, Sentry)
Your responsibilities would include:
  • Creating, configuring and maintaining cloud-based infrastructure and services for the rapid development and monitoring of complex data science and analytics applications
  • Building really cool stuff! We are constantly engineering and automating to improve reliability and stability of our applications (and to minimize repetitive work)
  • Being an active member of the larger software engineering team, helping to improve the organization's SDLC process and minimize the time from code-complete to production
  • Answering the call - responding to pages and alerts as needed to restore services

What you will bring to the table:
  • 2-5 years of experience as a Software/DevOps/Site Reliability Engineer
  • Solid Bash/Linux skills and proficiency in at least one high-level language (we like Python)
  • Comfort with containerized development using Docker and a modern container orchestrator (ECS, Kubernetes, Swarm)
  • Proficiency building, configuring and maintaining cloud resources, particularly in AWS
  • Experience with a modern datastore (either RDBMS or NoSQL) at medium-large scale (Postgres, MySQL, Mongo)
  • Ability to troubleshoot errors and outages across the stack and identify root causes and solutions
  • Measured opinions on system architecture and app design backed by sound logic and experience


Rachel Newell
