Senior Site Reliability Engineer
Company: Pager
Location: San Francisco
Posted on: January 26, 2023
Job Description:
How You Contribute To Our Vision: Key Responsibilities
- You partner with Engineering stakeholders to design and deliver
a reliable, scalable, secure, and performant platform.
- You continuously strive to improve the customer experience:
Full lifecycle support (creation, development, deployment,
retirement), observability, flexible connectivity, and
monitoring.
- You stay current on technology trends in order to suggest
innovative tools and approaches to interesting problems.
- You share your expertise with the entire Engineering
organization
- You participate in a 24/7 on-call rotation. And yes, we use
PagerDuty to manage our on-call schedules.
About You: Skills and Attributes
- You have solved multiple problems by writing code to automate
your way out of them and have a passion for replacing manual
processes time and time again with your code
- You have been responsible for running critical services that
multiple customers depend upon. You understand the importance and
impact that operational optimization can have on a product and the
positive ripple effects that it can have across an entire
organization
- You believe CI servers, push-button deploys, time-series
datastores, metrics dashboards, and centralized logging are not
just "nice to haves," they are critical pieces of infrastructure
that rapidly pay for themselves. You are familiar with the
tool-space and can suggest products in each of these
areas.
- You are empathetic: You take others' opinions into account and
clearly communicate your thoughts to reach technical solutions
quickly.
- You consider it important to understand and appreciate your
customers, and enjoy seeing your work improve the work of
others.
Minimum Requirements
- Experience in a dynamic language like Ruby or Python
- Experience working on cloud-native infrastructure (e.g. AWS,
GCP, Azure)
- Experience working with a container scheduler platform,
preferably Kubernetes
- Experience with cloud-native network services like Transit
Gateways, VPC configurations, and peering.
- Experience with load balancing (Nginx, NLB/ALB, etc.), traffic
ingress, and egress techniques.
Preferred Requirements
- Experience with infrastructure as code (Terraform or
CloudFormation)
- Experience with monitoring, observability and logging platforms
(e.g. DataDog, New Relic, SumoLogic, Splunk, etc.)
- Knowledge of configuration management systems like Ansible,
Chef, or Puppet
- Experience in automating releases, continuous
integration/delivery systems, and relevant tools (e.g., Jenkins,
CircleCI, Travis CI, Buildkite, etc.)
Keywords: Pager, San Francisco , Senior Site Reliability Engineer, Professions , San Francisco, California
Didn't find what you're looking for? Search again!
Loading more jobs...