SonicJobs Logo
Left arrow iconBack to search

Site Reliability Engineer — Info Apps

Apple
Posted 2 months ago, valid for 15 days
Location

Cupertino, Santa Clara 95015, CA

Salary

$110,000 - $132,000 per year

info
Contract type

Full Time

By applying, a Sonicjobs account will be created for you. Sonicjobs's Privacy Policy and Terms & Conditions will apply.

SonicJobs' Terms & Conditions and Privacy Policy also apply.

Sonic Summary

info
  • Apple is seeking a Site Reliability Engineer with a minimum of 5 years of experience in SRE, DevOps, or Infrastructure roles to oversee the performance and availability of core backend services.
  • The role involves designing and implementing telemetry systems, defining SLOs/SLIs, and promoting operational excellence through automation.
  • Candidates should have deep Kubernetes expertise, strong knowledge of observability tools like Prometheus and Grafana, and experience with public cloud providers such as AWS or GCP.
  • The ideal applicant will also possess scripting skills in Python or Go, and experience in leading incident responses and conducting Root Cause Analysis.
  • Salary details were not specified in the job description.
Do you love building and scaling infrastructure that delights millions of customers? At Apple, we believe reliability is a feature. We are looking for a Site Reliability Engineer to join our team in overseeing the performance and availability of our core backend services in News, Stocks, Weather, Books and Creator Studio applications. As a SRE, you won’t just be responding to alerts; you will be shaping the evolution of our observability strategy, a mentor for incident management, and a champion for automation. You will help us refine our "Golden Signals" and ensure our Kubernetes-based ecosystem remains world-class.

Description


In this role, you will be a key pillar of our engineering organization, ensuring that our services remain highly available and performant. Your impact will include: System Architecture: Designing and implementing the next generation of our telemetry and alerting systems. Reliability Engineering: Defining SLOs/SLIs and ensuring our monitoring strategy captures the true health of the user experience. Operational Excellence: Reducing operational load through software; if you have to do it twice, you’ll want to automate it. Collaboration: Partnering with App Dev teams to influence the "design for reliability" phase of the software development lifecycle. Mentorship: Acting as a technical lead for junior members and off-shore partners, providing guidance on runbook development and disaster recovery.

Minimum Qualifications


Experience: 5+ years in SRE, DevOps, or Infrastructure roles with a proven track record of managing high-traffic, internet-facing production environments. Kubernetes Expertise: Deep experience building and operating container orchestration systems (EKS/GKE/Vanilla K8s). You should be comfortable troubleshooting from the networking layer up to the application pod. Observability Champion: Expert knowledge of the 4 Golden Signals (Latency, Traffic, Errors, and Saturation). Proficiency with tools like Prometheus, Grafana, and Splunk is essential. Cloud Proficiency: Hands-on experience designing and maintaining resilient infrastructure on public cloud providers (AWS, GCP, or Azure). Scripting & Automation: Strong ability to code at a scripting level (Python or Go preferred) to automate toil and build self-healing systems. Incident Leadership: Experience leading incident response, performing Root Cause Analysis (RCA), and implementing blameless post-mortems to improve system resilience. Infrastructure as Code: Proficient in Terraform, CloudFormation, or Pulumi to manage immutable infrastructure.

Preferred Qualifications


Search & Data: Specialized experience operating and tuning Solr or Elasticsearch at scale. Networking: Strong understanding of TCP/IP, Load Balancing (ELB/ALB), and Service Mesh (Istio/Linkerd). Data Systems: Experience with Kafka, Cassandra, or Postgres in a distributed environment.



Learn more about this Employer on their Career Site

Apply now in a few quick clicks

By applying, a Sonicjobs account will be created for you. Sonicjobs's Privacy Policy and Terms & Conditions will apply.

SonicJobs' Terms & Conditions and Privacy Policy also apply.