Compute SRE

Apple
Posted 2 months ago, valid for 6 days
Location

Cupertino, CA 95015, US

Salary

Competitive

Contract type

Full Time

By applying, a Sonicjobs account will be created for you. Sonicjobs's Privacy Policy and Terms & Conditions will apply.

Sonic Summary

  • Apple is seeking a Site Reliability Engineer to enhance the reliability, scalability, and observability of its cloud platform.
  • The position requires at least 1 year of experience in a Site Reliability Engineering role and a Bachelor's Degree in Computer Science or an engineering-related field, or equivalent related experience.
  • The role involves building and operating Apple’s Cloud Platform, automating core services, and ensuring uptime through well-architected systems.
  • Candidates should be proficient in programming languages such as Go, Python, or Java and familiar with Infrastructure as Code tools like Puppet or Terraform.
  • The salary is listed as competitive, and Apple describes the team as one of high trust and accountability.
As a Site Reliability Engineer at Apple, you will be responsible for driving the reliability, scalability, and observability of our cloud platform. Your work will ensure the uptime and performance of mission-critical systems that serve millions of users every day. We’re looking for a self-motivated engineer, committed to operational excellence and continuous improvement. You’ll work closely with developers and architects within the team to build and extend our platform, and you’ll be part of a rich fabric of people from many different disciplines, all invested in building the best cloud platform to run world-class services at scale. We’re building a team of high trust and accountability, and are searching for a like-minded individual who is excited to build foundational capabilities into Apple’s Cloud Platform!

Description


AS AN SRE AT APPLE YOU WILL:
  • Build, operate, and scale Apple’s Cloud Platform that powers mission-critical services across the globe.
  • Accelerate delivery of core services with automation and visibility into release cadences.
  • Collaborate with developers to build and release reliable software that manages the lifecycle of customer VMs.
  • Drive reliability and excellence of service through CI/CD, production readiness reviews, and incident response.
  • Instrument, analyze, and iterate on performance bottlenecks across distributed systems.
  • Actively participate in on-call rotations, capacity planning, scale testing, and disaster recovery exercises.
  • Ensure uptime SLOs with well-architected systems and rigorous observability.

Minimum Qualifications


  • Bachelor's Degree in Computer Science, an engineering-related field, or equivalent related experience.
  • 1+ years in a Site Reliability Engineering, infrastructure-focused role.
  • Proficiency in Go, Python, or Java.
  • Proficiency with Infrastructure as Code (IaC) tools like Puppet, Chef, Ansible, or Terraform.
  • Experience with cloud infrastructure and experience running businesses.
  • Experience in architecting, building, and running large-scale distributed systems.
  • Experience providing 24/7 on-call support and incident management for critical production infrastructure.
  • Ability to troubleshoot issues across the entire infrastructure stack (profiling, tracing, etc.).

Preferred Qualifications


  • Interpersonal and written communication skills, targeted to both technical and non-technical audiences.
  • Experience operating large-scale, multi-tenant infrastructure as a managed service.


