The Sr. Software Engineer operates with a high degree of independence and draws on multiple functional and technology skill sets to design, develop, test, and implement resilient software solutions. The Sr. Software Engineer helps lead the team through consistent software development best practices and mentors and guides junior engineers, fostering their technical growth.
What you'll do:
- Experience working with large-scale distributed systems, including scalability, disaster recovery, and fault tolerance.
- Expertise in Python scripting.
- Define, implement, and own SLIs, SLOs, and error budgets for critical microservices in collaboration with product and engineering teams.
- Use error budgets to influence release decisions, prioritize reliability work, and manage operational risk.
- Design and maintain observability platforms including metrics, logs, traces, and real-time telemetry.
- Track, manage, and reduce operational toil by converting repetitive operational work into Jira stories and epics with clear ownership and measurable outcomes.
- Design, implement, and validate resiliency mechanisms such as graceful degradation, redundancy, automated failover, and disaster recovery.
- Lead incident response, act as an escalation point for high-severity incidents, and drive blameless postmortems.
- Partner with scrum teams to improve reliability through release readiness reviews, production change validation, and testing strategies.
- Capture incident action items and reliability improvements in Jira, ensuring closure, accountability, and continuous improvement.
- Perform deep root cause analysis, debugging, and performance tuning across distributed systems.
- Provide technical leadership and mentoring to junior SREs and engineers.
- Promote shift-left reliability by embedding operability, monitoring, and failure testing early in the SDLC.
- Strong knowledge of CI/CD pipelines, Git, and AWS/Azure/GCP as PaaS offerings.
- Demonstrated knowledge of configuration management and deployment automation tools.
- Strong experience with networking concepts and protocols (HTTP, HTTPS, Telnet, SSH, firewalls, VPNs, routing, and load balancing).
- Strong experience with Linux.
- Experience with monitoring solutions such as Prometheus and Grafana, and with products such as ELK and Splunk.
- Experience working with large-scale systems.
- Experience with containers and orchestration technologies such as Docker and Kubernetes.
- Experience with a service mesh such as Istio is an added advantage.
- Experience with a CDN such as Akamai.
What you'll bring:
- Bachelor's Degree in Computer Science or related technical field.
- 4+ years of experience in SRE, software engineering, or production operations supporting large-scale eCommerce platforms.
- Hands-on experience with Java/J2EE-based distributed systems. React experience is a plus.
- Proven ability to design and operate systems using SLO-driven reliability models.
- Experience defining and measuring SLIs (availability, latency, error rates, throughput, saturation).
- Good understanding of NoSQL technologies and RDBMS, including the ability to write queries to fetch results from a database.
- Experience deploying and operating services on cloud platforms (AWS, Azure, or Google Cloud).
- Expertise with observability, APM, and caching tools (Dynatrace, Splunk, ELK, Akamai, QuantumMetric/Tealeaf, etc.).
- Strong experience using Jira for backlog management, incident follow-ups, toil reduction tracking, and cross-team coordination.
- Ability to independently own services and drive reliability initiatives end-to-end.
- Strong communication skills and ability to influence engineering and product teams.
- Experience participating in an on-call rotation and handling critical and high-severity incidents.
O’Reilly Auto Parts has a proven track record of growth and stability. O’Reilly is full of successful career stories and believes in a strong promote-from-within philosophy, encouraging you to grow your career along with the organization.
Total Compensation Package:
Competitive Wages & Paid Time Off
Stock Purchase Plan & 401k with Employer Contributions Starting Day One
Medical, Dental, & Vision Insurance with Optional Flexible Spending Account (FSA)
Team Member Health/Wellbeing Programs
Tuition Educational Assistance Programs
Opportunities for Career Growth
O’Reilly Auto Parts is an equal opportunity employer. The Company does not discriminate on the basis of race, religion, color, national origin or ancestry (including immigration status or citizenship), sex, sexual orientation, gender identity, pregnancy (including childbirth, lactation, and related medical conditions), age (40 and over), veteran status, uniformed service member status, physical or mental disability, genetic information (including testing or characteristics), or another protected status as defined by local, state, or federal law, as applicable.
Qualified individuals with a disability may be entitled to reasonable accommodation under the Americans with Disabilities Act. If you require a reasonable accommodation during the application or employment process, please send an email to: rar@oreillyauto.com or call (800) 471-7431 option , and provide your requested accommodation, and position details.
