Responsibilities
- Develop, design, create, modify, and/or test software services to ensure optimal performance and capacity for growth.
- Own back-end infrastructure components to ensure services run without incident.
- Write and review code, develop documentation and capacity plans, and debug the problems in real time in highly complex software systems.
- Design and operate widescale services according to strict reliability, security, and performance requirements.
- Serve as an escalation contact for service incidents.
- Work on problems of moderate scope where analysis of situations or data requires a review of a variety of factors.
- Exercise judgment within defined procedures and practices to determine appropriate action.
- Collaborate on and/or drive consensus within projects spanning multiple job roles and teams.
Minimum Qualifications
- Bachelor's degree in Computer Science, Engineering, Information Systems, Analytics, Mathematics, Physics, Applied Sciences, or a related field and 1 year of experience in the job offered or a related occupation
- Requires 1 year of experience in the following areas:
- Developing, managing, and debugging large (million level) fan-out, high throughput distributed systems
- Familiarity with distributed coordination systems like Apache Zookeeper
- Coding in higher-level languages (e.g., Python, C++)
- Managing and maintaining widely distributed binaries
- Proficiency in Infrastructure as Code (IaC), including utilizing tooling like Chef
- CI/CD utilization and implementation
- Dependency management of software systems
- Recoverability assessment/testing of low dependency services
- Experience in upholding security standards of distributed systems
- Internet service architectures (such as load balancing, LAMP, or CDN’s)
- Configuring and maintaining applications using at least one of the following: web servers, load balancers, relational databases, storage systems, or messaging systems
- Network protocols including at least one of the following: NFS, DHCP, NTP, SSH, DNS, or SNMP Relational Databases including MySQL
- High performance embedded database like rocksdb
- Network Management tools like DHCP, NTP, SSH, DNS, or SNMP
- Experience utilizing high performance query engines (Presto or Spark) for big data
- RPC framework such as Apache Thrift
- Algorithms and data structures
- Asynchronous programming
- The ability to lead and drive projects across multiple teams
- UNIX or Linux operating system fundamentals
- TCP/IP network fundamentals
- Software frameworks and APIs
- Diagnosing and troubleshooting issues ranging from low-level hardware issues to large scale failures within datacenter clusters
Learn more about this Employer on their Career Site
