Responsibilities
- Develop, design, create, modify, and/or test software services to ensure optimal performance and capacity for growth.
- Own back-end data warehouse services, front-end services like Messenger and Newsfeed, and infrastructure components to ensure services run without incident.
- Write and review code, develop documentation and capacity plans, and debug problems in real time in highly complex software systems.
- Serve as an escalation contact for service incidents.
- Work on problems of diverse scope where analysis of data requires evaluation of identifiable factors.
- Demonstrate good judgment in selecting methods and techniques for obtaining solutions.
- Collaborate with cross-functional teams to ensure that services meet business requirements and are delivered on time with quality.
- Participate in on-call rotations to provide support for critical services, troubleshoot issues, and implement fixes to prevent future incidents.
Minimum Qualifications
- Requires a Master’s degree (or foreign equivalent degree) in Applied Data Science, Computer Science, Engineering, Applied Sciences, or a related field. Requires completion of a university-level course, research project, internship, or thesis in the following:
- UNIX or Linux operating system fundamentals
- TCP/IP network fundamentals
- Coding in at least one of the following higher-level programming languages: PHP, Python, C++, or Java
- Software frameworks and APIs
- Performing 'guerrilla capacity planning' for internet service architectures
- Internet service architectures (such as load balancing, LAMP, or CDNs)
- Configuring and maintaining applications using at least one of the following: web servers, load balancers, relational databases, storage systems, or messaging systems
- Relational Databases including MySQL
- Network protocols including at least one of the following: NFS, DHCP, NTP, SSH, DNS, or SNMP
- Maintaining web-based applications using at least one of the following: Apache, Memcached, or Squid
- Storage Systems including NFS
- Diagnosing and troubleshooting issues ranging from low-level hardware issues to large-scale failures within datacenter clusters
- Experience using high-performance query engines (Presto or Spark) for big data
$169,442/year to $209,000/year + bonus + equity + benefits
