Senior Site Reliability Engineer, Kubernetes ASE
Austin, Texas, United States
Software and Services
Join the Apple Service Engineering team as a Site Reliability Engineer and be part of something extraordinary. At Apple, your ideas have the power to shape the future of our products, services, and customer experiences. Bring your passion and dedication, and watch your vision become reality. As an SRE, you'll play a pivotal role in supporting and scaling cloud services for thousands of development and operations engineers. Our services demand uncompromising scalability, high availability, and seamless performance. This is a hands-on position where you'll establish SRE practices for our private/public cloud service, accelerating our ability to deliver thousands of applications reliably and consistently. If you're driven by designing, engineering, and running systems that make a real difference for our customers, Apple is the perfect place for you.
Description
We're searching for a driven Site Reliability Engineer (SRE) to join our innovative team. As an SRE, you'll be a cornerstone of our production software, ensuring our systems are uncompromisingly reliable, secure, and scalable. Your expertise will be vital in maintaining constant uptime, seamless scalability, and a thriving environment for new applications and services. The ideal candidate is a highly motivated self-starter with a passion for excellence, quality, and meticulous attention to detail. This role goes beyond traditional SRE work. You'll not only keep our systems running smoothly but also collaborate closely with developers and architects. Together, you'll design and implement solutions for improved stability, security, and scalability.
Minimum Qualifications
Kubernetes Expertise: Deep understanding of Kubernetes architecture, components, and best practices. Proficiency in managing Kubernetes clusters, deploying applications, and automating workflows using tools like Helm and Kustomize.
Cloud Platforms: Experience with major public cloud providers and their cloud-native services. Familiarity with infrastructure as code (IaC) tools like Terraform or Ansible.
SRE Principles: Adherence to SRE principles, including monitoring, alerting, error budgets, fault analysis, and automation. Strong focus on reliability, availability, and performance.
Telemetry and Observability: Expertise in implementing and coordinating telemetry using tools like Splunk, Grafana, and Prometheus. Ability to analyze and troubleshoot complex system issues.
Programming: Proficiency in Go (Golang) for developing automation scripts, tools, and custom applications.
Collaboration: Excellent interpersonal and communication skills. Ability to work effectively in cross-functional teams and foster a collaborative environment.
Preferred Qualifications
Documentation & Collaboration: Create clear alert handling procedures and runbooks, ensuring knowledge transfer and collaboration within and between SRE teams.
Automation Champion: Automate service deployment and orchestration in the cloud environment, as well as other routine processes, to streamline operations.
Resilience & Growth: Actively participate in capacity planning, scale testing, and disaster recovery exercises, ensuring our systems remain resilient.
Team Player: Foster strong relationships and provide support to partner teams like engineering, QA, and program management.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.
Apple will not discriminate or retaliate against applicants who inquire about, disclose, or discuss their compensation or that of other applicants. Learn more from the United States Department of Labor.
Apple will consider for employment all qualified applicants with criminal histories in a manner consistent with applicable law. If you're applying for a position in San Francisco, review the San Francisco Fair Chance Ordinance guidelines applicable in your area.
Apple participates in the E-Verify program in certain locations as required by law. Learn more about the E-Verify program.
Apple is committed to working with and providing reasonable accommodation to applicants with physical and mental disabilities.
Apple is a drug-free workplace. Learn more in the Reasonable Accommodation and Drug Free Workplace policy.