Job Detail

Principal Technical Program Manager, Datacenter Availability - Microsoft Corporation
Des Moines, IA
Posted: Mar 06, 2024 03:16

Job Description

Microsoft's mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Microsoft's Cloud Operations & Innovation (CO+I) is the engine that powers our cloud services. CO+I is responsible for delivering over 200 Microsoft web portals, Live and Online Services around the world including infrastructure, security and compliance, operations, globalization, and manageability. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide.

Within CO+I, the Datacenter Availability Team is responsible for ensuring the uptime, reliability, and availability of Microsoft's cloud business datacenters. Microsoft has a portfolio of datacenters globally and the Availability Team is looking to fill the vital role of Principal Technical Program Manager, Datacenter Availabilty .

This is a remote role. Candidates can sit anywhere in the United States.

Responsibilities

  • Availability Strategy Development: Develop and implement a comprehensive strategy for ensuring high availability of datacenter services and infrastructure.

  • Program Planning and Execution: Lead the planning and execution of programs related to datacenter availability, ensuring alignment with organizational goals and timelines.

  • Risk Assessment and Mitigation: Identify potential risks to datacenter availability, conduct risk assessments, and develop and implement mitigation strategies.

  • Collaboration with Cross-Functional Teams: Work closely with cross-functional teams, including engineering, operations, and security, to ensure a cohesive approach to datacenter availability.

  • Prognostics and incident prevention: Develop prognostics and incident prevention initiatives in collaboration with our Reliability Engineers, utilizing predictive analytics and proactive measures to identify and address potential issues before they escalate.

  • Capacity Planning: Collaborate with capacity planning teams to ensure that datacenter resources meet current and future demands, optimizing for availability.

  • Documentation and Reporting: Maintain comprehensive documentation of availability-related processes, procedures, and incidents. Provide regular reports to stakeholders on the status of availability initiatives.

  • Continuous Improvement: Establish mechanisms for continuous improvement, including post-incident reviews, performance analysis, and proactive identification of areas for enhancement.

  • Datacenter Efficiency: Implement and oversee initiatives to improve datacenter efficiency, including optimizing power usage, cooling systems, and resource utilization to minimize environmental impact and operational costs.

  • Support of Central Operations Core Pillars: Provide support and collaboration with the Operational Readiness team, Resources Efficiency team, and Central Operations Program Manager to ensure seamless coordination and alignment of efforts across all operational aspects, promoting efficiency and overall effectiveness of datacenter operations.

  • Embody our culture and values.

Qualifications

Required Qualifications:

  • Bachelor's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 8+ years of mission critical experience in electrical, mechanical, or controls engineering

  • OR Master's Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 7+ years of mission critical experience in electrical, mechanical, or controls engineering

  • OR Doctorate Degree in Mechanical Engineering, Materials Engineering, Reliability Engineering, Electrical Engineering, or related field AND 5+ years of mission critical experience in electrical, mechanical, or controls engineering.

Other Requirements:

Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include, but are not limited to the following specialized security screenings:

  • Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

  • Proficiency in Datacenter or Critical Environment's Mechanical and Electrical systems

  • Communication and leadership skills to drive and support projects.

  • Proficient at utilizing Root Cause Analysis methodologies and creating Failure Mode and Effects Analyses (FMEA).

  • Experience leading construction, design, and process reviews assessing availability and reliability threat vectors, while partnering with various teams to design out or eliminate the potential issues.

  • Have a -sense of urgency- to drive resolution and an unquenchable desire to understand root causes and incident triggers.

  • Understanding of datacenter and topologies or equivalent Mission Critical facility background.

  • Analytical skills with the ability to summarize large, complex data from multiple databases and systems.

  • Team player with the ability to influence cross functional team and leadership team in driving process improvement, efficiencies, and best practices.

Reliability Engineering IC5 - The typical base pay range for this role across the U.S. is USD $133,600 - $256,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $173,200 - $282,200 per year.

Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay

Microsoft is an equal opportunity employer. Consistent with applicable law, all qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations (https://careers.microsoft.com/v2/global/en/accessibility.html) .



Job Detail


Company Overview

Microsoft Corporation

Des Moines, IA