We are looking for an experienced and proactive HPC Data Center Manager to lead operations at a state-of-the-art high-power compute data center in Austin, Texas. The role involves managing a team of technicians, ensuring smooth operations, and providing critical services support while maintaining exceptional service delivery within established Service Level Agreements (SLAs). This is a dynamic opportunity for a leader passionate about data center operations and high-performance computing.
RESPONSIBILITIES
Operational Management:
- Oversee facility infrastructure installations, ensuring adherence to SLAs and industry standards.
- Coordinate and execute power installations, relocations, and decommissioning of IT equipment.
- Conduct routine checks and ensure data halls are cleaned and maintained to operational standards.
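- Troubleshoot mechanical and electrical systems, including chillers, computer room air conditioner and air handler (CRAC/CRAH) units, uninterruptible power supply (UPS) systems, power distribution units (PDUs), and static transfer switch (STS) components.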
- Maintain detailed and up-to-date documentation for all processes and deployments.
Team Leadership:
- Manage and mentor a team of data center technicians, fostering professional growth and operational excellence.
- Provide regular feedback and identify training opportunities to enhance team performance.
- Encourage open and professional communication within the team and across the organization.
Service Delivery:
- Ensure all deployments are installed to meet internal, manufacturer, and industry standards.
- Support clients with onsite maintenance and critical services, ensuring adherence to SLAs.
- Prepare and follow Method Statements and Risk Assessments to ensure safe working practices.
- Lead configuration management, preventative maintenance, and repair tasks, completing them within SLA timeframes and escalating issues that cannot be resolved in time.
Strategic Problem Solving:
- Address unforeseen operational challenges effectively, providing timely feedback to leadership.
- Collaborate with data center leadership to support seamless operations and implement best practices.
REQUIREMENTS
Education and Experience:
- Bachelor’s degree or equivalent experience.
- 4+ years of experience leading data center operations teams.
Technical Expertise:
- In-depth knowledge of critical data center equipment, including UPS, PDU, STS, CRAC, CRAH, and Building Management Systems (BMS).
- Strong understanding of data center environments and operational requirements.
- Accredited training in data center infrastructure installation is preferred.
Skills and Attributes:
- Excellent written and verbal communication skills.
- Strong organizational, analytical, and problem-solving abilities.
- Ability to interact professionally with customers and stakeholders at all levels.
- Commitment to providing timely and reliable service.