Job DescriptionJob Description
Overview:
We are seeking an experienced Data Center Technician to support and scale a global fleet of GPU and CPU servers across hyperscale and edge deployments worldwide.
The ideal candidate has 4+ years of hands-on experience with server and network hardware in colocation or data center environments, including racking, cabling, troubleshooting, and hardware upgrades. This role requires a self-driven professional who can maintain 99.99%+ uptime standards and perform effectively during critical incidents.
Responsibilities include deploying and supporting GPU-dense, liquid-cooled infrastructure, hardware diagnostics and repairs, rack-and-stack operations, cable management, warranty coordination, spare parts management, and escalation handling in a 24x7 production environment.
Key Responsibilities
- Install and perform ongoing maintenance on servers and network equipment, including rack and stack of servers and switches, cabling, and physical configuration of devices.
- Respond to reported server, network, and infrastructure issues.
- Work with facility staff to ensure that power, cooling, and all other facility-provided services are functioning properly; coordinate maintenance activities and ensure outages are addressed and escalated accordingly.
- Work through assigned tickets and work requests effectively, performing diagnosis and repairs to hardware.
- Replace server components such as CPUs, memory, drives, motherboards, and network cables.
- Work with vendor warranty technicians to ensure warranty issues are resolved promptly and properly.
- Proactively identify issues and areas to improve efficiency and develop plans for resolution.
- Set up, maintain, and document spare parts inventory.
- Collaborate with cross-functional teams to improve deployment efficiency, automation, and monitoring.
- Escalate issues as needed to ensure prompt resolution by assisting system administrators in debugging network, hardware, and Linux OS-related issues.
Qualifications
- Experience working in a data center as a technician or in a similar role installing, configuring, and troubleshooting server and network equipment.
- Experience with large-scale network and server deployments.
- Strong understanding of GPU hardware and liquid-cooled rack solutions.
- Ability to work independently and manage projects within the facility.
- Strong understanding of servers, network equipment, and data center facility services such as power, cooling, and cross-connections to service providers.
- Experience troubleshooting, building, repairing, and upgrading servers.
- Excellent verbal and written communication skills, collaborative nature, and a friendly, can-do attitude.
- Ability to work in a fast-paced, high-availability environment and respond proactively to incidents.
- Strong desire and aptitude for problem-solving.
- Ability to logically analyze and solve problems.
- Physical ability to stand for extended periods and lift up to 50 lbs.