Job DescriptionJob DescriptionInfrastructure Engineer (Systems & Storage)
We are seeking a versatile Infrastructure Engineer to act as the guardian of our data integrity and physical backbone. This role manages the "Persistence Layer" of our global healthcare-related suite—ensuring our high-performance storage and network stacks are resilient, secure, and ready for massive scale.
You are a technical "Generalist" who is equally comfortable in a terminal as you are in a data center. You will own the health of our NetApp and Qumulo storage clusters and maintain the network security boundaries that keep our patient data safe.
Key Responsibilities
-
Network & Security: Maintain Fortigate firewalls and Cisco network infrastructure, ensuring high availability and secure VLAN management.
-
Storage Management: Administer high-performance NetApp clusters and high-capacity Qumulo systems.
-
Data Center Operations: Manage the physical health of our East Coast data centers, including racking, cabling, and hardware lifecycle management.
-
Collaboration: part of a global team, working closely with off-shore infrastructure engineers. Willing to initiate discussions with development teams early in the cycle.
-
Incident Response: Participate in a modernized, data-driven incident response process to ensure 24/7 stability for our production environments.
What You Bring
-
Storage Depth: 2+ years of experience with enterprise storage (specifically NetApp/ONTAP) including volume management and snapshots.
-
Systems Generalist: A solid foundation in Linux (RHEL, OL8) and VMware virtualization.
-
Networking Fundamentals: Comfortable managing firewall rules and switching in a complex, multi-site environment.
-
Operational Discipline: Experience working within highly regulated frameworks like HIPAA, SOC2, or ARC-AMPE.
-
Mobility: Ability to travel to our East Coast data center locations for hands-on hardware.
Skills & Mindset
-
Cloud Infrastructure Management: Experience with Oracle OCI is a plus.
-
Automation-First: Experience with (or a strong desire to master) Ansible, Terraform, and Python is a plus.
-
Monitoring & Logs: Experience with modern observability tools like Datadog APM and CloudStrike Falcon is a plus.
-
Problem Solver: You don't just fix the ticket; you find a way to automate the fix so the ticket never returns.