Skip to main content

NOC Infrastructure Engineer, Data Center

Job DescriptionJob DescriptionAqueduct Technologies is seeking a Staff-level NOC Infrastructure Engineer to serve as a technical leader and escalation point within the Network Operations Center (NOC) team, specializing in data center infrastructure and hybrid cloud environments. You will own complex incident resolution and deep technical investigations across servers, virtualization, storage, backup/recovery, and , with a strong emphasis on resilience, recoverability, and operational excellence.
The NOC Infrastructure Engineer embodies our values of professionalism, empathy, and technical excellence. We’re looking for someone who operates with a high level of autonomy and accountability, driving incident resolution, strengthening documentation and standards, and mentoring others to improve team capability and reliability of services.Staff-Level Expectations:

  • Perform the role independently with little need for oversight except with particularly difficult issues or situations.
  • Serve as the primary ticket owner for infrastructure queue work based on assignment, expertise, and business need.
  • Invest dedicated time in training with the goal of becoming a Subject Matter Expert in at least half of Aqueduct’s core infrastructure applications/support areas.
  • Appropriately engage the Lead or higher-level resource when significant issues arise.
  • Participate in the On-Call after-hours support rotation.

Core Responsibilities:

  • Data Center Infrastructure Operations
  • Support and troubleshoot customer data center environments including:
  • Physical servers, firmware/driver baselines, hardware health, and break/fix triage
  • Virtualization platforms (e.g., VMware vSphere/ESXi and/or Hyper-V) including cluster health, resource contention, HA/DRS behavior, datastore and VM performance issues
  • Storage systems (SAN/NAS/Storage arrays) including capacity, performance, multipathing, snapshots, and connectivity considerations

Execute and improve operational practices

  • Patch management coordination (host/guest) with clear risk controls and validation steps
  • Capacity and performance monitoring; identify bottlenecks and single points of failure
  • Standardize configurations and baselines to reduce incidents

Backup, Recovery, and Resilience

  • Administer and support Veeam and Rubrik environments, including:
  • Job health, repository/cluster capacity, retention policies, and alerting
  • Restore operations (file, application, VM, and full environment recovery) under time pressure
  • Recovery testing, validation, and documentation (runbooks, RTO/RPO alignment)
  • Improve recoverability posture (immutability where applicable, ransomware recovery workflows, least-privilege access)
  • Proactively identify backup gaps and propose improvements to reduce exposure and improve recovery outcomes.

Hybrid Cloud &

  • Administer and troubleshoot Azure core infrastructure services:
  • Networking/connectivity (VNets, routing, firewalling patterns), compute, storage, and governance (policy, tagging/standards)
  • Monitoring and alerting via Azure Monitor / Log Analytics (or equivalent) with actionable signal quality
  • Manage and troubleshoot Microsoft Entra ID (Azure AD)
  • lifecycle basics, access administration, RBAC, privileged access patterns (e.g., PIM)
  • Conditional Access design/troubleshooting and secure authentication policies (MFA, protection concepts)
  • Support hybrid dependencies as relevant (e.g., AD integration, directory sync/ flows) without turning the role into “pure Microsoft admin.

Incident Ownership, Documentation, and Continuous Improvement

  • Lead escalated ticket troubleshooting using Aqueduct processes and best practices.
  • Maintain exceptional notes, clear reasoning, and step-by-step detail in all work.
  • Review customer environments proactively to identify risks, misconfigurations, or single points of failure.
  • Keep documentation updated, accurate, and thorough for all , access, and cloud services.
  • Identify recurring issues and propose improvements that reduce future incidents (automation, standards, documentation, monitoring).

Change Control

  • Fully participate in and comply with Aqueduct’s Change Control processes.
  • Plan and execute infrastructure-impacting changes carefully, following risk-mitigation and rollback best practices.

Training & Growth

  • Maintain and pursue deep technical specialization across Azure, Entra, Veeam, and Rubrik.
  • Engage in cross-training to expand into additional NOC practice areas.
  • Act as a mentor and knowledge resource to junior NOC engineers.

Qualifications:

  • Education & Experience:
  • Bachelor’s degree in Information Technology, Computer Science, or related field (or equivalent experience).
  • 5+ years of experience of progressive experience supporting data center infrastructure.
  • Demonstrated experience owning complex incidents independently (including after-hours/high-impact events).
  • Strong troubleshooting ability across virtualization, storage, and server infrastructure.
  • Hands-on experience with backup/restore operations in Veeam and/or Rubrik.
  • Working knowledge of Azure operations and Entra ID administration in support of hybrid environments.
  • Strong change hygiene: risk assessment, rollback planning, and clean validation practices.
  • Clear, disciplined documentation and customer-facing communication.
  • Mentor who guides and uplifts junior and mid-level engineers.

:

  • Experience in an MSP or multi-tenant environment.

Aqueduct Technologies is committed to developing a diverse and talented team. We celebrate and support and are committed to making an inclusive environment for all employees and applicants including women, minorities, individuals with disabilities, members of the LGBTQIA community, veterans, and any other legally protected group. We are an Equal Opportunity Employer and do not discriminate against any employee or applicant on the basis of any status protected by federal, state, or local laws.
Aqueduct Technologies is one of the largest IT solutions providers in the US, recognized for our relentless pursuit of customer satisfaction, our corporate culture, technology leadership, and our commitment to the local community. We pride ourselves on our world-class engineering, the investments we make in our employees and our systems, and on our loyal base of customers and manufacturers. Recognized as one of the fastest-growing, private companies in Massachusetts—and awarded the Best Place to Work in Boston for six, consecutive years—there is no better time to join Aqueduct than now!

We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.

NOC Infrastructure Engineer, Data Center

Canton, MS
Full time

Published on 01/30/2026

Share this job now