
Senior Performance Systems Engineer

Job Description

Cornelis Networks delivers the world's highest-performance scale-out networking solutions for AI and HPC datacenters. Our differentiated architecture seamlessly integrates hardware, software and system level technologies to maximize the efficiency of GPU, CPU and accelerator-based compute clusters at any scale. Our solutions drive breakthroughs in AI & HPC workloads, empowering our customers to push the boundaries of innovation. Backed by top-tier venture capital and strategic investors, we are committed to innovation, performance and scalability - solving the world's most demanding computational challenges with our next-generation networking solutions.

We are a fast-growing, forward-thinking team of architects, engineers, and business professionals with a proven track record of building successful products and companies. As a global organization, our team spans multiple U.S. states and six countries, and we continue to expand with exceptional talent in onsite, hybrid, and fully remote roles.

Cornelis Networks is hiring a talented Senior Performance Systems Engineer to help drive innovation and contribute to the development of cutting-edge technologies in the semiconductor industry. In this role, you will be responsible for the day-to-day management and maintenance of our performance testing lab, ensuring optimal functionality and availability of server, networking, and GPU infrastructure. You will work closely with our performance engineering team to support their needs, manage inventory, and plan for future infrastructure requirements.

Key Responsibilities:

  • Software Installation and Configuration: Install, configure, and troubleshoot operating systems, libraries/packages, and specialized software required for performance testing.
  • Inventory Management: Maintain accurate inventory of lab equipment, including hardware, software licenses, and spare parts.
  • Troubleshooting and Maintenance: Diagnose and resolve hardware and software issues, perform routine maintenance, and implement preventative measures.
  • Requirements Gathering: Collaborate with the performance testing team to understand their infrastructure needs and translate them into actionable plans.
  • Documentation: Create and maintain detailed documentation of lab configurations, procedures, and troubleshooting steps.
  • Vendor Management: Liaise with vendors for hardware and software procurement, support, and maintenance.
  • Capacity Planning: Assist in planning for future lab infrastructure requirements, including capacity, power, and cooling.

Minimum Qualifications:

  • Education: Associate's or Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience.
  • 5 years of experience in a data center or lab environment.
  • Experience with Linux operating systems in a cluster environment, including provisioning of servers through network/PXE boot (Cobbler, Warewulf, Ansible).
  • Experience with server hardware installation including racking, cabling, PCIe card installation, etc.
  • Experience with resource management software (SLURM, PBS, etc.).
  • Technical Skills:
    • Strong understanding of server hardware (CPU, memory, storage).
    • Proficiency in network cabling and troubleshooting.
    • Experience with GPU hardware and drivers.
    • Knowledge of data center best practices.
    • Familiarity with monitoring, management, and telemetry tools.
  • Soft Skills:
    • Excellent problem-solving and troubleshooting skills.
    • Strong communication and interpersonal skills.
    • Ability to work independently and as part of a team.
    • Detail-oriented and organized.
    • Ability to prioritize and manage multiple tasks.

Qualifications:

  • Understanding of HPC and AI performance benchmarking.
  • Ability to run a range of performance benchmarks, from basic network tests through application-level testing.
  • Ability to correlate performance observations with cluster-level configuration changes and to improve overall cluster efficiency and performance.

Location: This is a remote position for employees residing within the United States.

We offer a competitive compensation package that includes equity, cash, and incentives, along with health and retirement benefits. Our dynamic, flexible work environment provides the opportunity to collaborate with some of the most influential names in the semiconductor industry.

At Cornelis Networks, your base salary is only one component of your comprehensive total rewards package. Your base pay will be determined by factors such as your skills, qualifications, experience, and location relative to the hiring range for the position. Depending on your role, you may also be eligible for performance-based incentives, including an annual bonus or sales incentives.

In addition to your base pay, you'll have access to a broad range of benefits, including medical, dental, and vision coverage, as well as life insurance, a dependent care flexible spending account, accidental injury insurance, and pet insurance. We also offer generous paid holidays, 401(k) with company match, and Open Time Off (OTO) for regular full-time exempt employees. Other paid time off benefits include sick time and bonding leave.

Cornelis Networks does not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. Cornelis Networks is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to genetic information, protected veteran status, or any other characteristic protected by law. We encourage applications from all qualified candidates and will accommodate applicants' needs under the respective laws throughout all stages of the recruitment and selection process.
