Job DescriptionJob Description
This position is posted by Jobgether on behalf of d-Matrix. We are currently looking for a Machine Learning Computer Architect, Senior Staff - Workload Analysis in Santa Clara, CA.
We are seeking a highly skilled Machine Learning Computer Architect to advance the performance of next- AI applications at the intersection of hardware and software. This role focuses on analyzing emerging ML workloads, designing high-performance accelerator features, and bridging the gap between software algorithms and hardware systems. You will work closely with cross-functional teams to create analytical models, propose new hardware/software features, and optimize inference accelerators for data center environments. This position offers the opportunity to work on cutting-edge generative AI technologies and contribute to innovations that will shape the future of AI infrastructure.
Accountabilities:
The Machine Learning Computer Architect will be responsible for:
- Analyzing emerging machine learning workloads, including multi-modal LLMs, chain-of-thought reasoning models, and generative video/audio models.
- Developing and proposing hardware and software features to accelerate AI inference and optimize data movement across tensor cores, storage, and network layers.
- Creating analytical performance models and using architecture simulators to project system efficiency for current and future hardware .
- Collaborating with partner teams across product, hardware design, compilers, inference servers, and kernel development.
- Staying current with research trends in ML architecture, algorithms, and accelerator technologies to ensure state-of-the-art solutions.
- Contributing to publications or technical documentation for internal and external audiences.
Requirements
Candidates should meet the following qualifications:
- BSEE with 7+ years of industry experience or MSEE with 4+ years of industry experience.
- Strong foundation in computer architecture, hardware-software co-design, performance modeling, and machine learning fundamentals (particularly DNNs).
- Programming fluency in C/C++ and Python.
- Experience developing analytical performance models or architecture simulators (e.g., gem5, GPGPU-Sim).
- Research background with publications in top-tier architecture or ML venues is a plus (e.g., ISCA, MICRO, ASPLOS, HPCA, DAC, MLSys).
- Self-motivated, collaborative team player with initiative and strong problem-solving skills.
- Ability to work in a hybrid environment and contribute to complex, high-impact projects.
Benefits
This role offers:
- Competitive salary range: $132K–$235K, depending on experience
- Equity grants and performance-based bonuses
- Comprehensive medical, dental, and vision coverage
- 401(k) retirement plan and other financial benefits
- Flexible hybrid work arrangements
- Opportunity to work on state-of-the-art AI hardware/software innovations
- Collaborative, inclusive, and high-impact work environment
Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.
When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.