🌿 Back to all jobs
🥝 NVIDIA Senior Engineer in GPU Inference
NVIDIA | toronto, Canada | Posted June 05, 2026
Job Description
Advance your career with NVIDIA as a Senior Engineer, focusing on GPU inference systems for AI. Drive optimization and collaboration while enhancing performance across large-scale models.
In this crucial role, you will architect high-performance inference stacks and fine-tune NVIDIA's GPU solutions to achieve top productivity. Your expertise will significantly contribute to hitting industry benchmarks and implementing advanced GPU kernels within a multi-cloud environment.
Key Responsibilities:
• Develop and optimize vLLM features with cutting-edge GPU technology
• Benchmark and profile GPU kernels for enhanced efficiency
• Create robust tools for inference benchmarking methods
• Spearhead orchestration of large-scale inference deployments
• Publish innovative research to elevate machine learning systems
Requirements:
• Extensive background in computer science with advanced degree options
• Proficient in Py...