Senior Deep Learning Research Engineer, LLM Inference

NVIDIA | Tel Aviv, Israel | Posted June 02, 2026

Job Description

We are seeking a Deep Learning Research Engineer to join our team and help develop the next generation of Large Language Model (LLM) inference algorithms. You will work on technologies that directly enhance NVIDIA's software, making the latest LLMs more efficient and accessible to users worldwide. This role is designed for someone with strong research foundations who also wants to build software that runs and scales in production systems around the world.

By joining us, you will be part of a strategic effort to establish NVIDIA as the definitive platform for high-performance LLM inference. The work requires a combination of research taste, experimental rigor, and engineering ownership: you will explore new ideas, run rigorous evaluations, and help transform successful approaches into tools and implementations.


What you'll be doing:
+ Develop and improve benchmarks, profiling workflows, and evaluation pipelines that make inference performance measurable and...
