Job Description
We are looking for a Sr. Engineer to design, build, and scale the infrastructure powering NVIDIA’s AI agent ecosystem. You will work at the intersection of distributed systems, developer platforms, and agentic AI — building the foundational services that enable teams across the company to develop, deploy, orchestrate, and operate autonomous AI agents at production scale.
What you will be doing:
+ Build and develop platform services that own the full agent lifecycle from registration through deployment, execution, and teardown
+ Architect Kubernetes-based execution environments with pod lifecycle management, namespace isolation, persistent storage, and identity propagation
+ Develop and maintain automated CI/CD pipelines using GitLab CI and ArgoCD, including reusable pipeline templates and deployment blueprints that standardize how agents are built across teams
+ Build framework-agnostic infrastructure supporting multiple agent SDKs (Claude Code, OpenAI Codex...