This role builds and runs the infrastructure our Generative AI products depend on: the pipelines that ship code, the platforms that run services and models, and the controls that keep all of it secure and reliable. AI workloads bring their own demands: GPUs, model serving, inference autoscaling, and token cost all shape the work, so you should have run workloads like these before. You should also be comfortable owning infrastructure as code, CI/CD, observability, and security on AWS, and ready to set the operational standards a growing team will lean on.
WHAT YOU WILL DO