🌿 Back to all jobs

🥝 Reinforcement Learning Research Intern

CloudNuro | Hyderabad, India | Posted June 05, 2026

Job Description

Program structure

Track: Research engineering

Reports to: Staff research engineer, EOS Intelligence Plane team

Duration: 20–24 weeks, full-time preferred

Primary languages: Python (PyTorch or JAX), familiarity with Stable Baselines / CleanRL / TorchRL

Outcome: A trained, sim-validated routing policy that demonstrably improves utility- per-dollar over the production baseline

Compensation: stipend per internal scale;
conversion to full-timeconsidered for strong performers.

Mentorship: each intern is paired with a senior engineer or researcher who is the technical owner of the area.


How to apply: Send


- Resume / CV (PDF).


- A link to a GitHub profile, portfolio, or representative project.


- The role num...

Apply for This Position

Submit Application