🥝 Reinforcement Learning Engineer

Appit LLC | Montreal (administrative region), Canada | Posted June 01, 2026

Job Description

APPIT Software Solutions is hiring a Reinforcement Learning Engineer in Montreal, Canada. The role involves designing reinforcement learning systems and building adaptive AI agents for optimization, autonomous decision-making, and RLHF alignment of large language models.

Responsibilities

  • Design and implement reinforcement learning algorithms for enterprise optimization problems
  • Build RLHF and reward modeling pipelines for LLM alignment and fine-tuning
  • Develop simulation environments for training and evaluating RL agents
  • Implement multi-agent reinforcement learning systems for complex coordination tasks
  • Optimize RL training stability and sample efficiency using state-of-the-art techniques
  • Collaborate with research teams to translate RL advances into production applications

Requirements

  • 5+ years of ML experience with 2+ years focused on reinforceme...
