We are looking for a highly capable engineer/researcher to lead the R&D of Small Language Models (SLMs) and Vision-Language Models (VLMs) for edge / low-latency and cost-efficient production scenarios. You will own the continuous pretraining, supervised instruction tuning (SFT), and compression/distillation pipelines, and work closely with platform teams to deliver reliable, measurable improvements in inference efficiency, tool‑use success rate, and overall model quality.
Conduct continuous pretraining and SFT for SLMs and VLMs to improve task performance and domain adaptation.
Build reproducible training workflows in PyTorch , including data processing, training, evaluation, and model versioning.
...