JOB SUMMARY
We are seeking an experienced Computer Vision Engineer with a strong background in AI and Large Language Models (LLMs). The ideal candidate will design, build and deploy computer vision solutions that integrate with generative AI and LLM frameworks to interpret, analyze and describe visual data. This role bridges the gap between image understanding and natural language processing, enabling intelligent visual‑language applications.
KEY RESPONSIBILITIES
- Develop and implement computer vision models for image classification, object detection, segmentation, facial recognition and visual understanding.
- Integrate vision models with LLMs (e.g. GPT, LLaVA, CLIP or multimodal models) to build systems that interpret and describe visual content.
- Design AI pipelines that combine text, images, and video data for multimodal learning and reasoning.
- Utilize deep learning frameworks (TensorFlow, PyTorch, OpenCV) to prototy...