Title: Vision AI Engineer

Job ID: 20935

Location:

Elect – 100 Jurong East Street, SG

Description:

Role: Vision AI Engineer

Location: Singapore
Employment Type: Full-time

About the Role

We’re looking for a Vision AI Engineer to help build next‑generation video intelligence systems powered by modern vision‑language models. You’ll work across the full video understanding stack—combining multimodal foundation models with established analytics approaches to deliver reliable, production‑ready AI solutions.

Key Responsibilities

Build end‑to‑end video analytics pipelines using vision‑language models.
Fine‑tune and adapt foundation models for domain‑specific video understanding.
Integrate VLM reasoning with traditional video analytics components.
Develop and maintain inference pipelines for video and multimodal data.
Deploy and optimize models for scalable, high‑performance production use.
Diagnose model issues and strengthen system stability and robustness.
Collaborate with product and engineering teams to deliver AI-driven features.

Required Qualifications

Strong background in computer vision, video analytics, or AI engineering.
Practical experience with vision‑language and video‑language architectures.
Hands-on experience fine‑tuning, evaluating, and deploying deep learning models.
Familiarity with foundation models such as CLIP‑based architectures, BLIP/BLIP‑2, and open‑source VLMs (e.g., Qwen‑VL, InternVL).
Proficiency in Python and deep learning frameworks (e.g., PyTorch).
Solid understanding of CNNs, Transformers, and attention mechanisms.
Experience with model optimization techniques (quantization, batching, memory strategies).
Experience deploying models on Docker, cloud platforms, or on‑prem GPU systems.

Preferred Qualifications

Master’s or PhD in Computer Vision, Machine Learning, AI, or related fields.
Experience with real‑time or near‑real‑time video analytics.
Familiarity with traditional VA methods (detection, tracking, motion analysis).
Exposure to MLOps workflows (versioning, CI/CD, monitoring).
Interest in modern VLM and video understanding research.

What We Offer

Opportunities to work on cutting‑edge multimodal AI technologies.
Ownership of production‑scale video intelligence pipelines.
A collaborative environment that blends research and engineering.