LLM MLOps Engineer
A leading organisation in Hong Kong is seeking an LLM MLOps Engineer to join their renowned Data Monetisation Team. You will design, build, and optimise scalable AI systems that enhance customer experiences and drive measurable business impact.
Key responsibilities:
As an LLM MLOps Engineer, you will play a key role in designing and optimising advanced AI systems leveraging RAG techniques.
- Support the design and maintenance of end-to-end ML and RAG pipelines for production-ready AI systems
- Deploy, integrate, and scale machine learning and retrieval-based models in production environments
- Automate workflows for model training, evaluation, validation, and deployment
- Optimise RAG components including embeddings, document chunking, re-ranking, vector retrieval, and prompt engineering with guardrails
- Monitor model and retrieval performance, data drift, query relevance, and response quality using metrics and A/B testing
- Manage model/version control, experiment tracking, vector databases, feature stores, and data pipelines
- Ensure system scalability, reliability, security, and compliance while troubleshooting production issues across data, models, and infrastructure
Candidate profile:
To excel in this role, you will bring proven expertise in building RAG pipelines, strong programming skills in Python, and hands-on experience with embedding models like OpenAI or Hugging Face.
- Master’s/PhD in Statistics, Machine Learning, Mathematics, Computer Science, Economics, or related quantitative field, or equivalent experience
- 3–5+ years in Data Science with experience in leading teams and delivering ML-driven business impact; exposure to Retail/FMCG/Property/Telecom is a plus
- Strong expertise in statistical analysis, machine learning, predictive modelling, data mining, and high-dimensional data processing
- Hands-on experience with MLOps, ML lifecycle management, model deployment, and pipeline automation
- Strong proficiency in Python and SQL, with experience using distributed systems (Hadoop, Spark) and cloud platforms (AWS, Azure, or GCP)
- Proven experience building RAG pipelines, vector databases (Pinecone, FAISS, Weaviate, Chroma), embeddings, and NLP/LLM frameworks (LangChain, LlamaIndex)
- Knowledge of prompt engineering, LLMs, reinforcement learning, and Model Context Protocol (MCP) is a plus
- Strong communication skills with ability to translate technical concepts to business impact; collaborative, proactive, and ownership-driven mindset
About the company:
A large Asia-based organisation operating in the telecommunications and digital services industry, providing a wide range of IT, cloud, and business support solutions. The company focuses on enabling digital transformation for enterprise and consumer clients through integrated service platforms. It is known for its strong operational scale, reliable infrastructure, and ability to deliver technology-driven services that support efficiency, connectivity, and customer experience across multiple sectors.
Keywords: MLOps, LLMOps, RAG, Python, SQL, machine learning, NLP, Cloud, Pipelines
What's next:
If you’re ready to make a meaningful impact by shaping how advanced AI technologies transform business outcomes, apply now!
About the job
Contract Type: Perm
Specialism: Tech & Transformation
Focus: AI, Data & Analytics
Industry: Telecommunications
Salary: Negotiable
Workplace Type: On-site
Experience Level: Senior Management
Location: Hong Kong
FULL_TIMEJob Reference: DZC7I9-5E7B717A
Date posted: 5 May 2026
Consultant: Krishi Shah
hong-kong tech-transformation/data-science-big-data 2026-05-05 2026-07-04 telecommunications Hong Kong HK Robert Walters https://www.robertwalters.com.hk https://www.robertwalters.com.hk/content/dam/robert-walters/global/images/logos/web-logos/square-logo.png true