LLM MLOps Engineer

A leading organisation in Hong Kong is seeking an LLM MLOps Engineer to join their renowned Data Monetisation Team. You will design, build, and optimise scalable AI systems that enhance customer experiences and drive measurable business impact.

Key responsibilities:

As an LLM MLOps Engineer, you will play a key role in designing and optimising advanced AI systems leveraging RAG techniques.

Support the design and maintenance of end-to-end ML and RAG pipelines for production-ready AI systems
Deploy, integrate, and scale machine learning and retrieval-based models in production environments
Automate workflows for model training, evaluation, validation, and deployment
Optimise RAG components including embeddings, document chunking, re-ranking, vector retrieval, and prompt engineering with guardrails
Monitor model and retrieval performance, data drift, query relevance, and response quality using metrics and A/B testing
Manage model/version control, experiment tracking, vector databases, feature stores, and data pipelines
Ensure system scalability, reliability, security, and compliance while troubleshooting production issues across data, models, and infrastructure

Candidate profile:

To excel in this role, you will bring proven expertise in building RAG pipelines, strong programming skills in Python, and hands-on experience with embedding models like OpenAI or Hugging Face.

Master’s/PhD in Statistics, Machine Learning, Mathematics, Computer Science, Economics, or related quantitative field, or equivalent experience
3–5+ years in Data Science with experience in leading teams and delivering ML-driven business impact; exposure to Retail/FMCG/Property/Telecom is a plus
Strong expertise in statistical analysis, machine learning, predictive modelling, data mining, and high-dimensional data processing
Hands-on experience with MLOps, ML lifecycle management, model deployment, and pipeline automation
Strong proficiency in Python and SQL, with experience using distributed systems (Hadoop, Spark) and cloud platforms (AWS, Azure, or GCP)
Proven experience building RAG pipelines, vector databases (Pinecone, FAISS, Weaviate, Chroma), embeddings, and NLP/LLM frameworks (LangChain, LlamaIndex)
Knowledge of prompt engineering, LLMs, reinforcement learning, and Model Context Protocol (MCP) is a plus
Strong communication skills with ability to translate technical concepts to business impact; collaborative, proactive, and ownership-driven mindset

About the company:

A large Asia-based organisation operating in the telecommunications and digital services industry, providing a wide range of IT, cloud, and business support solutions. The company focuses on enabling digital transformation for enterprise and consumer clients through integrated service platforms. It is known for its strong operational scale, reliable infrastructure, and ability to deliver technology-driven services that support efficiency, connectivity, and customer experience across multiple sectors.

Keywords: MLOps, LLMOps, RAG, Python, SQL, machine learning, NLP, Cloud, Pipelines

What's next:

If you’re ready to make a meaningful impact by shaping how advanced AI technologies transform business outcomes, apply now!

Similar jobs

View more jobs

LLM MLOps Engineer

Share

Similar jobs