Site Reliability Engineer/DevOps (Investment Bank, 50k)
A prestigious investment bank in Hong Kong seeks a Site Reliability Engineer to enhance the reliability, performance, and automation of mission-critical systems within a collaborative and growth-focused technology environment.
Key responsibilities:
As a Site Reliability Engineer, you will ensure the reliability, scalability, and security of critical banking systems through CI/CD design, automation, monitoring, and cross-team collaboration.
- Drive system reliability, availability, and performance through proactive monitoring, incident management, and SRE best practices
- Build and enhance observability platforms, including monitoring, alerting, dashboards, and telemetry pipelines across critical systems
- Manage and maintain Linux server environments (RHEL 7/8/9), ensuring security, stability, patch compliance, and operational efficiency
- Analyse logs, metrics, and system behaviour to troubleshoot incidents, resolve performance bottlenecks, and improve service resilience
- Partner with engineering teams to strengthen platform scalability, security, automation, and operational excellence
- Administer Kubernetes environments to ensure reliable workload delivery, platform health, and infrastructure observability
- Support business continuity through on-call participation, disaster recovery exercises, continuous improvement initiatives, and emerging SRE technologies and tools
Candidate profile:
To excel in this role, you bring experience supporting large-scale systems, strong DevOps expertise in CI/CD, cloud and containers, and the ability to collaborate effectively under pressure.
- Bachelor's degree in Computer Science, Engineering, or a related discipline, with 3-7+ years' experience in SRE, platform engineering, or production support environments
- Strong expertise in monitoring and observability platforms, including Prometheus, Elasticsearch, Grafana, Kibana, and enterprise monitoring tools
- Hands-on experience managing Linux (RHEL 7/8/9) and Kubernetes environments within large-scale, high-availability production systems
- Solid understanding of SRE principles, incident management, disaster recovery, automation, CI/CD pipelines, networking, and distributed systems troubleshooting
- Proficiency in scripting and automation tools (e.g., Python, Bash, Ansible), with strong problem-solving, multitasking, and communication skills; AI/ML infrastructure experience is an advantage
About the company:
A well-established global financial institution provides innovative capital markets and investment solutions to clients across international markets. The organisation fosters a collaborative, technology-driven environment with strong emphasis on professional growth, innovation, and building the next generation of financial services capabilities.
Keywords: site reliability engineering, DevOps, CI/CD pipelines, Azure, cloud environment, investment bank
What’s next?
Build resilient systems and power the future of financial technology. Apply now!
About the job
Contract Type: Perm
Specialism: Tech & Transformation
Focus: DevOps, SRE Engineer & Application support
Industry: Banking
Salary: Up to HKD50,000 per annum
Workplace Type: On-site
Experience Level: Associate
Location: Central
FULL_TIMEJob Reference: 2IIUZ3-AFCE50CC
Date posted: 9 June 2026
Consultant: Melanie Wu
hong-kong tech-transformation/devops 2026-06-09 2026-08-08 banking Central Central and Western District HK HKD 50000 50000 50000 YEAR Robert Walters https://www.robertwalters.com.hk https://www.robertwalters.com.hk/content/dam/robert-walters/global/images/logos/web-logos/square-logo.png true