Senior Site Reliability Engineer
Location: Menteng, Jakarta
Job Type: Full-time
Category: other-general
Posted: May 25, 2026
Responsibilities
- Design, build, and maintain scalable, reliable, and secure infrastructure across AWS (including Elastic Beanstalk) and Azure.
- Develop and manage CI/CD pipelines using Azure DevOps, GitHub Actions, or similar tools to ensure smooth and automated deployments.
- Operate, monitor, and troubleshoot Kubernetes clusters (EKS, AKS, or self-managed) to ensure system stability and uptime.
- Implement comprehensive observability solutions using Prometheus, Grafana, Loki, and Alertmanager.
- Automate infrastructure provisioning and configuration using Terraform, Helm, CloudFormation, and/or Ansible.
- Define, measure, and improve system reliability through SLOs, SLIs, and SLAs.
- Enhance system resilience and incident response through proactive monitoring and capacity planning.
- Manage secrets, access control, and security policies to maintain a robust and compliant infrastructure.
- Participate in o...