G13 - Operations Support Engineer

FPT Asia Pacific • singapore, singapore, Singapore • Posted June 05, 2026

Location singapore, singapore
Job Type Full-time
Category Other-General
Posted June 05, 2026

Responsibilities:

  • Design & own service observability usage model: ensure all service metrics, logs, traces flow into Elastic Cloud (authoritative); maintain dashboards & SLOs; evaluate pragmatic use of CloudWatch, AWS Managed Prometheus / Grafana for supplemental or fallback views.
  • Build proactive, noise‑reduced alerting and incident response playbooks; drive post‑incident RCA & remediation tracking (closure SLA).
  • Optimize service performance (profiling, caching layers, autoscaling heuristics, concurrency tuning) meeting latency & throughput targets.
  • Implement secure supply chain & runtime controls (image scanning, SBOM consumption, secrets management, TLS / mTLS) leveraging shared platform tooling.
  • Curate operational runbooks, golden dashboards, reliability readiness + production readiness checklists.
  • Integrate model / guardrail service telemetry (latency, queue depth, GPU/CPU utilization) into unified Elastic ...

Interested in this role?

Click the button below to start your application.

Apply Now