Site Reliability Engineer for AI and DevOps Support

PowerToFly • mississauga, peel region, Canada • Posted May 27, 2026

Location mississauga, peel region
Job Type Full-time
Category Other-General
Posted May 27, 2026
Enhance AI and DevOps platform stability as a dedicated Site Reliability Engineer. Collaborate with cross-functional teams to resolve incidents and boost operational efficiency in a dynamic support environment.
This position seeks an experienced SRE to aid our AI and DevOps Platform Support team. The role involves assisting with application stability, improving service levels, and coordinating with offshore managed services. Key skills include troubleshooting, communication, and a solid understanding of platform operations.
Key Responsibilities:
• Resolve incidents to maintain platform stability
• Coordinate daily operational activities and vendor interactions
• Support onboarding activities using established standards
• Contribute to performance tuning and cost-efficiency initiatives
• Participate in resilience and disaster recovery activities
Requirements:
• 5–8 years in technical support or platform operations
• Familiarity with Kubernetes and CI/CD too...

Interested in this role?

Click the button below to start your application.

Apply Now