Location: Prague
Work Arrangement: Remote / Hybrid
Working Hours: Standard working hours, with a required 4-hour daily overlap with Pacific Time (PST)
Employment Type: Full-Time
Company: Aptly Technology Corporation
Client Project: Microsoft Azure
About Aptly Technology Corporation
Aptly Technology Corporation is a trusted technology services provider specializing in scalable IT solutions and strategic talent placement. As a partner on key Microsoft Azure initiatives, we are seeking a highly skilled Site Reliability Engineer (SRE) to join our elite engineering team supporting Azure infrastructure reliability and automation efforts.
Role Overview
As a Site Reliability Engineer, you will be responsible for maintaining the reliability, scalability, and performance of critical Azure infrastructure services. You will work at the intersection of infrastructure engineering and automation, enabling smooth CI/CD processes and robust incident management for high-impact Microsoft Azure systems.
Key Responsibilities
- Design, implement, and support scalable infrastructure solutions using microservices architecture.
- Develop and maintain CI/CD pipelines and deployment strategies aligned with DevOps best practices.
- Build and manage Infrastructure as Code (IaC) solutions for automated provisioning and configuration.
- Monitor system performance and availability; proactively detect and resolve site reliability issues.
- Lead incident response for Severity 1 and 2 incidents, ensuring rapid resolution in live-site conditions.
- Collaborate closely with cross-functional teams to drive continuous improvement in system reliability and developer experience.
- Maintain clear and precise documentation of systems, procedures, and runbooks.
Required Qualifications
- 3+ years of hands-on professional experience with .NET, C#, and PowerShell/scripting.
- 3+ years of experience implementing microservices architectures and CI/CD deployment strategies with Azure DevOps.
- 2+ years of experience in Cloud, Infrastructure, or Platform Engineering, preferably in an Azure environment.
- 2+ years of experience with infrastructure automation or IaC frameworks (e.g., Terraform, ARM templates, Bicep).
- 2+ years of proven Site Reliability Engineering (SRE) experience, especially in managing Severity 1 and 2 incidents under live conditions.
- Experience with Azure DevOps, GitHub Actions, or similar CI/CD tools.
- Familiarity with observability tools (e.g., Azure Monitor, Prometheus, Grafana).
- Prior experience working in enterprise-scale cloud environments.
- Certifications such as Microsoft Certified: Azure Administrator Associate or Azure DevOps Engineer Expert.
- Strong attention to detail and commitment to delivering high-quality, accurate results.
- Excellent communication and collaboration skills with the ability to thrive in a dynamic, team-oriented environment.
What You Won’t Need
- This role does not require front-end development or UI/JavaScript expertise.
Why Join Aptly?
- Work on high-impact cloud projects in partnership with Microsoft.
- Be part of a forward-thinking team focused on innovation, automation, and operational excellence.
- Flexible work arrangements and competitive compensation.
- Opportunities for professional growth in a fast-paced and supportive environment.
Apply now to become part of a team shaping the future of Azure infrastructure reliability.