Site Reliability Engineer
Cape Town, Western Cape, South Africa · Full Time
- Experience: 3+ yrs
- Salary: Not disclosed
- Openings: 1
- Posted: 2 hours ago
- Work mode: In office
- Eligibility: Candidates should have at least 3 years of experience in DevOps or Site Reliability Engineering roles and be comfortable working with backend development, AWS infrastructure, Terraform, and observability tooling.
- Resume: Required to apply
Job description
About Robin
Robin AI is working to reshape the legal sector by making contracts easier to understand and use for everyone. The company develops Legal AI technology using its own models, licensed datasets, and strategic collaborations with Anthropic and AWS. Founded in 2019, Robin AI now operates across four continents and supports major organisations such as GE, Pfizer, KPMG, and UBS.
Role Overview
As a Site Reliability Engineer, you will contribute to the development and upkeep of the cloud infrastructure and applications behind Robin AI’s Legal AI platform. You will work closely with engineering teams to create dependable monitoring, incident response, and deployment practices that keep services highly available and reliable for customers around the world.
Key Responsibilities
- Keep Robin’s systems dependable, highly available, and able to scale as demand grows.
- Introduce and maintain strong observability across the service-oriented stack using logs, traces, metrics, and alerts.
- Build, run, and improve infrastructure that supports product teams as the company expands into new geographies.
- Reduce repetitive operational work by introducing practical automation.
- Partner with development leads to improve how code is built, tested, and deployed.
- Take part in on-call coverage and strengthen incident management practices to support round-the-clock reliability.
Required Background
- At least 3 years of experience in DevOps or Site Reliability Engineering.
- Working knowledge of at least one backend language; Python is used in this environment.
- Strong hands-on experience with AWS services such as ECS, S3, RDS, and Lambda, with infrastructure managed through Terraform.
- Ability to troubleshoot issues end to end, from the browser and network layers through containers and data stores.
- Familiarity with observability tools and frameworks; OpenTelemetry, CloudWatch, and Datadog are used here.
- Strong analytical thinking and communication skills.
- Exposure to AI/ML infrastructure deployments will be considered an advantage.
What You’ll Get
- Competitive compensation.
- A generous equity package so every team member shares ownership in Robin AI.
- 20 days of paid time off per year, plus South African public holidays.
- Clear growth paths, with strong performers prioritised for promotion.
Culture and Working Environment
Robin values creativity, resourcefulness, and a strong drive for excellence. Team members are encouraged to challenge themselves, take thoughtful risks, and explore new ideas. The company aims to foster a setting where people can grow, contribute meaningfully, and help solve complex problems together.
Diversity, Equity and Inclusion
Robin is committed to building one of the most diverse technology organisations globally. In 2024, more than 30% of employees came from ethnic minority backgrounds and 51% of roles were held by women. The company believes diverse perspectives are essential to innovation and is focused on creating an inclusive environment where that innovation can thrive.
Additional Information
Robin follows a direct hiring approach. Any speculative CVs submitted through agencies will be treated as a gift, with no agency fee payable.