We are a platform that revolutionized the Brazilian sports betting market, creating a unique ecosystem with technological solutions, strategic marketing, exclusive services, and numerous success stories.
As part of the Site Reliability Engineering team at Flutter Brazil, you will work in a highly collaborative and fast-moving environment, building the infrastructure foundations that enable our products to operate reliably at scale. This role has a strong software engineering focus, where you will design and build systems, tools, and automation that improve reliability, scalability, and developer experience.
You will design and maintain cloud infrastructure, strengthen observability, optimize performance, and develop internal platforms and services that empower engineering teams through code-first solutions.
This role offers the opportunity to work with modern cloud technologies, distributed systems, and high-availability environments, ensuring that our platform remains secure, resilient, and efficient.
- Design, implement, and maintain cloud infrastructure using Infrastructure as Code (ArgoCD, Helm, Terraform).
- Monitor infrastructure and observability systems using Grafana and Prometheus.
- Manage and optimize AWS services, including EKS and RDS.
- Implement and maintain modern CI/CD practices with GitHub.
- Manage cloud security and access controls.
- Optimize cloud resource utilization and cost efficiency.
- Design and build internal tools, services, and automation using software engineering best practices.
- Develop scalable systems and frameworks that reduce operational toil and improve reliability.
- Build self-service tools that empower developers and increase autonomy.
- Share knowledge and promote technical exchange across engineering teams.
You will bring deep technical expertise in cloud reliability, automation, and distributed systems, paired with a collaborative mindset and strong problem-solving skills. You have a strong software engineering background and are comfortable building production-grade systems, not only managing infrastructure. You thrive in environments where reliability, observability, and performance are critical.
- 5+ years of experience as a Software Engineer, Platform Engineer, SRE, or related role.
- Advanced English proficiency.
- Experience with Infrastructure as Code (ArgoCD or Flux, Terraform, Helm or Kustomize).
- Proficiency with Kubernetes and container orchestration.
- Portuguese speaker
- Experience with cloud services, preferably AWS (EKS, RDS).
- Strong programming skills with experience building backend services or internal platforms (preferably in Golang or Python).
- Understanding of security best practices.
- Experience with automation and CI/CD pipelines (GitHub or GitLab).
- Experience with observability systems (Grafana, Prometheus, Kiali).
- Ability to troubleshoot and debug complex distributed systems.
- Familiarity with multi-cloud architectures.
- Experience with data streaming and event-driven architectures.
- Security by Design mindset: Zero Trust, IAM, RBAC, OPA.
- Product mindset: building platforms that deliver real value to developers.
- Experience collaborating with DevOps, SRE, and Software Engineering teams.
- Strong automation mindset: reducing toil and eliminating repetitive work.
- Experience with Service Mesh (Istio, Linkerd, Consul).
- Experience with Kubernetes Operators and Custom Controllers (Kubebuilder, Operator SDK).
- Experience defining SLOs and SLIs that improve platform reliability.
- Experience designing and building developer platforms or internal tooling used by engineering teams.
It’s ok if you don’t think you tick every box on this list. We love people who want to challenge themselves and are passionate about what they do. If you believe you can contribute in some areas and are eager to learn, we encourage you to apply.
✨ Competitive compensation
Access to TotalPass
️ Paid time off
Remote-first environment
Growth and learning opportunities through the Flutter Edge global network