Are you excited by the idea of working within a global organization, helping to power a suite of innovative, cloud-native applications? Do you thrive in environments centered on cloud technology, microservices, and machine learning? If so, we may have the perfect opportunity for you.
About the Role
As a Senior Site Reliability Engineer, you’ll be a key part of a team of skilled specialists dedicated to building and managing advanced digital products. You will join an agile department focused on developing and supporting high-impact systems in alignment with our digital strategy. Our team leverages cutting-edge AI and machine learning to deliver solutions that add real business value.
Tasks
As a Senior Site Reliability Engineer, you will play a central role in ensuring system reliability, performance, and scalability in a global, cloud-native environment. This includes:
System Reliability & Performance : Maintain 24/7 system availability and performance, including participation in on-call support. You’ll design and implement proactive monitoring, automated healing solutions, and robust error budgeting to support high availability and fault tolerance.
Capacity & Reliability Planning : Develop capacity plans to maintain consistent service quality and uphold reliability standards.
Cross-Functional Collaboration : Work closely with IT and business teams to enhance system availability, functionality, security, and performance.
Technical Proficiency : Leverage leading technologies such as Java, Kubernetes, Kafka, Mongo DB, and Docker to support complex, high-transaction, real-time data processing systems.
Requirements
You bring a background as an SRE or Dev Sec Ops Engineer, with strong experience in cloud technologies and a high level of expertise in:
Automation & Orchestration : Skilled in Kubernetes, Kafka, and automation tools.
Technical Problem Solving : Strong analytical skills with a collaborative, solutions-oriented mindset.
Infrastructure & Security : Understanding of network stack, security best practices, and experience with ITSM and Atlassian tools.
Communication Skills : Fluent in English, with excellent communication abilities to collaborate across diverse teams.
#J-18808-Ljbffr
Site Reliability Engineer,
Free
Site Reliability Engineer,
Portugal,
Modificado May 4, 2025
Descrição
Detalhes do trabalho
⇐ Trabalho anterior |
Próximo trabalho ⇒ |
Propaganda