Systems Reliability Engineer

3 weeks ago


Lisboa, Lisboa, Portugal TN Portugal Full-time

Systems Reliability Engineer (Full-remote), Lisbon

Client:

Noesis

Location:

Lisbon, Portugal

Job Category:

Other

EU work permit required:

Yes

Job Reference:

910708a83947

Posted:

22.03.2025

Expiry Date:

06.05.2025

Job Description:

Noesis is looking for candidates with the following profile:

Main Tasks and Responsibilities:

  1. Lead and onboard services and teams to the reliability tenets;
  2. Establish and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs);
  3. Design and implement scalable, reliable, and secure infrastructure, while ensuring cloud-native best practices;
  4. Collaborate with software development teams to ensure systems are resilient (observable, fault-tolerant, recoverable, scalable) and performant;
  5. Implement monitoring, alerting, logging, and tracing solutions to detect and respond to incidents;
  6. Lead incident response efforts, ensuring quick resolution and minimal downtime, and conduct RCA/post-mortems;
  7. Automate every operational task, with a special focus on fast incident detection & recovery;
  8. Foster a culture of continuous improvement and knowledge sharing;
  9. Communicate effectively with stakeholders, providing updates on system reliability and performance.

Requirements:

  1. BSc or MSc in Software Engineering, Computer Science, or a related field;
  2. 2+ years of experience in a similar role or experience as a senior systems administrator;
  3. Proficiency in at least one high-level programming language (C++, Python, Java, C#, etc.);
  4. Experience with automation tools;
  5. Experience with Grafana, ELK stack, Prometheus, or others;
  6. Strong troubleshooting and debugging skills;
  7. Strong understanding of designing resilient systems;
  8. Expertise in debugging complex distributed systems;
  9. Fluency in English and excellent communication skills;
  10. Willingness to participate in an on-call rotation providing 24/7 support for production systems under a 'Follow the Sun' model.

Experience in any of the following is valued but not required:

  • Containerization technologies and orchestration platforms, mainly Kubernetes and EKS (CKA, CKAD, CKS certifications are valued);
  • Familiarity with AWS services;
  • Experience with automation and Infrastructure as Code (IaC) tools, such as AWS CloudFormation, Terraform, etc.

If you meet these conditions and would like to join an innovative organization that continuously invests in training its talents, send us your application.

Join us. Let's innovate together.

All our recruitment and selection processes are based on equal opportunities, valuing the competence and potential of each person and ensuring that no candidate is discriminated against on the grounds of gender, ethnicity, sexual orientation, age, religion or physical condition.
