Resilience Engineer LISBOA, Portugal D.Connectivity Engineering Posted 14 hours ago
3 weeks ago
## Resilience Engineer

LISBOA, Portugal

By connecting people, places, and things, Vodafone IoT enables organisations to thrive in the digital world. Leveraging our expertise in connectivity, our advanced IoT platform, and our extensive global reach, we deliver the results necessary for our customers' progress and success. We support businesses of all sizes and sectors in their efforts to connect for a better future.

The Vodafone Internet of Things (IoT) suite of products and services is specifically designed to meet the demands of emerging business verticals. Our connection base has grown 20% year over year, reaching over 200 million connections by the end of the financial year 2025. Vodafone IoT has been a leader in the IoT Connectivity Gartner Magic Quadrant ten consecutive times. To address the technological needs of IoT, Vodafone has developed an industry-leading IoT Connectivity Management Platform, targeting key strategic growth opportunities to meet the global requirements of IoT customers.

Vodafone has also carved out the IoT Connectivity business to secure additional external investment and maintain our leading position in the industry through the following:

1. Continue accelerating and enhancing our Platform as a Service for Vodafone customers on footprint.
2. Introduce service propositions in markets beyond Vodafone's current footprint.
3. Address the long-tail, lower-volume segment globally through a digital self-service platform.

We are seeking a senior Resilience Engineer to own and evolve the stability, availability, and recoverability of our IoT platforms. This role operates at the intersection of system architecture, reliability engineering, and operational excellence, with end-to-end accountability for designing resilience into our services. You will define and govern resilience strategies, influence platform architecture, and partner across product, infrastructure, and engineering teams to ensure our systems continue to perform under failure, scale, and unexpected disruption.

* Developing and governing resilience strategies across system architecture, deployment, monitoring, and incident response;
* Defining and tracking stability KPIs (e.g., MTTD, MTTR, error budgets), partnering with performance and operations teams to meet or exceed targets;
* Designing and implementing fault injection testing, chaos engineering practices, and scenario-based simulations to validate platform robustness;
* Collaborating with product, infrastructure, architecture, and development teams to redesign services with built-in redundancy, failover, and graceful degradation;
* Driving automation and observability improvements to reduce noise, increase fault detection speed, and support predictive failure mitigation;
* Contributing to the design and maintenance of our Business Continuity and Disaster Recovery (BCDR) Plan, ensuring IoT systems remain resilient and recoverable in the face of unexpected disruptions;
* Owning the resilience roadmap and continuously assessing emerging threats, technologies, and architectural shifts to guide the evolution of stability practices;
* Evangelizing a culture of resilience through internal communication, workshops, and post-incident learning programs;
* Delivering high-quality engineering solutions while continuously strengthening the resilience, scalability, and cost efficiency of our IoT platform;
* Consistently meeting or exceeding delivery expectations by prioritizing the highest-leverage resilience initiatives that improve customer experience, business outcomes, and financial performance;
* Building trusted, transparent, and outcome-driven relationships by providing clear technical direction and trade-off recommendations to business and engineering stakeholders.

## **Who you are**

* Educated to BSc degree level in Software Engineering or a related Computer Science discipline;
* Strong scripting and automation experience (e.g., Python, Bash, Go, PowerShell), with a demonstrated ability to replace manual processes with reliable, scalable automation;
* Proven experience designing and operating high-availability, fault-tolerant systems, including the use of chaos engineering techniques and proactive failure-mitigation strategies;
* Experience applying Business Continuity and resilience standards (e.g., ISO 22301) in the context of real-world platform design and operational readiness;
* Hands-on experience designing or integrating monitoring, alerting, and automated testing frameworks to support early fault detection and system validation;
* Broad experience working with Linux-based platforms across on-premises and cloud environments, with an understanding of how infrastructure choices impact reliability, scalability, and recovery;
* Deep expertise in Site Reliability Engineering principles, including SLOs/SLIs, error budgets, observability, toil reduction, and automation, with the ability to apply them at platform and system scale to guide architectural decisions and long-term resilience strategy;
* Proven ability to balance long-term platform stability with delivery velocity by making clear, data-driven trade-offs;
* Strong understanding of security principles, practices, and standards, and the ability to incorporate them into resilient, real-world technical solutions;
* Deep command of telemetry, logging, and alerting ecosystems (e.g., Prometheus, Grafana, ELK, Datadog, Splunk), with the ability to design signals that enable early fault detection and informed decision-making;
* Experience defining meaningful SLIs and building dashboards that drive architectural insight, prioritization, and corrective action;
* Proven experience leading blameless post-incident reviews, root cause analysis, and systemic improvements across multiple teams;
* Expertise in identifying and addressing system bottlenecks, latency issues, and throughput constraints in distributed environments;
* Proficiency in forecasting demand, planning capacity, and managing system growth in a cost-efficient and sustainable manner;
* Strong track record of partnering with software engineering, infrastructure, product, and business teams to embed resilience into the full development lifecycle;
* Fluency in English.

We are a leading international Telco, serving millions of customers. At Vodafone, we believe that connectivity is a force for good. If we use it for the things that really matter, it can improve people's lives and the world around us. Through our technology we empower people, connecting everyone regardless of who they are or where they live, and we protect the planet whilst helping our customers do the same.

Belonging at Vodafone isn't a concept; it's lived, breathed, and cultivated through everything we do. You'll be part of a global and diverse community, with many different minds, abilities, backgrounds, and cultures. We're committed to increasing diversity, ensuring equal representation, and making Vodafone a place where everyone feels safe, valued, and included.

If you require any reasonable adjustments or have an accessibility request as part of your recruitment journey, for example, extended time or breaks in between online assessments, please refer to for guidance.

Together we can.

Top skills: Ansible
-
Lisboa, Portugal Vodafone Group Plc Full-time Hybrid
System Reliability Engineer, LISBOA, Portugal. Join Us: At Vodafone, we’re not just shaping the future of connectivity for our customers – we’re shaping the future for everyone who joins our team. When you work with us, you’re part of a global mission to connect people, solve complex challenges, and create a sustainable and more inclusive...
-
Senior IoT Resilience Engineer
2 weeks ago
Lisboa, Portugal Vodafone Group Plc Full-time
A leading international telecommunications company is seeking a Senior Resilience Engineer in Lisbon, Portugal. The successful candidate will evolve the stability and recoverability of IoT platforms, driving automation and influencing architecture across teams. Candidates must have a BSc in Software Engineering, strong scripting skills, and experience in...
-
Operational Resilience Lead
7 days ago
Lisboa, Portugal Hiscox Full-time
Job Type: Permanent. Build a brilliant future with Hiscox. About us: Hiscox is a diversified international insurance group with a powerful brand, strong balance sheet, and plenty of room to grow. Listed on the London Stock Exchange and headquartered in Bermuda, Hiscox has over 3,000 staff across 14 countries and 34 offices. Structured by geography and...
-
Senior FLM Service Expert
3 weeks ago
Lisboa, Portugal Vodafone Group Plc Full-time
Senior FLM Service Expert (m/f/d) for Vantage Towers, LISBOA, Portugal. You will develop and implement harmonized operational processes across all local markets in close collaboration with stakeholders; you will act as the Process Owner within TopCo and drive adoption in local markets and vendors, leveraging the Process Governance Framework. Define...
-
Manager New Business
1 week ago
Lisboa, Portugal Vodafone Group Plc Full-time
Manager New Business & Partnerships, LISBOA, Portugal. Join Us: At Vodafone, we’re not just shaping the future of connectivity for our customers – we’re shaping the future for everyone who joins our team. When you work with us, you’re part of a global mission to connect people, solve complex challenges, and create a sustainable and more inclusive...
-
Distributed Systems Software Engineer, Python
2 weeks ago
Lisboa, Portugal Canonical Full-time
Distributed Systems Software Engineer, Python / Go. We are seeking a software...
-
Data Engineer/ Lead Data Engineer
2 weeks ago
Lisboa, Portugal Ibrowse Full-time
Location (district, region): Lisboa. Role: 1) Data Engineer, 2) Lead Data Engineer. Level: SR. Start date: Immediate. Contract duration: 1 year. English: B2. Required qualifications to be successful for both roles (Senior/Lead profiles): Technical Skills - Strong experience in end-to-end data architecture and design, including ingestion,...
-
Solution Architect LISBOA, Portugal Identity
2 weeks ago
Lisboa, Portugal Vodafone Group Plc Full-time
Solution Architect, LISBOA, Portugal. At Vodafone, we’re not just shaping the future of connectivity for our customers – we’re shaping the future for everyone who joins our team. When you work with us, you’re part of a global mission to connect people, solve complex challenges, and create a sustainable and more inclusive world. If you want to grow...
-
Mushroom Farm Workers
3 days ago
Lisboa, Portugal Dolphin Posted Full-time
Dolphin Posted, an HR and outsourcing company, is recruiting mushroom farm workers for its client company in Sweden. **Requirements**: basic knowledge of English; driving licence (preferred); good physical condition; valid passport; experience in agricultural work. **We offer**: Accommodation...
-
Waiters/Counter Staff
2 weeks ago
Lisboa, Portugal Dolphin Posted Full-time
Dolphin Posted, an HR and outsourcing company, is recruiting waiters/counter staff for its client company in Sweden. **Requirements**: advanced knowledge of English (spoken and written); driving licence (preferred); good physical condition; at least 2 years of proven experience in the role. **We offer**: Accommodation...