Site Reliability Engineer
2 semanas atrás
Critical Manufacturing is dedicated to empowering high-performance operations to make Industry 4.0 a reality with the most innovative, comprehensive, and modular MES software. We have a global presence, but our headquarters, and the main technical center, are in Porto (Maia), Portugal, where we develop a state-of-the-art solution for Semiconductor, Electronics, Medical Devices, and Industrial Equipment.
Recognized as a Leader by Gartner, we are part of ASMPT, the world's largest supplier of best-in-class equipment, and technological process partner for the electronics and semiconductor industries.
The Role
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE you will be responsible for keeping an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation.
SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.
Requirements
What You Will Do
- Analyze and interpret distributed systems telemetry (metrics, logs, traces) to identify and address potential issues before they affect users
- Design, build, and maintain monitoring, alerting, and reliability tooling that improves system visibility and operational excellence
- Collaborate with software and infrastructure teams to improve resilience, scalability, and performance across our platform
- Participate in incident response and post-mortem analysis to ensure continuous learning and improvement
- Contribute to automation efforts that reduce toil and increase engineering productivity
What Success Looks Like
Within your first year, you will have:
- Improved reliability and observability of key production systems
- Reduced manual operational work by automating recurring processes
- Partnered effectively with development teams to embed SRE best practices into the software lifecycle
- Shaped scalable approaches to telemetry, monitoring, and incident response
Why Join Us
- Be part of a company shaping the future of manufacturing software
- Enjoy the freedom to experiment, innovate, and create systems that will last
- Join a team where storytelling, strategy, and technology meet to make Industry 4.0 real
What You Will Bring
- More than 2 years of experience in the role of Site Reliability Engineer
- A passion for investigation and problem-solving—digging deep until you understand how things work
- Strong belief that telemetry is essential for system health and continuous improvement
- Excellent English skills - spoken and written
What we consider a plus (not mandatory):
- Experience with cloud infrastructure (e.g., Azure) or container orchestration platforms (e.g., Kubernetes, OpenShift)
- Familiarity with Docker, Terraform, and reverse proxies (e.g., Traefik)
- Hands-on experience designing, analyzing, and troubleshooting large-scale distributed systems
- Ability to debug, optimize performance, and automate repetitive tasks
- Strong problem-solving mindset and collaborative communication style
Diversity, Equity and Inclusion are a source of commitment and innovation
At Critical Manufacturing, we welcome and encourage applications from individuals of all backgrounds, regardless of disabilities, diverse abilities, identities, or experiences. Our commitment is to create an inclusive environment where everyone has equal opportunities to succeed and thrive.
If you need accommodation during the recruitment process, please let us know - we're happy to support you.
-
Maia, Portugal Critical Manufacturing Tempo inteiroA leading manufacturing software company in Porto is seeking a Site Reliability Engineer to enhance system performance and reliability. In this role, you will analyze telemetry to identify issues, design monitoring tools, and collaborate with teams. Candidates should have over two years of experience in SRE, strong problem-solving abilities, and excellent...
-
Site Reliability Engineer
1 semana atrás
Maia, Portugal Critical Manufacturing Tempo inteiroCritical Manufacturing is dedicated to empowering high-performance operations to make Industry 4.0 a reality with the most innovative, comprehensive, and modular MES software. We have a global presence, but our headquarters, and the main technical center, are in Porto (Maia), Portugal, where we develop a state-of-the-art solution for Semiconductor,...
-
Site Reliability Engineer
2 semanas atrás
Maia, Portugal Critical Manufacturing Tempo inteiroCritical Manufacturing is dedicated to empowering high-performance operations to make Industry 4.0 a reality with the most innovative, comprehensive, and modular MES software. We have a global presence, but our headquarters, and the main technical center, are in Porto (Maia), Portugal, where we develop a state-of-the-art solution for Semiconductor,...
-
Site Reliability Engineer
Há 6 dias
Maia, Portugal Critical Manufacturing Tempo inteiroCritical Manufacturing is dedicated to empowering high-performance operations to make Industry 4.0 a reality with the most innovative, comprehensive, and modular MES software. We have a global presence, but our headquarters, and the main technical center, are in Porto (Maia), Portugal, where we develop a state-of-the-art solution for Semiconductor,...
-
Site Reliability Engineer
1 semana atrás
Porto, Portugal Bitcoin Devs Company Tempo inteiroOverview The Site Reliability Engineer (SRE) plays a critical role in ensuring the reliability, availability, and performance of our systems. The SRE will work closely with development teams to optimize and enhance the infrastructure and deployment processes, ultimately driving a culture of reliability within the organization. Key responsibilities...
-
Site Reliability Engineer
Há 7 dias
Porto, Porto, Portugal Bitcoin Devs Company Tempo inteiroOverview The Site Reliability Engineer (SRE) plays a critical role in ensuring the reliability, availability, and performance of our systems. The SRE will work closely with development teams to optimize and enhance the infrastructure and deployment processes, ultimately driving a culture of reliability within the organization. Key...
-
Site Reliability Engineer
3 semanas atrás
Porto, Portugal Hexa Consulting Tempo inteiroAt Hexa Consulting we want to spread transparency and enable diverse tech careers. Based in Portugal, we can answer the increasing demand in the IT sector. Our mission is to build strong relationships, be a leading partner through a differentiated approach in IT consulting and contribute to the professional and personal development of our team. We work...
-
Site Reliability Engineer
1 semana atrás
Porto, Porto, Portugal Hexa Consulting Tempo inteiroAt Hexa Consulting we want to spread transparency and enable diverse tech careers. Based in Portugal, we can answer the increasing demand in the IT sector.Our mission is to build strong relationships, be a leading partner through a differentiated approach in IT consulting and contribute to the professional and personal development of our team.We work with...
-
Site Reliability Engineer
Há 4 dias
Porto, Portugal Global Partner HR Solutions Tempo inteiroSite Reliability Engineer (Mid-Level) About the Role We're looking for a Mid-Level Site Reliability Engineer who's ready to take the next step in their career. You've got solid technical foundations and you're ready to lead projects with the right support structure. This is your opportunity to grow from executing tasks to owning infrastructure initiatives....
-
Mid Site Reliability Engineer
1 semana atrás
Porto, Portugal KCS iT Tempo inteiroMid Site Reliability Engineer (SRE) @Porto We’re looking for the special, unique and amazing YOU! At KCS IT, we look for individuals who stand out, who strive for improvement, and who share our values: dedication, energy, integrity, transparency, flexibility, trust, honesty, hard work, and teamwork. We stand for equality and value diversity, creating a...