Team Lead SRE (m/f/d)

Festanstellung, Vollzeit · Stuttgart, GER

Purchasing 4.0: Fully Digitized & Networked?

At Onventis we assign teams to work with the customer needs, we don't give them specific solutions to implement.
We strive to understand the problem first, LIVE it, and deliver the products that make a difference for our end users.
Being part of the journey to create the products that customers love and work with daily, is a great joy and experience.
You will make it happen as part of the empowered team, aligned with the business, autonomous enough to push back and make your own decisions, with a clear purpose in mind.
We think it's fun to come to work every day, and we think you will too!

Your job

Team Building, Leadership, and Development: Build and scale an effective SRE team responsible for managing platform operations across the organization, fostering a culture of continuous improvement and collaboration. Lead and mentor the team, conducting regular one-on-one meetings to provide feedback and performance evaluations. Set measurable goals for both team and individual growth, ensuring leadership development and operational excellence within the SRE team.
SRE Roadmap: Develop and maintain a clear SRE roadmap aligned with business and technology strategy. Execute the roadmap by planning resources and initiating strategic projects to enhance infrastructure reliability and operational efficiency.
Consolidate Operations: Take ownership of operations across all product areas, including integrating operations from acquired companies to ensure seamless performance and reliability.
Cloud Strategy and Operations: Oversee the strategic direction and operational management of cloud infrastructure, ensuring scalable, secure, and efficient operations. Lead the planning and execution of phased migrations to Azure, aligning with long-term cloud strategies and ensuring minimal disruption.
BCDR Planning: Lead the improvement and ongoing maintenance of the Business Continuity and Disaster Recovery (BCDR) plan, ensuring robust mechanisms are in place to mitigate service disruptions and data loss.
Monitoring and Observability: Optimize existing monitoring solutions to enhance visibility and ensure proactive incident management. Take full accountability for the end-to-end monitoring lifecycle.
Deployment Process Improvement: Streamline and enhance deployment pipelines for greater efficiency, reliability, and speed. Ensure adherence to best practices and automation across environments.
SRE Metrics & Performance Indicators: Implement and track key SRE metrics, including Service Level Indicators (SLIs), Service Level Objectives (SLOs), and error budgets, to quantitatively measure and improve system reliability.
Incident Management: Establish, refine, and oversee the incident management framework, ensuring a systematic approach to on-call rotation, incident detection, response, and post-mortem analysis.
Risk Management: Identify, assess, and mitigate operational risks, including security vulnerabilities, ensuring infrastructure is secure, compliant, and resilient to threats.
Security Management: Oversee the security posture of all infrastructure, ensuring that best practices and protective measures are implemented and maintained across systems.
Ops Help Desk (OHD) Oversight: Improve and manage the SRE Ops Help Desk process, establishing clear Service Level Agreements (SLAs) to ensure internal customer expectations are met. Continuously monitor and enhance help desk performance.
Team Process Management: Lead regular team planning sessions, retrospectives, and process refinements to foster continuous improvement, transparency, and the ability to adapt to organizational needs.
On-Call Schedule Management: Build and maintain the on-call schedule, ensuring adequate coverage, effective incident management, and balanced workloads for the team.
Reporting: Provide transparency and regular reports to leadership, ensuring visibility into SRE performance, incident trends, and team progress against strategic goals.
Stakeholder and Partner Collaboration: Serve as the primary point of contact with internal stakeholders, partners, and hardware vendors. Manage relationships and expectations, ensuring service delivery aligns with organizational needs.
Documentation: Ensure comprehensive documentation of systems, processes, and procedures to foster knowledge sharing and maintain operational consistency.

Your Profile

Proven experience in a senior SRE or similar leadership role, with a strong track record of managing and scaling technical teams.
Extensive knowledge of SRE principles, cloud infrastructure (particularly Azure), and automation tools (CI/CD, GitOps, etc.).
Strong leadership and communication skills, with the ability to inspire, mentor, and grow a high-performing technical team.
Hands-on technical expertise in areas such as cloud infrastructure, container orchestration (Kubernetes, Docker), CI/CD pipelines, and Infrastructure as Code (Terraform, Ansible).
Experience in designing and executing BCDR plans and managing incident response and post-mortem processes.
Strong understanding of SRE metrics (SLI, SLOs, error budgets) and their role in maintaining system reliability.
Ability to collaborate with stakeholders at all levels and communicate effectively with both technical and non-technical audiences.
Proven experience in managing cloud migrations and aligning technical operations with business objectives.

We offer you

Become part of the Onventis cloud platform for buyers and suppliers. Our solutions are used by 1,2 million registered users in 21 languages worldwide. A dynamic environment with flat hierarchies, fast decision-making processes and innovative development projects with creative freedom are included!

In addition, we offer you:

Time to think and collaborate with your team to create the best features.
Work alongside platform teams who care and drive success.
Focus on your job with a high-class workstation, dual monitors, and licensed software.
Experience our CORE culture in action: Customer-oriented, Open-minded, Responsible, Excellent.
Biannual feedback sessions on your expectations and performance.
Annual salary reviews to ensure your growth is rewarded.
Regular informal 1-1 meetings with your manager to stay connected.
Professional growth through collaboration with technical architects and senior colleagues.
Access to workshops, conferences, and an e-learning platform for continuous learning

Auf diese Stelle bewerben

About us

"We work every day to make global procurement simple, secure and connected with cloud services". At Onventis, we are driven by a common goal: to provide the best digital network for buyers and suppliers. We are a software pioneer that makes online procurement possible for companies. In addition to digital technology, this requires great ideas for the procurement management of tomorrow. Who makes them a reality? The Onventis team from Research & Development, IT, Product Management, Consulting & Services, Support, Sales, Marketing and Finance & Administration makes purchasing fit for the future for our internationally active customers, such as Conrad, Federal Mogule, Kühne & Nagel, Schott and Steigenberger - every day. Expressed in figures: From 6 locations - Stuttgart, Düsseldorf, The Hague, Stockholm, Paris and Vienna - around 190 colleagues ensure every day that around 1,2 million users process an annual business volume of 20 billion euros and more via the Onventis platform. The flat hierarchies, the broad scope for creativity and our interlinked teams create the space for innovation and personal development. Show us that you are a digital talent and make the Onventis success story yours.

Auf diese Stelle bewerben

Deine Bewerbung

Wir freuen uns über Dein Interesse, bei Onventis einzusteigen!
Zeig uns, wer du bist und fülle das folgende Formular aus.
Solltest du Schwierigkeiten mit dem Upload deiner Daten haben, wende dich gerne per Email an recruiting@onventis.de