Your role as a SRE professional, with a specialism in Infrastructure Engineering, in the Secure Development SRE Team is responsible for supporting BT to be in the best position to deliver the service performance, reliability and availability that internal and external customers expect.
We are a SRE team responsible for implementing, running and supporting a diverse range of tools and applications which support the management of our core IT infrastructure and used by BT to secure and protect our Networks.
This includes tools which manage network and IT security, Physical Security and compliance, automated software delivery and software discovery; as well as several other IT estate management functions.
• Supports the implementation of new software development life cycle automation tools, frameworks, and code pipelines (continuous integration/continuous delivery pipelines), helps to elevate the organisation using best practices with a focus on the re-use of application code, demonstrates consistent software delivery practices and produces continuous integration/continuous delivery platform solutions using Amazon Web Services Cloud, infrastructure as code (IaC), GitOps, and container technologies.
• Supports teammates and engineering teams to identify and implement requirements for building a high-end developer experience enabling quick, autonomous, and secure delivery of production changes.
• Supports the maintenance of monitoring tooling used to optimise systems for uptime, performance, and reliability.
• Executes tests to investigate how the infrastructure handles failure and scaling.
• Supports the execution of approaches that scale systems sustainably through automation mechanisms and evolves systems by pushing for changes that improve reliability and velocity.
• Supports the delivery of infrastructure as code software to improve the availability, scalability, latency, and efficiency of services.
• Executes quality control/quality assurance on new clusters and software deployments.
• Supports the operation and management of distributed storage architecture.
• Monitors queue and support processing to support in the identification of early warning of support issues.
• Supports in the implementation of ways to improve working processes within the area of site reliability engineering responsibility, such as contributing to the design of continuous integration/continuous delivery systems.
Mandatory:
• Broad technical experience across a range of Programming Languages (e.g, Python, GoLang)
• Broad technical experience across a range of IT infrastructure disciplines (eg, networks, datacentre infrastructure, operating systems etc)
• Experience of Continuous Improvement
• Strong experience of communicating complex detail to technical and non-technical audiences
• Working with wider programme and delivery organisations
Preferred: