Senior Site Reliability Engineer at Thought Machine

About the job

ONSITE London, England

Open to new applications

Full-Time ~ Permanent

5 job requirements

1 year Ansible experience, used daily	Must Have
1 year Chef experience, used daily	Must Have
1 year Kubernetes experience, used daily	Must Have
1 year Puppet experience, used daily	Must Have
1 year Terraform experience, used daily	Must Have

General information

Job Title: Senior Site Reliability Engineer
City: London
Country: United Kingdom of Great Britain and Northern Ireland
Division: Engineering
Department: Infrastructure

Description

Thought Machine’s mission is bold - to properly and permanently rid the world’s banks of legacy technology. To achieve this, we have developed the foundations of modern banking through core and payments technology which run natively in the cloud. What we are attempting is hard and means we need great people working together to build great technology.

We have grown rapidly in the past few years, expanding our team to more than 550 people across offices in London, New York, Singapore and Sydney. We have raised more than $500m in funding and are now valued at $2.7bn. Our investors include Molten Ventures, Eurazeo, Intesa Sanpaolo, Temasek, Nyca Partners, JPMorgan Chase Strategic Investments, Standard Chartered Ventures, and more.

We have created a culture enabling our team to produce the best work in the industry, ensuring we have fun along the way. We’re regularly cited as having a fantastic workplace culture and have been recognised by Sifted magazine as having one of the highest Glassdoor ratings for a UK fintech company and the most generous employee share package in the industry. Global Finance Magazine named us one of the world’s most innovative fintechs, and the Financial Times recognised us as one of Europe’s fastest-growing companies in 2023.

Thought Machine’s Site Reliability Engineers are the guardians of mission-critical systems for the world’s most influential financial institutions. As a member of our elite, globally distributed team, you’ll be entrusted with running and maintaining the robust production infrastructure that powers our customers’ cutting-edge Core Banking and Payments platforms. This is an opportunity to make a tangible impact on the global financial landscape while collaborating with brilliant minds to solve complex engineering challenges.

This role will be part of the Site Reliability Engineering team at Thought Machine HQ in London, tackling the challenges of automating complex fleet management operations, mentoring team members, promoting communities of best practice within engineering, and designing operational processes that provide effective interfaces between Thought Machine and our SaaS customers. The SRE team is deeply involved in tackling the technical challenges of executing Thought Machine’s growth ambitions - expect to be working with senior stakeholders in the organisation and with our customers, and working on programmes and initiatives that are critical to the success of the company.

Duties

Supporting the product engineering teams in building highly fault-tolerant, scalable applications by participating in design discussions, engaging in RFCs and code reviews
Executing various department strategies - contributing to the design and scoping work for team members around disaster recovery, backup, redundancy and capacity planning activities
Being part of a support rota responsible for resolving alerts generated by proactive monitoring and working closely with client-facing roles to provide L2 support for client-initiated support requests
Performing regular maintenance of production systems that host Vault products
Driving the evolution of our SaaS products by defining and designing features that foster exceptional reliability and an unparalleled user experience
Implementing and regularly testing DR strategies to ensure the highest level of resilience and fault tolerance of the platform
Maintaining and promoting high-quality written documentation of assets, processes and runbooks that are used by the team in their day-to-day operations
Working with your manager to grow team members’ technical skills and their understanding of Vault products

Requirements

Essential

You possess an up-to-date understanding of design patterns relevant to hosting and networking architectures
You proactively champion product development, driven by a desire to build truly exceptional products, not just solve immediate challenges
You’re a high-agency individual who can independently drive projects to completion by effectively scaling your individual output with the appropriate delegation of work to team members
You have a strong background in Python, Golang or Java, having used one of these programming languages to deliver a sizeable project or initiative
You have experience working with Kubernetes or other container orchestration systems
You have expertise in one or more of the following areas: Database Administration, Networking, Observability Tools (such as Prometheus, Jaeger) or automation infrastructure
You have extensive experience working with either GCP or AWS

Desirable

Experience with automation/configuration management, e.g. Terraform, Puppet, Chef, Ansible

Benefits

Highly competitive salary
Pension plan (match up to 5%)
Life insurance - three times annual salary
Competitive maternity (six months fully paid) and paternity leave (four weeks fully paid)
Shared parental leave (matched to our maternity leave for the same point in time)
25 days holiday and bank holidays
Flexible working hours
Cycle-to-work scheme
Electric car scheme
Season ticket loan
Access to outstanding learning materials and courses
Sports and hobby clubs, subsidised by Thought Machine
All the latest tech you need
Start the day properly with fresh fruit and cereals
Huge range of healthy (and not-so-healthy) snacks, smoothies and drinks
A talented and experienced team as your colleagues
An environment where we encourage learning and progress
Two charity days a year
Weekly food pop-up

Thought Machine is committed to making a measurable positive impact on people’s everyday lives. We are an equal-opportunity employer and value diversity at our company.

We actively hire candidates who demonstrate technical excellence in their field and welcome people of all ages and backgrounds, providing everyone with equal access to professional development. You are encouraged to apply even if your experience doesn’t exactly match the job description. We also encourage applications from those with different abilities, including candidates with ADHD, autism, dyslexia or dyspraxia.
