At a Glance
- Tasks: Ensure reliability and performance of our cloud trading platform while automating processes.
- Company: Join Valstro, a dynamic fintech company with a focus on innovation.
- Benefits: Enjoy unlimited PTO, competitive pay, and a solid pension plan.
- Other info: Remote role with opportunities for professional growth and collaboration.
- Why this job: Make a real impact in a fast-paced environment with cutting-edge technology.
- Qualifications: 3+ years in site reliability engineering and proficiency in cloud infrastructure.
The predicted salary is between 60000 - 80000 £ per year.
Valstro is looking for a Site Reliability Engineer (SRE) to join our team! This person will help ensure the reliability, availability, and performance of our cloud native trading platform. The role entails building and maintaining infrastructure, automating processes, and working closely with the Development and Platform teams to ensure seamless integration and deployment of the service.
The successful candidate will serve as an essential link between the wider organization, executive leadership, and external vendors. Their responsibilities will include:
- Ensuring system reliability.
- Building and maintaining monitoring solutions for both production and UAT systems.
- Automating operational tasks.
- Responding to incidents.
- Continuously improving systems and processes.
This is a remote position that will report to the Site Reliability Lead.
What will you be doing?
- Act as a key intermediary between engineering, executive leadership, and external vendors.
- Ensure the reliability, availability, and performance of our cloud-based trading solutions.
- Develop and maintain monitoring solutions to track system performance and reliability.
- Automate operational tasks to improve efficiency and reduce manual intervention.
- Collaborate with development teams to ensure seamless integration and deployment.
- Respond to incidents and troubleshoot issues to minimize downtime.
- Continuously improve systems and processes to enhance reliability and performance.
- Participate in on-call rotations to provide 24/7 support for critical systems.
Requirements
- 3+ years experience supporting Production level systems.
- Strong experience in site reliability engineering, systems engineering, or a related field.
- Proficiency in cloud-based infrastructure (e.g. AWS, Azure, or Google Cloud).
- Experience with monitoring and logging tools (e.g., ELK, LGTM, Prometheus, Datadog).
- Expertise in automation and scripting (e.g., Golang, Python, Bash, Terraform).
- Knowledge of containerization and orchestration (e.g., Docker, Kubernetes).
- Ability to effectively communicate and liaise between stakeholders, including internal teams, executive management, and external vendors.
- Strong troubleshooting and problem-solving skills.
- Experience in establishing and enhancing reliability engineering practices and processes.
- Capable of operating effectively in a dynamic organizational environment with high delivery and quality expectations.
Fintech = bonus
Technical
- A recent bachelor's degree in Computer Science, Software Engineering or related field.
- Knowledge of SREing.
- Knowledge of observability and tooling particularly the Grafana stack.
Benefits
Valstro offers an excellent benefits package, including pension or 401 (k) plans, unlimited PTO and highly competitive compensation. Our leadership team brings a wealth of experience and deep industry knowledge, and despite being a young company, we believe we have carefully dialed in our product-market fit.
Site Reliability Engineer (SRE) in London employer: Valstro
Contact Detail:
Valstro Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Site Reliability Engineer (SRE) in London
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, attend meetups, and connect with people on LinkedIn. You never know who might have the inside scoop on job openings or can put in a good word for you.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repository showcasing your projects, especially those related to site reliability engineering. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by brushing up on common SRE scenarios and problem-solving questions. Practice explaining your thought process clearly, as communication is key when liaising between teams and stakeholders.
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining our team at Valstro.
We think you need these skills to ace Site Reliability Engineer (SRE) in London
Some tips for your application 🫡
Tailor Your CV: Make sure your CV is tailored to the Site Reliability Engineer role. Highlight your experience with cloud-based infrastructure and automation tools, as these are key for us at Valstro. We want to see how your skills align with our needs!
Craft a Compelling Cover Letter: Your cover letter is your chance to shine! Use it to explain why you're passionate about site reliability engineering and how you can contribute to our team. We love seeing genuine enthusiasm and a clear understanding of our mission.
Showcase Relevant Projects: If you've worked on any projects that demonstrate your expertise in monitoring solutions or automation, make sure to include them. We appreciate candidates who can show real-world applications of their skills, especially in a fintech context.
Apply Through Our Website: We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about our company culture!
How to prepare for a job interview at Valstro
✨Know Your Tech Stack
Make sure you’re well-versed in the technologies mentioned in the job description, like AWS, Azure, or Google Cloud. Brush up on your knowledge of monitoring tools like ELK and Prometheus, as well as automation scripting in Golang or Python. Being able to discuss these confidently will show that you're ready to hit the ground running.
✨Showcase Your Problem-Solving Skills
Prepare to discuss specific incidents where you've had to troubleshoot issues in production systems. Use the STAR method (Situation, Task, Action, Result) to structure your answers. This will help demonstrate your strong problem-solving skills and how you can minimise downtime effectively.
✨Communicate Like a Pro
Since this role involves liaising between various stakeholders, practice articulating complex technical concepts in simple terms. Think about examples where you’ve successfully communicated with both technical teams and non-technical stakeholders. This will highlight your ability to bridge gaps and ensure seamless collaboration.
✨Emphasise Continuous Improvement
Be ready to talk about how you've contributed to improving systems and processes in your previous roles. Share specific examples of how you’ve automated tasks or enhanced reliability engineering practices. This shows that you’re not just about maintaining the status quo but are keen on driving progress.