At a Glance
- Tasks: Support and enhance our observability tools while troubleshooting incidents and improving workflows.
- Company: Join DRW, a leading trading firm with over 30 years of innovative market experience.
- Benefits: Enjoy a dynamic work environment with opportunities for growth and learning every day.
- Why this job: Be part of a high-expectation culture that values integrity, innovation, and teamwork.
- Qualifications: 5+ years in the industry with coding skills and familiarity with logging tools required.
- Other info: Experience with Kubernetes and observability tools like Splunk will make you stand out.
The predicted salary is between £48,000 and £72,000 per year.
DRW is a diversified trading firm with over three decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture opportunities, so we operate using our own capital and trade at our own risk. Headquartered in Chicago with offices throughout the U.S., Canada, Europe, and Asia, we trade a variety of asset classes including Fixed Income, ETFs, Equities, FX, Commodities and Energy across all major global markets. We have also leveraged our expertise and technology to expand into three non-traditional strategies: real estate, venture capital and cryptoassets. We operate with respect, curiosity and open minds. The people who thrive here share our belief that it’s not just what we do that matters; it’s how we do it. DRW is a place of high expectations, integrity, innovation and a willingness to challenge consensus.
Our Observability team provides mission-critical support for many of the centralized logging, metrics and tracing tools used throughout the firm. They manage the deployment and administration of these applications, ensuring multi-tenant and highly available operation. They also work with other teams to help them use these tools effectively and get the most out of the data produced. It's a fast-paced, dynamic environment that constantly presents new technical challenges and demands that you learn new things daily.
What you will do in this role:
- Provide best-in-class support for our suite of applications
- Troubleshoot production system incidents and create artifacts for postmortems so that similar failures are avoided in the future
- Develop automation to facilitate administrative tasks supporting the onboarding and maintenance of various users and groups
- Test and automate upgrades of our applications to remain on our vendor's latest releases
- Continuously improve our own logging, monitoring and alerting practices
- Interact with vendor support to debug and drive third-party issues to resolution
- Interface with other teams to be an ambassador of good observability practices
- Help teams identify data to ingest and how to make use of this data through dashboards and alerting
Required Experience:
- 5+ years of industry experience using various logging and monitoring tools
- Coding experience to automate repetitive tasks
- Familiarity with CI/CD systems and workflows
- Familiarity with Git or other version control systems
- Persistent drive to improve workflows and make things better
- Ability to troubleshoot complex problems
- Solid written and verbal communication skills
- Ability to work well on a team as well as independently
What will make you stand out:
- Experience using Splunk, Grafana, Prometheus and other observability tools
- Experience using Kubernetes to deploy and maintain systems
- Experience using Jsonnet or other templating tools to render complex YAML/JSON
- Familiarity with GitOps workflows
- Solid configuration management concepts and skills
Observability Site Reliability Engineer employer: DRW Holdings, LLC.
Contact Detail:
DRW Holdings, LLC. Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land the Observability Site Reliability Engineer role
✨Tip Number 1
Familiarise yourself with the specific observability tools mentioned in the job description, such as Splunk, Grafana, and Prometheus. Having hands-on experience or even personal projects showcasing your skills with these tools can set you apart from other candidates.
✨Tip Number 2
Engage with the community around observability and site reliability engineering. Join forums, attend meetups, or participate in online discussions to learn from others and share your insights. This not only enhances your knowledge but also helps you network with professionals in the field.
✨Tip Number 3
Demonstrate your problem-solving skills by preparing for potential technical interviews. Practice troubleshooting complex problems related to logging and monitoring systems, as this role requires a solid ability to diagnose and resolve issues effectively.
✨Tip Number 4
Showcase your coding abilities by automating tasks relevant to observability. Create small scripts or tools that demonstrate your proficiency in coding and automation, as this is a key requirement for the role and will highlight your proactive approach to improving workflows.
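As an illustration of the kind of small script this tip has in mind, here is a minimal sketch that counts log lines per severity level and prints the totals in the Prometheus text exposition format. The log line layout (`<timestamp> <LEVEL> <message>`), the metric name `log_lines_total` and the function names are assumptions made for this example; they are not taken from the job description or from DRW's tooling.

```python
import re
from collections import Counter

# Assumed log layout for this sketch: "<timestamp> <LEVEL> <message>"
LINE_RE = re.compile(r"^\S+\s+(DEBUG|INFO|WARN|ERROR)\s")


def count_levels(lines):
    """Count log lines per severity level, skipping lines that do not match."""
    counts = Counter()
    for line in lines:
        match = LINE_RE.match(line)
        if match:
            counts[match.group(1)] += 1
    return counts


def to_prometheus_text(counts, metric="log_lines_total"):
    """Render the counts as a counter in the Prometheus text exposition format."""
    out = [f"# TYPE {metric} counter"]
    for level in sorted(counts):
        out.append(f'{metric}{{level="{level.lower()}"}} {counts[level]}')
    return "\n".join(out) + "\n"


if __name__ == "__main__":
    # Hypothetical sample input; a real script would read from a file or stdin.
    sample = [
        "2024-01-01T00:00:00Z INFO service started",
        "2024-01-01T00:00:01Z ERROR upstream timeout",
        "not a structured line",
        "2024-01-01T00:00:02Z ERROR upstream timeout",
    ]
    print(to_prometheus_text(count_levels(sample)), end="")
```

A project like this is small enough to walk through in an interview, and it touches both coding and observability concepts at once.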
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights relevant experience, especially in observability tools like Splunk and Grafana. Emphasise your coding skills and any experience with CI/CD systems, as these are crucial for the role.
Craft a Compelling Cover Letter: In your cover letter, express your passion for technology and problem-solving. Mention specific examples of how you've improved workflows or resolved complex issues in previous roles to demonstrate your fit for DRW's dynamic environment.
Showcase Communication Skills: Since solid written and verbal communication skills are essential, consider including a brief section in your application that illustrates your ability to communicate technical concepts clearly, perhaps through a project summary or a team collaboration example.
Highlight Continuous Learning: DRW values a willingness to learn new things daily. In your application, mention any recent courses, certifications, or self-directed learning you've undertaken related to observability or system administration to show your commitment to professional growth.
How to prepare for a job interview at DRW Holdings, LLC.
✨Showcase Your Technical Skills
Be prepared to discuss your experience with logging and monitoring tools like Splunk, Grafana, and Prometheus. Highlight specific projects where you used these tools to solve complex problems or improve workflows.
✨Demonstrate Problem-Solving Abilities
Expect to be asked about troubleshooting incidents. Prepare examples of past challenges you've faced, how you approached them, and the outcomes. This will show your analytical skills and persistence in resolving issues.
✨Communicate Clearly
Since solid communication skills are essential, practice explaining technical concepts in a clear and concise manner. Be ready to discuss how you’ve collaborated with other teams and shared observability best practices.
✨Emphasise Continuous Improvement
DRW values a drive to improve workflows. Share instances where you identified inefficiencies and implemented solutions, especially through automation or CI/CD practices. This will demonstrate your proactive mindset.