At a Glance
- Tasks: Support and enhance our observability tools while troubleshooting production incidents.
- Company: Join DRW, a leading trading firm with over 30 years of innovative market experience.
- Benefits: Enjoy a dynamic work environment with opportunities for growth and learning every day.
- Why this job: Be part of a fast-paced team that values curiosity, integrity, and innovation in technology.
- Qualifications: 5+ years in the industry with coding skills and familiarity with logging tools required.
- Other info: Experience with Kubernetes and observability tools like Splunk will make you stand out.
The predicted salary is between 48000 - 72000 £ per year.
DRW is a diversified trading firm with over 3 decades of experience bringing sophisticated technology and exceptional people together to operate in markets around the world. We value autonomy and the ability to quickly pivot to capture opportunities, so we operate using our own capital and trading at our own risk.
Headquartered in Chicago with offices throughout the U.S., Canada, Europe, and Asia, we trade a variety of asset classes including Fixed Income, ETFs, Equities, FX, Commodities and Energy across all major global markets. We have also leveraged our expertise and technology to expand into three non-traditional strategies: real estate, venture capital and cryptoassets.
We operate with respect, curiosity and open minds. The people who thrive here share our belief that it’s not just what we do that matters–it\’s how we do it. DRW is a place of high expectations, integrity, innovation and a willingness to challenge consensus.
Our Observability team provides mission critical support for many of our centralized logging, metrics and tracing tools used throughout the firm. They manage the deployment and administration of these applications ensuring multi-tenant and highly available operation. In addition, they help interface with other teams to effectively use these tools to get the most out of the data produced. It\’s a fast-paced, dynamic environment that provides new technical challenges constantly and demands that you learn new things daily.
What you will do in this role:
- Provide best in class support for our suite of applications
- Troubleshoot production system incidents and create artifacts for postmortems to ensure that similar failures in the future are avoided
- Develop automation to facilitate administrative tasks supporting the onboarding and maintenance of various users and groups
- Test and automate upgrades of our applications to remain on our vendor\’s latest releases
- Constantly improve our own logging, monitoring and alerting practices
- Interact with vendor support to debug and drive third-party issues to resolution
- Interface with other teams to be an ambassador of good observability practices
- Help teams identify data to ingest and how to make use of this data through dashboards and alerting
Required Experience:
- 5+ years of industry experience using various logging and monitoring tools
- Coding experience to automate repetitive tasks
- Familiarity with CI/CD systems and workflows
- Familiarity with git or other version control systems
- Persistent drive to improve workflows and make things better
- Ability to troubleshoot complex problems
- Solid written and verbal communication skills
- Ability to work well on a team as well as independently
What will make you stand out:
- Experience using Splunk, Grafana, Prometheus and other observability tools
- Experience using Kubernetes to deploy and maintain systems
- Experience using Jsonnet or other templating tools to render complex YAML/JSON
- Familiarity with GitOps workflows
- Solid configuration management concepts and skills
#J-18808-Ljbffr
Observability Site Reliability Engineer employer: DRW
Contact Detail:
DRW Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Observability Site Reliability Engineer
✨Tip Number 1
Familiarize yourself with the specific observability tools mentioned in the job description, like Splunk, Grafana, and Prometheus. Having hands-on experience or projects showcasing your skills with these tools can set you apart from other candidates.
✨Tip Number 2
Highlight any experience you have with CI/CD systems and workflows. Being able to demonstrate your understanding of these processes will show that you can effectively integrate observability practices into the development lifecycle.
✨Tip Number 3
Showcase your coding skills by sharing examples of automation you've developed to streamline administrative tasks. This could be through personal projects or contributions to open-source, which will illustrate your ability to improve workflows.
✨Tip Number 4
Prepare to discuss your troubleshooting experiences in detail. Be ready to share specific incidents where you successfully resolved complex problems, as this will demonstrate your critical thinking and problem-solving abilities.
We think you need these skills to ace Observability Site Reliability Engineer
Some tips for your application 🫡
Understand the Role: Take the time to thoroughly read the job description for the Observability Site Reliability Engineer position at DRW. Understand the key responsibilities and required experience, as this will help you tailor your application to highlight relevant skills.
Highlight Relevant Experience: In your CV and cover letter, emphasize your 5+ years of industry experience with logging and monitoring tools. Be specific about your coding experience and any familiarity with CI/CD systems, as these are crucial for the role.
Showcase Problem-Solving Skills: Provide examples in your application that demonstrate your ability to troubleshoot complex problems. Mention any past experiences where you successfully resolved incidents or improved workflows, as this aligns with DRW's expectations.
Communicate Clearly: Ensure that your written communication is clear and concise. Highlight your solid written and verbal communication skills, as effective communication is essential for collaborating with other teams and interfacing with vendor support.
How to prepare for a job interview at DRW
✨Showcase Your Technical Skills
Be prepared to discuss your experience with logging and monitoring tools like Splunk, Grafana, and Prometheus. Highlight specific projects where you utilized these tools to solve complex problems or improve workflows.
✨Demonstrate Problem-Solving Abilities
Expect to be asked about past incidents you've troubleshot. Prepare examples that showcase your analytical thinking and how you approached resolving production system issues, including any postmortem artifacts you created.
✨Emphasize Team Collaboration
DRW values teamwork, so be ready to discuss how you've worked with cross-functional teams in the past. Share experiences where you acted as an ambassador for good observability practices and how you facilitated communication between teams.
✨Highlight Continuous Improvement Mindset
Talk about your persistent drive to enhance workflows and processes. Provide examples of automation you've developed to streamline administrative tasks or how you've improved logging, monitoring, and alerting practices in previous roles.