At a Glance
- Tasks: Design and maintain cutting-edge network observability systems for a reliable GPU cloud network.
- Company: Join CoreWeave, a leader in innovative tech with a collaborative culture.
- Benefits: Enjoy competitive salary, family-level medical insurance, and tuition reimbursement.
- Other info: Flexible hybrid work environment with excellent career growth opportunities.
- Why this job: Make a real impact by empowering our network with advanced observability solutions.
- Qualifications: Experience with Prometheus, Grafana, Python, and network engineering in large-scale environments.
The predicted salary is between £60,000 and £80,000 per year.
We’re seeking a talented and experienced Senior Engineer for Network Observability to join our Network Observability team. In this role, you will be a key player in designing, developing, and maintaining the monitoring, telemetry, and observability systems that keep CoreWeave’s GPU cloud network operating reliably and at scale. You’ll focus on building solutions that provide real‑time insights into network performance, ensuring that issues are detected proactively and resolved quickly.
Your mission? To empower CoreWeave’s network with advanced observability: robust metrics, powerful analytics, and automated alerting—so well‑tuned that any anomalies become clear before they ever impact our customers.
- Develop, optimize, and maintain network observability platforms.
- Use your skills in Python and Golang to create and automate collectors, exporters, and dashboards that provide deep visibility into network health and performance.
- Collaborate with Network Engineering and Platform teams to ingest and unify logs, metrics, and events from a variety of platforms (Arista EOS, NVIDIA Cumulus Linux, Nokia SR OS, SR Linux, etc.) into a single observability pipeline.
- Design and implement scalable telemetry solutions using protocols like gNMI, SNMP, and streaming analytics.
- Ensure advanced alerting and anomaly detection with tools such as Prometheus, Grafana, and Alertmanager.
- Work closely with network developers, site reliability engineers, and security teams to integrate observability solutions across the broader infrastructure.
- Participate in design discussions, RFCs, and architectural decisions.
- Join a rotating on‑call schedule to troubleshoot and resolve observability‑related issues.
- Provide timely support to operations teams, quickly isolating and fixing problems when they arise.
- Guide junior team members, share best practices, and foster a culture of continuous learning and improvement within the observability domain.
Who You Are
- Deep familiarity with Prometheus, Grafana, Alertmanager, gNMI, and SNMP.
- Experience writing or extending custom metric collectors/exporters is a plus.
- Experience as a Network Engineer, SRE, Software Developer, or Systems Administrator in large‑scale environments.
- A track record of building and operating robust telemetry and monitoring solutions is a plus.
- Passion for automating tasks and processes.
- Comfortable containerizing solutions and designing, building, and deploying container‑based workloads efficiently on Kubernetes.
- Proficient with Python, Go, and Bash, plus familiarity with configuration management and templating tools (e.g., Ansible, Jinja2).
- Strong knowledge of Linux systems and IP networking concepts, with hands‑on experience in routing, switching, and network troubleshooting.
- Practical knowledge of a variety of platforms, including Arista EOS, NVIDIA Cumulus Linux, Nokia SR OS, and SR Linux.
- Collaborative, humble, and always ready to help others while staying open to learning from more senior colleagues.
Preferred Qualifications
- College Education: Bachelor’s degree in Computer Science or a related field.
- Machine Learning for Anomaly Detection: Hands‑on experience applying ML techniques or tools (e.g., TensorFlow, scikit‑learn) to proactively detect performance or security anomalies in network traffic.
- Network Certifications: Certifications like CCNA, CCNP, or similar.
- Advanced Metrics & Analytics: Hands‑on experience with data pipelines, event correlation, or anomaly detection in large‑scale environments.
- Distributed Tracing: Familiarity with OpenTelemetry, Jaeger, or Zipkin for end‑to‑end tracing across microservices and network components.
What We Offer
In addition to a competitive salary, we offer a variety of benefits to support your needs, including:
- Family‑level Medical Insurance
- Family‑level Dental Insurance
- Generous Pension Contribution
- Life Assurance at 4x Salary
- Critical Illness Cover
- Employee Assistance Programme
- Tuition Reimbursement
- Work culture focused on innovative disruption
To fulfil our obligation to protect client data, successful applicants offered employment with CoreWeave will be required to complete a basic criminal record check, conducted in compliance with GDPR. Employment offers are conditional upon receiving satisfactory check results.
Our Workplace
While we prioritize a hybrid work environment, remote work may be considered for candidates located more than 30 miles from an office, based on role requirements for specialized skill sets. New hires will be invited to attend onboarding at one of our hubs within their first month. Teams also gather quarterly to support collaboration.
CoreWeave is an equal opportunity employer, committed to fostering an inclusive and supportive workplace. All qualified applicants and candidates will receive consideration for employment without regard to race, color, religion, sex, disability, age, sexual orientation, gender identity, national origin, veteran status, or genetic information.
Export Control Compliance
This position requires access to export controlled information. To conform to U.S. Government export regulations applicable to that information, applicant must either be (A) a U.S. person, defined as a (i) U.S. citizen or national, (ii) U.S. lawful permanent resident (green card holder), (iii) refugee under 8 U.S.C. § 1157, or (iv) asylee under 8 U.S.C. § 1158, (B) eligible to access the export controlled information without a required export authorization, or (C) eligible and reasonably likely to obtain the required export authorization from the applicable U.S. government agency. CoreWeave may, for legitimate business reasons, decline to pursue any export licensing process.
Senior Engineer, Network Observability in London employer: CoreWeave
CoreWeave is an exceptional employer that prioritises innovation and employee growth, offering a dynamic work culture where collaboration thrives. With competitive benefits such as family-level medical and dental insurance, generous pension contributions, and a commitment to continuous learning, employees are empowered to excel in their roles. Located in a hybrid-friendly environment, CoreWeave fosters inclusivity and support, making it an ideal place for talented individuals seeking meaningful and rewarding careers in network observability.
StudySmarter Expert Advice🤫
We think this is how you could land Senior Engineer, Network Observability in London
✨Join Local Tech Meetups
Get out there and mingle with fellow developers by joining local tech meetups. It’s a fantastic way to meet people who might be working at CoreWeave or know someone who does. Plus, you can pick up new tech skills and keep up with trends while you're at it!
✨Contribute to Open Source Projects
Show off your coding chops by jumping into open-source projects. Not only does this give you practical experience, but it also gets you noticed in the dev community. You'll create a killer portfolio that speaks volumes about your skills to CoreWeave.
✨Tap into Online Developer Communities
Don’t underestimate the power of online developer communities like GitHub, Stack Overflow, and even Reddit. Participate in discussions, share your projects, and build your visibility. You can often find opportunities through these channels that can lead to a full-time gig at companies like CoreWeave.
✨Explore Job Boards Specifically for Tech Roles
Keep your eyes peeled on job boards that focus on tech roles. Sites like TechCareers or Stack Overflow Jobs can often have listings for companies like CoreWeave that might not show up on broader job sites. Make it a habit to check these regularly, and don’t hesitate to apply directly through our website!
Some tips for your application 🫡
Show off your coding skills: When applying for a software engineering role, it's super important to showcase your coding skills. Make sure your CV includes your tech stack, any relevant programming languages you’re comfortable with, and examples of projects you've worked on. If you have a GitHub profile, link it up! We love to see code in action.
Tailor your portfolio: For a full-time role, we’d expect to see some solid examples of your work in your portfolio. Make sure to include at least two or three projects that highlight your problem-solving skills and your ability to work with different technologies. Focus on the projects that are most relevant to the position at CoreWeave.
Craft a killer cover letter: Your cover letter is your chance to stand out, so make it personal! Explain why you want to work at CoreWeave and how your skills align with the role. Show us your passion for software development. We dig enthusiastic candidates who understand the value of collaboration and continuous learning!
Be clear and concise: When it comes to writing your CV and cover letter, clarity is key. Avoid jargon that could confuse us and stick to simple, direct language. Highlight your achievements with quantifiable results where possible, and keep everything easy to read. A well-organised application goes a long way!
How to prepare for a job interview at CoreWeave
✨Brush Up on Your Coding Skills
For a full-time software engineering role, it's crucial to stay sharp with your coding abilities. Expect technical questions that might involve solving problems on the spot or discussing algorithms. Practise on platforms like LeetCode or HackerRank to get comfortable with the types of questions that often come up.
✨Know Your Tools and Frameworks
Make sure you’re well-acquainted with the tools and technologies listed in the job description. Familiarise yourself with any specific frameworks or programming languages mentioned. Since this role lists Prometheus, Grafana, and Python, for instance, be ready to discuss how you’ve used them in previous projects or coursework.
✨Showcase Your Projects
Bring along a portfolio that highlights your best work. This could be code samples, GitHub repositories, or any side projects you’ve built. Make sure you can talk through your thought process for each project, especially the challenges you faced and how you solved them; this shows your problem-solving skills in action.
✨Prepare for Behavioural Questions
While technical skills are key, full-time positions also require cultural fit. Be ready to discuss your previous experiences and how you handle teamwork, conflict, and deadlines. Brush up on the STAR method (Situation, Task, Action, Result) to clearly articulate your past experiences when discussing how you've contributed to a team.