Sr. Engineer II - EPICS, NG-SIEM (Hybrid) in London

Sr. Engineer II - EPICS, NG-SIEM (Hybrid) in London

London Full-Time 80000 - 100000 £ / year (est.) Home office (partial)
CrowdStrike

At a Glance

  • Tasks: Design and maintain systems for the world's largest SIEM platform, ensuring reliability and scalability.
  • Company: Join CrowdStrike, a global leader in cybersecurity with a mission to stop breaches.
  • Benefits: Competitive pay, wellness programs, flexible work, and professional growth opportunities.
  • Other info: Hybrid role with a vibrant office culture and global collaboration.
  • Why this job: Be part of a mission-driven team that makes a real impact in cybersecurity.
  • Qualifications: 10+ years in software or platform engineering with strong skills in large-scale systems.

The predicted salary is between 80000 - 100000 £ per year.

As a global leader in cybersecurity, CrowdStrike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. We work on large scale distributed systems, processing almost 3 trillion events per day and this traffic is growing daily. Our customers span all industries, and they count on CrowdStrike to keep their businesses running, their communities safe and their lives moving forward. We’re also a mission-driven company. We cultivate a culture that gives every CrowdStriker both the flexibility and autonomy to own their careers. We’re always looking to add talented CrowdStrikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters? The future of cybersecurity starts with you.

About the Role: Our mission is to make all of our customers' security-relevant data continuously available for automated detection and response, threat hunting, and other Falcon platform use cases. To enable this, the systems behind NG-SIEM (next-generation security information and event management) are growing to accommodate >100 PB of event and action data ingested every day, up to 10 years of retention, and dozens of millions of queries per hour across large sections of the data stored, for tens of thousands of customers.

As a Senior Engineer II on the newly established NG-SIEM EPICS (End-to-End Performance, Incident-response, Cost, and Scaling) team, you will own the reliability and scalability of the security industry's largest SIEM platform — treating these as software engineering problems rather than purely operational ones. The NG-SIEM platform comprises many decoupled components interacting across complex pipelines. As we scale, ensuring end-to-end health across ingest, search, and workflow execution requires deep cross-service expertise and coordinated action. You will be the engineer who builds the observability, automation, and scaling systems that keep the entire platform performing — not just individual components. You will join a distributed team of high-ownership technical leaders who share a strong passion for our mission: to stop breaches. This is a hybrid role based in one of our offices in London (United Kingdom), Aarhus (Denmark) or Dublin (Ireland) 2-3x a week.

What You'll Do:

  • End-to-end observability: Design, build, and maintain monitoring and synthetic test suites that provide deep visibility into the health of the entire NG-SIEM pipeline — from ingest through search and workflow execution — enabling rapid root cause analysis across component boundaries.
  • Coordinated scaling: Engineer orchestrated scaling solutions that treat the NG-SIEM pipeline as a unified system, proportionally increasing resources across all dependent components (Kafka, ingest pipelines, downstream services) to eliminate cascading bottleneck patterns.
  • Incident response engineering: Serve as a subject matter expert during platform-wide incidents (P2 and above), applying cross-service knowledge to diagnose and resolve multi-component failures. Partake in follow-the-sun on-call rotations, providing incident commander coordination for critical platform-wide events.
  • Capacity planning and cost management: Build and refine models for end-to-end capacity forecasting that account for all pipeline dimensions, including partner team dependencies (data services, GPS). Develop tooling to continuously track and surface cost drivers across the platform.
  • Automation and runbooks: Transform manual standard operating procedures into automated remediation workflows — including pipeline-wide scaling responses, CID rebalancing, and infrastructure healing — with the goal of resolving issues before customers are impacted.
  • Cross-team collaboration: Partner with cell-level teams, product engineering, GDI/3PI, and external stakeholders (e.g., CSM) to triage SLO breaches, drive problem management for large reliability efforts, and ensure consistent communication during incidents.
  • Platform improvements: Use your broad NG-SIEM knowledge to identify and drive systemic improvements across teams, contributing to the platform's long-term resilience and efficiency.

What You'll Need:

  • A passion for reliability engineering and curiosity about how large-scale running systems behave under pressure;
  • 10+ years of experience in software engineering, site reliability engineering, or platform engineering, with significant time spent on large-scale distributed systems, and the ability to make pragmatic tradeoffs between short-term delivery needs and long-term platform goals;
  • Strong proficiency in at least one systems programming language (Go, Java, Rust, or C++) and one scripting language (Python, Bash);
  • Deep experience with end-to-end observability — building monitoring pipelines, defining SLIs/SLOs, and creating dashboards that drive actionable insights across multi-service architectures;
  • Demonstrated ability to diagnose and resolve complex incidents spanning multiple distributed components operating 24/7;
  • Experience with coordinated capacity planning and scaling for systems with significant infrastructure footprints;
  • Hands-on experience with streaming platforms (Kafka or similar) and understanding of backpressure, partition management, and consumer group dynamics at scale;
  • Familiarity with infrastructure-as-code, CI/CD pipelines, and automated deployment practices;
  • A can-do attitude — you thrive collaborating in a team and are not afraid of taking on responsibilities;
  • Strong written and verbal communication skills — you will lead incident communications and produce post-incident analyses that drive lasting improvements;
  • Comfort working across time zones with globally distributed teams.

Bonus Points:

  • Experience in a similar reliability or platform engineering role at a hyperscaler (AWS, Azure, GCP) or large-scale SaaS provider;
  • Track record of building automated remediation and self-healing infrastructure;
  • Experience with cost modeling and unit economics for large compute and storage footprints;
  • Familiarity with cloud-native architectures and serverless computing paradigms;
  • Hands-on experience operating platforms processing over 1 trillion events per day or more than 10 PB of data per day;
  • Exposure to or experience with Log Management, cybersecurity products, or security operations workflows;
  • Experience with disaster recovery planning and execution for multi-region systems.

Benefits of Working at CrowdStrike:

  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays for recharge
  • Paid parental and adoption leaves
  • Professional development opportunities for all employees regardless of level or role
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
  • Vibrant office culture with world class amenities
  • Great Place to Work Certified™ across the globe

CrowdStrike is proud to be an equal opportunity employer. We are committed to fostering a culture of belonging where everyone is valued for who they are and empowered to succeed. We support veterans and individuals with disabilities through our affirmative action program. CrowdStrike is committed to providing equal employment opportunity for all employees and applicants for employment. The Company does not discriminate in employment opportunities or practices on the basis of race, color, creed, ethnicity, religion, sex (including pregnancy or pregnancy-related medical conditions), sexual orientation, gender identity, marital or family status, veteran status, age, national origin, ancestry, physical disability (including HIV and AIDS), mental disability, medical condition, genetic information, membership or activity in a local human rights commission, status with regard to public assistance, or any other characteristic protected by law. We base all employment decisions--including recruitment, selection, training, compensation, benefits, discipline, promotions, transfers, lay-offs, return from lay-off, terminations and social/recreational programs--on valid job requirements.

Sr. Engineer II - EPICS, NG-SIEM (Hybrid) in London employer: CrowdStrike

CrowdStrike is an exceptional employer that prioritises employee growth and well-being, offering competitive compensation, comprehensive wellness programmes, and professional development opportunities for all levels. With a vibrant office culture in London, Aarhus, or Dublin, employees enjoy the flexibility of a hybrid work model while being part of a mission-driven team dedicated to stopping breaches and innovating in cybersecurity. Join us to be part of a supportive community that values diversity and empowers you to take ownership of your career.

CrowdStrike

Contact Details:

CrowdStrike Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Sr. Engineer II - EPICS, NG-SIEM (Hybrid) in London

Tip Number 1

Network like a pro! Reach out to current CrowdStrikers on LinkedIn or at industry events. A friendly chat can give you insider info and maybe even a referral, which can really boost your chances.

Tip Number 2

Prepare for the interview by diving deep into CrowdStrike's mission and values. Show us how your passion for cybersecurity aligns with our goal to stop breaches. We love candidates who are genuinely excited about what we do!

Tip Number 3

Practice makes perfect! Run through common technical questions and scenarios related to large-scale distributed systems. We want to see how you think on your feet, so be ready to showcase your problem-solving skills.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows us you’re serious about joining the CrowdStrike team!

We think you need these skills to ace Sr. Engineer II - EPICS, NG-SIEM (Hybrid) in London

Reliability Engineering
Software Engineering
Site Reliability Engineering
Platform Engineering
Large-Scale Distributed Systems
Systems Programming (Go, Java, Rust, C++)
Scripting (Python, Bash)

Some tips for your application 🫡

Tailor Your Application:Make sure to customise your CV and cover letter for the role. Highlight your experience with large-scale distributed systems and any relevant projects that showcase your skills in reliability engineering. We want to see how you fit into our mission!

Showcase Your Passion:Let your enthusiasm for cybersecurity shine through! Share examples of how you've tackled challenges in previous roles, especially those related to incident response or automation. We love seeing candidates who are genuinely excited about what they do.

Be Clear and Concise:When writing your application, keep it straightforward. Use clear language and avoid jargon unless it's necessary. We appreciate a well-structured application that gets straight to the point, making it easy for us to see your qualifications.

Apply Through Our Website:Don’t forget to submit your application through our website! It’s the best way for us to receive your details and ensures you’re considered for the role. Plus, it shows you’re serious about joining our team at CrowdStrike!

How to prepare for a job interview at CrowdStrike

Know Your Stuff

Make sure you brush up on your knowledge of large-scale distributed systems and the specific technologies mentioned in the job description, like Kafka and observability tools. Be ready to discuss your past experiences and how they relate to the role.

Show Your Passion for Reliability

CrowdStrike is all about stopping breaches, so demonstrate your passion for reliability engineering. Share examples of how you've tackled complex incidents or improved system performance in your previous roles. This will show that you're aligned with their mission.

Prepare for Technical Questions

Expect technical questions that dive deep into your experience with programming languages and incident response. Practice explaining your thought process when diagnosing issues and how you approach capacity planning and scaling solutions.

Emphasise Collaboration Skills

Since this role involves cross-team collaboration, be prepared to discuss how you've worked with different teams in the past. Highlight your communication skills and any experiences where you led incident communications or drove problem management efforts.