Remote Member of Engineering (Pre-training / Data Acquisition) in Warrington

Remote Member of Engineering (Pre-training / Data Acquisition) in Warrington

Warrington Full-Time 60000 - 80000 £ / year (est.) Working from home possible
P

At a Glance

  • Tasks: Design and build web crawlers to acquire high-quality data for AI model training.
  • Company: Join Poolside, a pioneering company in the quest for Artificial General Intelligence.
  • Benefits: Remote work, competitive salary, and a supportive team culture.
  • Other info: Collaborative environment with opportunities for personal and professional growth.
  • Why this job: Be at the forefront of AI development and make a real impact on future technologies.
  • Qualifications: Experience in data acquisition, web crawling, and strong problem-solving skills.

The predicted salary is between 60000 - 80000 £ per year.

ABOUT POOLSIDE

In this decade, the world will create Artificial General Intelligence. There will only be a small number of companies who will achieve this. Their ability to stack advantages and pull ahead will define the winners. These companies will move faster than anyone else. They will attract the world's most capable talent. They will be on the forefront of applied research, engineering, infrastructure and deployment at scale. They will continue to scale their training to larger to balance this intensity, we’ve assembled a team of low ego and kind-hearted individuals who have built the special culture Poolside has. By building collaboratively and with intention, we create a compounding effect that moves the entire company forward towards our mission: reaching AGI through intelligence systems built for software development.

ABOUT THE ROLE

You'll be working alongside our pre-training data team, focused on one of the most foundational challenges in training frontier LLMs: acquiring the best possible pre-training data. The data we collect is upstream of everything. It directly shapes the capability of the models we train. As our first dedicated data acquisition engineer, you will spearhead and evolve systems that crawl the web at massive scale, rapidly ingest data from strategic partnerships, and build specialized tooling to maximize recall from high-value sources. You'll collaborate closely with pre-training data researchers and engineers to ensure that our sourcing of data maps to our training needs, to ensure we have the most capable pre-trained models.

YOUR MISSION

To deliver the highest-quality, diverse, and most comprehensive data corpus to fuel the pre-training of frontier models for software development.

RESPONSIBILITIES

  • Design, build, and operate a large-scale web crawler responsible for acquiring all openly accessible data on the internet
  • Develop specialized deep crawlers targeting high-value sources to improve recall and coverage
  • In collaboration with data researchers, own a long-term road map for data acquisition
  • Build observability, monitoring, and debugging tooling to ensure reliability and transparency across crawl infrastructure
  • Collaborate with pre-training, post-training, and evaluations teams to align data acquisition priorities with model training needs
  • Build high-throughput ingestion pipelines for rapidly onboarding partner data and evaluating it for quality

Remote Member of Engineering (Pre-training / Data Acquisition) in Warrington employer: poolside

Poolside is an exceptional employer that fosters a collaborative and kind-hearted work culture, attracting top talent in the field of Artificial General Intelligence. As a Remote Member of Engineering, you'll have the opportunity to work on groundbreaking projects that shape the future of software development while enjoying flexible working arrangements and a commitment to employee growth through innovative challenges and supportive teamwork.

P

Contact Details:

poolside Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Remote Member of Engineering (Pre-training / Data Acquisition) in Warrington

Tip Number 1

Network like a pro! Reach out to folks in the industry on LinkedIn or at meetups. We all know that sometimes it’s not just what you know, but who you know that can help you land that dream job.

Tip Number 2

Show off your skills! Create a portfolio or GitHub repository showcasing your projects and contributions. This gives potential employers a taste of what you can do and sets you apart from the crowd.

Tip Number 3

Prepare for interviews by practising common questions and scenarios related to data acquisition and web crawling. We recommend doing mock interviews with friends or using online platforms to boost your confidence.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love seeing candidates who are genuinely interested in joining our mission.

We think you need these skills to ace Remote Member of Engineering (Pre-training / Data Acquisition) in Warrington

Web Crawling
Data Acquisition
Data Ingestion
Tool Development
Collaboration
Monitoring and Debugging
Data Quality Evaluation

Some tips for your application 🫡

Show Your Passion for Data:When writing your application, let us see your enthusiasm for data acquisition and engineering. Share any relevant projects or experiences that highlight your skills in building systems for data collection. We love seeing candidates who are genuinely excited about the role!

Tailor Your CV and Cover Letter:Make sure to customise your CV and cover letter to align with the job description. Highlight your experience with web crawling, data ingestion, and collaboration with teams. This helps us see how you fit into our mission of reaching AGI through effective data strategies.

Be Clear and Concise:Keep your application clear and to the point. Use bullet points where possible to make it easy for us to read. We appreciate straightforward communication, so don’t hesitate to showcase your skills without unnecessary fluff!

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about our culture and values!

How to prepare for a job interview at poolside

Know Your Data

Make sure you understand the fundamentals of data acquisition and web crawling. Brush up on the latest technologies and methodologies used in large-scale data collection. Being able to discuss specific tools or frameworks you've worked with will show your expertise and enthusiasm for the role.

Show Your Collaborative Spirit

Since this role involves working closely with researchers and engineers, be prepared to discuss examples of how you've successfully collaborated in the past. Highlight any experiences where teamwork led to innovative solutions or improved processes, as Poolside values a kind-hearted and low-ego culture.

Prepare for Technical Questions

Expect technical questions that assess your problem-solving skills and understanding of data pipelines. Practice explaining your thought process clearly and concisely. You might even want to run through some coding challenges or system design scenarios relevant to data acquisition to sharpen your skills.

Align with Their Mission

Familiarise yourself with Poolside's mission to reach AGI through software development. Be ready to articulate how your skills and experiences align with their goals. Showing genuine interest in their vision will help you stand out as a candidate who is not just looking for a job, but is passionate about contributing to something bigger.