Remote Senior Software Engineer – LLM Evaluation (US-based) in Ramsbottom

Remote Senior Software Engineer – LLM Evaluation (US-based) in Ramsbottom

Ramsbottom Freelance 60000 - 80000 € / year (est.) Home office possible
Turing

At a Glance

  • Tasks: Create cutting-edge datasets and evaluate AI-generated code for efficiency and reliability.
  • Company: Join Turing, a leading research accelerator for frontier AI labs.
  • Benefits: Flexible remote work, competitive pay, and potential for contract extensions.
  • Other info: Dynamic role with opportunities for growth and innovation in AI technology.
  • Why this job: Make an impact in AI by collaborating with top researchers and engineers.
  • Qualifications: 3+ years of software engineering experience, strong Python skills, and excellent communication.

The predicted salary is between 60000 - 80000 € per year.

About Us: Based in San Francisco, California, Turing is the world’s leading research accelerator for frontier AI labs and a trusted partner for global enterprises deploying advanced AI systems. Turing supports customers by accelerating frontier research with high-quality data, advanced training pipelines, plus top AI researchers who specialise in software engineering, logical reasoning, STEM, multilinguality, multimodality, and agents; and by applying that expertise to help enterprises transform AI from proof of concept into proprietary intelligence with systems that perform reliably, deliver measurable impact, and drive lasting results on the P&L.

Ideal Background: This role is ideal for engineers who have built production systems at companies like Google, Microsoft, Apple, Amazon, Meta, or similar high-scale engineering organisations. We especially welcome graduates from top computer science programmes such as Stanford, MIT, Carnegie Mellon, UC Berkeley, Georgia Tech, and comparable institutions β€” though exceptional experience and skill always take precedence over pedigree.

Project Overview: As a Software Engineering evaluator, you will create cutting-edge datasets for training, benchmarking, and advancing large language models, collaborating closely with researchers. This includes curating code examples, providing precise solutions, and making corrections β€” with a primary focus on Python across backend services, data pipelines, and ML infrastructure, alongside JavaScript (including ReactJS), C/C++, Java, Rust, and Go. You will evaluate and refine AI-generated code for efficiency, scalability, and reliability, and work with cross-functional teams to enhance enterprise-level AI-driven coding solutions.

What Does a Typical Day Look Like? Work on AI model training initiatives by curating code examples, building solutions, and correcting code β€” primarily in Python, with additional work in JavaScript (including ReactJS), C/C++, Java, Rust, and Go. Evaluate and refine AI-generated code to ensure that it is efficient, scalable, and reliable. Collaborate with cross-functional teams to enhance AI-driven coding solutions against industry performance benchmarks. Build agents and automated verification tools in Python that can verify the quality of code and identify error patterns. Hypothesize on steps in the software engineering cycle (prototyping, architecture design, API design, production implementation, launch, experiments, monitoring, operational maintenance) and evaluate model capabilities on them. Design verification mechanisms that can automatically verify a solution to a software engineering task.

Required Skills: Several years of software engineering experience (3 years or more). Strong expertise in Python with deep knowledge of frameworks, tooling, and best practices for building production-grade software. Experience building full-stack applications and deploying scalable software using modern languages and tools. Deep understanding of software architecture, design, development, debugging, and code quality/review assessment. Excellent oral and written communication skills for clear, structured evaluation rationales.

Engagement Details: Commitment: flexible engagement, minimum 10 hrs/week, up to 40 hrs/week. Type: Contractor (no medical/paid leave). Duration: 1 month (potential extensions based on performance and fit). Location: Candidates must be based in the United States.

Evaluation Process: The application process takes 15–30 minutes. Completion of an AI video interview is required. Note: As part of assessments you will go through an AI video interview. After applying, you will receive an email with a login link. Please use that link to access the portal and complete your profile.

Remote working/work at home options are available for this role.

Remote Senior Software Engineer – LLM Evaluation (US-based) in Ramsbottom employer: Turing

Turing is an exceptional employer that fosters a dynamic and innovative work culture, perfect for Senior Software Engineers looking to make a significant impact in the AI field. With flexible remote working options and a commitment to employee growth, Turing provides opportunities to collaborate with top-tier researchers and contribute to cutting-edge projects that shape the future of AI technology. Join us in San Francisco, where your expertise will be valued, and your contributions will drive meaningful advancements in AI systems.

Turing

Contact Detail:

Turing Recruiting Team

StudySmarter Expert Advice🀫

We think this is how you could land Remote Senior Software Engineer – LLM Evaluation (US-based) in Ramsbottom

✨Tip Number 1

Get your tech skills sharp! Brush up on Python and any other languages mentioned in the job description. We want to see you confidently tackle coding challenges during interviews, so practice makes perfect!

✨Tip Number 2

Network like a pro! Connect with current employees or alumni from your university who work at Turing or similar companies. A friendly chat can give you insider tips and maybe even a referral!

✨Tip Number 3

Prepare for that AI video interview! Familiarise yourself with common questions and practice articulating your thought process clearly. We want to see how you think, so don’t hold back on showcasing your problem-solving skills.

✨Tip Number 4

Apply through our website! It’s the best way to ensure your application gets seen. Plus, it’s super easy and quickβ€”just 15-30 minutes of your time could land you an exciting role with us!

We think you need these skills to ace Remote Senior Software Engineer – LLM Evaluation (US-based) in Ramsbottom

Python
JavaScript
ReactJS
C/C++
Java
Rust
Go

Some tips for your application 🫑

Tailor Your Application:Make sure to customise your CV and cover letter for the role. Highlight your experience with Python and any relevant projects that showcase your skills in building production systems. We want to see how you fit into our world of AI-driven solutions!

Show Off Your Communication Skills:Since this role requires excellent written communication, ensure your application is clear and structured. Use concise language and avoid jargon unless necessary. We appreciate clarity as much as technical prowess!

Highlight Relevant Experience:If you've worked at big names like Google or Amazon, don’t shy away from mentioning it! But remember, exceptional experience matters more than just the name of the company. Share specific examples of your work that relate to AI and software engineering.

Apply Through Our Website:We encourage you to apply directly through our website for a smoother process. It’s quick and easy, and you’ll get all the info you need about the role and our team. Plus, we love seeing applications come in through our own channels!

How to prepare for a job interview at Turing

✨Know Your Tech Stack

Make sure you’re well-versed in Python and the other languages mentioned, like JavaScript and C++. Brush up on frameworks and best practices for building production-grade software. Being able to discuss your experience with these technologies confidently will show that you're a strong candidate.

✨Showcase Your Problem-Solving Skills

Prepare to discuss specific examples where you've evaluated and refined code for efficiency and scalability. Think of scenarios where you’ve collaborated with cross-functional teams to enhance coding solutions. This will demonstrate your ability to work in a team and tackle real-world challenges.

✨Practice Clear Communication

Since excellent communication skills are crucial, practice explaining complex technical concepts in simple terms. You might be asked to provide structured evaluation rationales, so being articulate and clear will set you apart from other candidates.

✨Familiarise Yourself with AI Concepts

Given the role's focus on AI model training and evaluation, brush up on relevant AI concepts and methodologies. Be ready to discuss how you would approach curating datasets and evaluating AI-generated code. Showing your understanding of AI will highlight your fit for the position.