ML Engineer (LLM)

ML Engineer (LLM)

Full-Time 48000 - 84000 £ / year (est.) No home office possible
O

At a Glance

  • Tasks: Own the lifecycle of large language models and integrate multimodal AI capabilities.
  • Company: Join Oh, a pioneer in hyper-realistic AI-driven content and multimodal products.
  • Benefits: Enjoy competitive pay, remote work, flexible hours, and rapid career growth.
  • Why this job: Be part of a fast-paced team pushing AI boundaries in an innovative culture.
  • Qualifications: 5+ years in ML engineering with expertise in Transformer-based LLMs and Python.
  • Other info: Ideal for those passionate about AI, gaming, and ethical frameworks.

The predicted salary is between 48000 - 84000 £ per year.

Machine Learning / LLM Engineer

About Oh:

Oh is pioneering hyper-realistic, uncensored AI-driven content , building a full-spectrum ecosystem of multimodal AI products. Our platform powers lifelike digital twins and AI characters across text, voice, and images.

With a mission to become the OpenAI of the spicy content industry , we iterate fast, push boundaries, and deploy cutting-edge, real-time conversational AI experiences at scale .

The Role:

Our platform integrates a variety of multimodal GenAI models. You will own the technical roadmap and full lifecycle of our large language models , most notably our flagship Llama 3.1 70B and other open-source models .

Your responsibilities will include:

  • Fine-tuning with custom and synthetic datasets
  • Deploying on GPU platforms to ensure low-latency, cost-efficient, and safe real-time interactions
  • Driving multimodal expansion —integrating text, voice, and image capabilities
  • Embedding robust safety and compliance measures
  • Keeping on top of recent development in the field and auditing new models for a wide range of purposes (e.g. conversational AI, intent classification, AI agents life planner)

Key Responsibilities:

LLM Fine-Tuning & Optimization

  • Fine-tune and optimize models (Llama 3.1 70B, GPT-based, Mistral, etc. ) using domain-specific and synthetic datasets
  • Enhance accuracy, reduce hallucinations , and improve alignment with user intent

Deployment & Infrastructure Management

  • Deploy scalable, memory-efficient models on GPU-based platforms (Runpod, AWS, Kubernetes clusters)
  • Optimize GPU inference with Torch , CUDA, TensorRT, vLLM, and DeepSpeed

Multimodal & Cross-Model Integration

  • Integrate additional open-source models to enable image prompt generation, voice synthesis, and dynamic character personalization
  • Expand multimodal AI capabilities (e.g. improve LLava-based vision models)

Data Pipeline & Evaluation

  • Design robust data pipelines for curation, cleaning, synthetic data generation, and versioning (DVC)
  • Implement evaluation metrics and continuous monitoring to ensure model quality

Real-Time Performance & System Optimization

  • Ensure low-latency, real-time performance using mixed-precision training, quantization, pruning, and distillation techniques

Safety, Moderation & Compliance

  • Embed robust safety, content moderation, and ethical AI frameworks to comply with GDPR and industry standards
  • Develop custom token filters and controlled response mechanisms

Monitoring, Diagnostics & Cost Management

  • Set up and maintain monitoring tools (Prometheus, Grafana, TensorBoard, Weights & Biases, Sentry) for performance tracking and cost optimization

Technical Skills & Requirements:

Experience:

  • 5+ years in machine learning engineering, NLP, or AI research with deep expertise in Transformer-based LLMs

Programming & Frameworks:

  • Strong proficiency in Python and Bash scripting
  • Hands-on experience with PyTorch , HuggingFace libraries (Transformers, Diffusers, PEFT, Accelerate ), and the common ML toolkit (e.g. SKLearn, Pandas, Numpy)
  • Familiarity with JAX/TensorFlow is a plus

LLM Specialization:

  • Proven expertise in fine-tuning LLMs using techniques like LoRA, QLoRA, PEFT, RLHF, and prompt engineering

GPU & Inference Optimization:

  • Experience with common inference speed optimisation and model quantization techniques.

Deployment & Orchestration:

  • Skilled in containerization (Docker) and orchestration (Kubernetes) for scalable ML deployments
  • Experience with major MLOps frameworks (MLFlow / KubeFlow ) preferred

Data Handling:

  • Proficient in data wrangling and preprocessing (Pandas , Dask )
  • Experience managing large-scale datasets using AWS (S3 , RedShift , EC2 )
  • Knowledge of data QC and monitoring tools (DVC , Great Expectations )

Additional Knowledge:

  • Understanding of retrieval-augmented generation (RAG) techniques
  • Familiarity with vector databases (FAISS, Pinecone, Weaviate)

Preferred Qualifications:

✅ Experience integrating and optimizing multimodal models (text, voice, image, video)

✅ Background in AI-driven gaming, digital experiences, or adult content

✅ Familiarity with CI/CD pipelines (GitLab CI, Jenkins) for ML workflows

✅ Interest or experience in crypto, Web3, or NFT-based AI models

✅ Prior exposure to AI governance, safety, or ethical AI frameworks

What We Offer:

Competitive Compensation:

  • Attractive salary, benefits, and equity participation

Remote & Flexible:

  • Remote-first work environment with flexible hours

Growth & Leadership:

  • Rapid career advancement and the opportunity to shape our AI strategy

Innovative Culture:

  • Join a fast-paced team at the forefront of advanced, uncensored AI applications

If you’re passionate about pushing the boundaries of AI-driven experiences and have a track record in developing, deploying, and optimizing cutting-edge LLMs , we want to hear from you!

ML Engineer (LLM) employer: OhChat

At Oh, we are not just building AI; we are redefining the landscape of digital content with a commitment to innovation and creativity. Our remote-first work culture fosters flexibility and collaboration, allowing you to thrive while contributing to groundbreaking projects in the spicy content industry. With competitive compensation, rapid career advancement opportunities, and a team that values your expertise, Oh is the perfect place for ML Engineers looking to make a meaningful impact in the world of AI.
O

Contact Detail:

OhChat Recruiting Team

StudySmarter Expert Advice 🤫

We think this is how you could land ML Engineer (LLM)

✨Tip Number 1

Familiarize yourself with the specific LLMs mentioned in the job description, especially Llama 3.1 70B. Understanding their architecture and fine-tuning techniques will give you a significant edge during discussions.

✨Tip Number 2

Showcase your experience with GPU deployment and optimization. Be prepared to discuss specific projects where you've successfully implemented low-latency solutions using tools like Torch and CUDA.

✨Tip Number 3

Highlight any experience you have with multimodal AI integration. Discuss how you've worked with text, voice, and image models in past projects to demonstrate your capability in expanding AI functionalities.

✨Tip Number 4

Stay updated on the latest trends in AI safety and compliance. Being knowledgeable about GDPR and ethical AI frameworks will show that you are not only technically skilled but also aware of the broader implications of your work.

We think you need these skills to ace ML Engineer (LLM)

Machine Learning Engineering
Natural Language Processing (NLP)
Transformer-based LLMs
Fine-tuning LLMs
Python Programming
Bash Scripting
PyTorch
HuggingFace Libraries
Data Pipeline Design
GPU Optimization
Containerization (Docker)
Kubernetes Orchestration
Data Wrangling and Preprocessing
AWS Services (S3, RedShift, EC2)
Monitoring Tools (Prometheus, Grafana, TensorBoard)
Quantization Techniques
Ethical AI Frameworks
CI/CD Pipelines
Multimodal Model Integration

Some tips for your application 🫡

Tailor Your CV: Make sure your CV highlights relevant experience in machine learning, particularly with LLMs and multimodal models. Emphasize your expertise in fine-tuning and optimizing models, as well as any experience with GPU deployment.

Craft a Compelling Cover Letter: In your cover letter, express your passion for AI-driven experiences and detail how your background aligns with the company's mission. Mention specific projects or achievements that demonstrate your skills in NLP and AI research.

Showcase Technical Skills: Clearly outline your technical skills related to Python, PyTorch, and any relevant frameworks. Include examples of how you've used these tools in past projects, especially in relation to LLMs and multimodal integration.

Highlight Continuous Learning: Mention any recent courses, certifications, or self-study efforts that keep you updated on the latest developments in AI and machine learning. This shows your commitment to staying at the forefront of the field.

How to prepare for a job interview at OhChat

✨Showcase Your LLM Expertise

Be prepared to discuss your experience with large language models, especially fine-tuning techniques like LoRA and RLHF. Highlight specific projects where you've successfully optimized models and improved their performance.

✨Demonstrate Technical Proficiency

Familiarize yourself with the tools and frameworks mentioned in the job description, such as PyTorch, HuggingFace, and Docker. Be ready to answer technical questions or even solve problems on the spot related to these technologies.

✨Discuss Multimodal Integration

Since the role involves integrating text, voice, and image capabilities, prepare examples of how you've worked with multimodal AI systems. Discuss any challenges you faced and how you overcame them.

✨Emphasize Safety and Compliance Knowledge

Understand the importance of safety and compliance in AI applications. Be ready to talk about your experience with ethical AI frameworks and how you've implemented safety measures in past projects.

ML Engineer (LLM)
OhChat
O
  • ML Engineer (LLM)

    Full-Time
    48000 - 84000 £ / year (est.)

    Application deadline: 2027-03-07

  • O

    OhChat

Similar positions in other companies
Europas größte Jobbörse für Gen-Z
discover-jobs-cta
Discover now
>