At a Glance
- Tasks: Own the lifecycle of large language models and integrate multimodal AI capabilities.
- Company: Join Oh, a pioneer in hyper-realistic AI-driven content and multimodal products.
- Benefits: Enjoy competitive pay, remote work, flexible hours, and rapid career growth.
- Why this job: Be part of a fast-paced team pushing AI boundaries in an innovative culture.
- Qualifications: 5+ years in ML engineering with expertise in Transformer-based LLMs and Python.
- Other info: Ideal for those passionate about AI, gaming, and ethical frameworks.
The predicted salary is between 48000 - 84000 £ per year.
Machine Learning / LLM Engineer
About Oh:
Oh is pioneering hyper-realistic, uncensored AI-driven content , building a full-spectrum ecosystem of multimodal AI products. Our platform powers lifelike digital twins and AI characters across text, voice, and images.
With a mission to become the OpenAI of the spicy content industry , we iterate fast, push boundaries, and deploy cutting-edge, real-time conversational AI experiences at scale .
The Role:
Our platform integrates a variety of multimodal GenAI models. You will own the technical roadmap and full lifecycle of our large language models , most notably our flagship Llama 3.1 70B and other open-source models .
Your responsibilities will include:
- Fine-tuning with custom and synthetic datasets
- Deploying on GPU platforms to ensure low-latency, cost-efficient, and safe real-time interactions
- Driving multimodal expansion —integrating text, voice, and image capabilities
- Embedding robust safety and compliance measures
- Keeping on top of recent development in the field and auditing new models for a wide range of purposes (e.g. conversational AI, intent classification, AI agents life planner)
Key Responsibilities:
LLM Fine-Tuning & Optimization
- Fine-tune and optimize models (Llama 3.1 70B, GPT-based, Mistral, etc. ) using domain-specific and synthetic datasets
- Enhance accuracy, reduce hallucinations , and improve alignment with user intent
Deployment & Infrastructure Management
- Deploy scalable, memory-efficient models on GPU-based platforms (Runpod, AWS, Kubernetes clusters)
- Optimize GPU inference with Torch , CUDA, TensorRT, vLLM, and DeepSpeed
Multimodal & Cross-Model Integration
- Integrate additional open-source models to enable image prompt generation, voice synthesis, and dynamic character personalization
- Expand multimodal AI capabilities (e.g. improve LLava-based vision models)
Data Pipeline & Evaluation
- Design robust data pipelines for curation, cleaning, synthetic data generation, and versioning (DVC)
- Implement evaluation metrics and continuous monitoring to ensure model quality
Real-Time Performance & System Optimization
- Ensure low-latency, real-time performance using mixed-precision training, quantization, pruning, and distillation techniques
Safety, Moderation & Compliance
- Embed robust safety, content moderation, and ethical AI frameworks to comply with GDPR and industry standards
- Develop custom token filters and controlled response mechanisms
Monitoring, Diagnostics & Cost Management
- Set up and maintain monitoring tools (Prometheus, Grafana, TensorBoard, Weights & Biases, Sentry) for performance tracking and cost optimization
Technical Skills & Requirements:
Experience:
- 5+ years in machine learning engineering, NLP, or AI research with deep expertise in Transformer-based LLMs
Programming & Frameworks:
- Strong proficiency in Python and Bash scripting
- Hands-on experience with PyTorch , HuggingFace libraries (Transformers, Diffusers, PEFT, Accelerate ), and the common ML toolkit (e.g. SKLearn, Pandas, Numpy)
- Familiarity with JAX/TensorFlow is a plus
LLM Specialization:
- Proven expertise in fine-tuning LLMs using techniques like LoRA, QLoRA, PEFT, RLHF, and prompt engineering
GPU & Inference Optimization:
- Experience with common inference speed optimisation and model quantization techniques.
Deployment & Orchestration:
- Skilled in containerization (Docker) and orchestration (Kubernetes) for scalable ML deployments
- Experience with major MLOps frameworks (MLFlow / KubeFlow ) preferred
Data Handling:
- Proficient in data wrangling and preprocessing (Pandas , Dask )
- Experience managing large-scale datasets using AWS (S3 , RedShift , EC2 )
- Knowledge of data QC and monitoring tools (DVC , Great Expectations )
Additional Knowledge:
- Understanding of retrieval-augmented generation (RAG) techniques
- Familiarity with vector databases (FAISS, Pinecone, Weaviate)
Preferred Qualifications:
✅ Experience integrating and optimizing multimodal models (text, voice, image, video)
✅ Background in AI-driven gaming, digital experiences, or adult content
✅ Familiarity with CI/CD pipelines (GitLab CI, Jenkins) for ML workflows
✅ Interest or experience in crypto, Web3, or NFT-based AI models
✅ Prior exposure to AI governance, safety, or ethical AI frameworks
What We Offer:
Competitive Compensation:
- Attractive salary, benefits, and equity participation
Remote & Flexible:
- Remote-first work environment with flexible hours
Growth & Leadership:
- Rapid career advancement and the opportunity to shape our AI strategy
Innovative Culture:
- Join a fast-paced team at the forefront of advanced, uncensored AI applications
If you’re passionate about pushing the boundaries of AI-driven experiences and have a track record in developing, deploying, and optimizing cutting-edge LLMs , we want to hear from you!
ML Engineer (LLM) employer: OhChat
Contact Detail:
OhChat Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land ML Engineer (LLM)
✨Tip Number 1
Familiarize yourself with the specific LLMs mentioned in the job description, especially Llama 3.1 70B. Understanding their architecture and fine-tuning techniques will give you a significant edge during discussions.
✨Tip Number 2
Showcase your experience with GPU deployment and optimization. Be prepared to discuss specific projects where you've successfully implemented low-latency solutions using tools like Torch and CUDA.
✨Tip Number 3
Highlight any experience you have with multimodal AI integration. Discuss how you've worked with text, voice, and image models in past projects to demonstrate your capability in expanding AI functionalities.
✨Tip Number 4
Stay updated on the latest trends in AI safety and compliance. Being knowledgeable about GDPR and ethical AI frameworks will show that you are not only technically skilled but also aware of the broader implications of your work.
We think you need these skills to ace ML Engineer (LLM)
Some tips for your application 🫡
Tailor Your CV: Make sure your CV highlights relevant experience in machine learning, particularly with LLMs and multimodal models. Emphasize your expertise in fine-tuning and optimizing models, as well as any experience with GPU deployment.
Craft a Compelling Cover Letter: In your cover letter, express your passion for AI-driven experiences and detail how your background aligns with the company's mission. Mention specific projects or achievements that demonstrate your skills in NLP and AI research.
Showcase Technical Skills: Clearly outline your technical skills related to Python, PyTorch, and any relevant frameworks. Include examples of how you've used these tools in past projects, especially in relation to LLMs and multimodal integration.
Highlight Continuous Learning: Mention any recent courses, certifications, or self-study efforts that keep you updated on the latest developments in AI and machine learning. This shows your commitment to staying at the forefront of the field.
How to prepare for a job interview at OhChat
✨Showcase Your LLM Expertise
Be prepared to discuss your experience with large language models, especially fine-tuning techniques like LoRA and RLHF. Highlight specific projects where you've successfully optimized models and improved their performance.
✨Demonstrate Technical Proficiency
Familiarize yourself with the tools and frameworks mentioned in the job description, such as PyTorch, HuggingFace, and Docker. Be ready to answer technical questions or even solve problems on the spot related to these technologies.
✨Discuss Multimodal Integration
Since the role involves integrating text, voice, and image capabilities, prepare examples of how you've worked with multimodal AI systems. Discuss any challenges you faced and how you overcame them.
✨Emphasize Safety and Compliance Knowledge
Understand the importance of safety and compliance in AI applications. Be ready to talk about your experience with ethical AI frameworks and how you've implemented safety measures in past projects.