At a Glance
- Tasks: Build and train cutting-edge multimodal generative models for video and audio.
- Company: Join Luma AI, a leader in creative AI with a mission to build multimodal AGI.
- Benefits: Competitive salary, remote work options, and opportunities for professional growth.
- Why this job: Shape the future of creative AI and impact how users create and interact with media.
- Qualifications: Strong background in machine learning, generative modeling, and experience with PyTorch.
- Other info: Be part of a dynamic team driving innovation in AI technology.
Palo Alto, California • Remote - International • London, UK
About Luma AI: Luma’s mission is to build multimodal AGI. Through our research on video, 3D, and now multimodal models at Luma, we believe that AI needs to be jointly trained over all signal modalities – text, video, audio, images – analogous to the human brain. To advance our mission, we build and operate the full stack end-to-end, spanning foundation models, inference systems, and products. This integrated approach powers technologies like Ray3, which is seeing rapidly growing adoption among Fortune 500 companies across media, entertainment, and advertising. Backed by a recent $900M Series C and our partnership with Humain to build a 2 GW compute supercluster (Project Halo), our models and the Dream Machine platform are now enabling creatives worldwide to tell some of the most impactful stories of our time.
Where You Come In: This is a rare and foundational opportunity to define the future of creative AI. You will be at the forefront of building and training large-scale multimodal generative models, directly impacting how users create and interact with video and audio. This role offers the chance to bridge cutting‑edge research with magical, shipped products, working end‑to‑end on novel problems with no existing playbook.
What You’ll Do: This opportunity involves both the “science” and “engineering” parts of research. This is a multi‑stack opportunity where you will work on the intersection of modeling, data, systems, and evaluation.
- Modeling: Architect large-scale video and audio generative models, focusing on strong temporal coherence and high perceptual quality.
- Data: Design, implement, and run robust data pipelines for curating, filtering, and captioning massive video and audio datasets.
- Systems: Train large-scale video and audio generative models on massive datasets and GPU clusters.
- Evaluation: Define and build novel evaluation frameworks to measure realism, temporal consistency, controllability, and human‑aligned creative quality.
Who You Are: Strong foundation in machine learning and generative modeling, with experience in video, audio, or multimodal domains. Deep understanding of autoregressive, diffusion/flow‑based, or hybrid generative models, and their tradeoffs for long‑horizon generation. Hands‑on experience with PyTorch and large‑scale training (distributed, mixed precision, large datasets).
What Sets You Apart (Bonus Points): Experience in the following around data, modeling, or evaluation:
- Text‑to‑video/audio models
- Vision language models
- Audio language models
Your application are reviewed by real people.
Compensation: The base pay range for this role is $250,000 – $450,000 per year.
Research Scientist / Engineer — Video / Audio Generation employer: lumalabs.ai
Contact Detail:
lumalabs.ai Recruiting Team
StudySmarter Expert Advice 🤫
We think this is how you could land Research Scientist / Engineer — Video / Audio Generation
✨Tip Number 1
Network like a pro! Reach out to folks in the industry, especially those at Luma AI. A friendly chat can open doors that applications alone can't.
✨Tip Number 2
Show off your skills! Create a portfolio or a GitHub repo showcasing your projects related to video and audio generation. This gives us a taste of what you can do beyond your CV.
✨Tip Number 3
Prepare for interviews by brushing up on your knowledge of generative models and their applications. We want to see your passion and expertise shine through!
✨Tip Number 4
Apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you're genuinely interested in joining our team.
We think you need these skills to ace Research Scientist / Engineer — Video / Audio Generation
Some tips for your application 🫡
Show Your Passion: When writing your application, let your enthusiasm for AI and multimodal models shine through. We want to see that you’re genuinely excited about the opportunity to work on cutting-edge technology and how it can impact creativity.
Tailor Your CV: Make sure your CV is tailored to highlight your experience in machine learning, generative modeling, and any relevant projects. We love seeing specific examples of your work, especially if they relate to video or audio generation.
Craft a Compelling Cover Letter: Your cover letter is your chance to tell us why you’re the perfect fit for this role. Share your thoughts on the future of creative AI and how your skills align with our mission at Luma AI. Keep it engaging and personal!
Apply Through Our Website: Don’t forget to apply through our website! It’s the best way for us to receive your application and ensures you’re considered for the role. Plus, it gives you a chance to explore more about what we do at Luma AI.
How to prepare for a job interview at lumalabs.ai
✨Know Your Models Inside Out
Make sure you have a solid grasp of the generative models mentioned in the job description, like autoregressive and diffusion models. Be ready to discuss their trade-offs and how they apply to video and audio generation.
✨Showcase Your Hands-On Experience
Prepare to talk about your practical experience with PyTorch and large-scale training. Bring examples of projects where you've implemented data pipelines or trained models on massive datasets to demonstrate your skills.
✨Understand the Evaluation Frameworks
Familiarise yourself with evaluation metrics for generative models, especially those related to realism and temporal consistency. Being able to articulate how you would measure success in these areas will set you apart.
✨Connect Your Work to Their Mission
Luma AI is all about multimodal AGI. Think about how your past work aligns with their mission and be prepared to discuss how you can contribute to building creative AI that impacts users worldwide.