Machine Learning Data Engineer

Machine Learning Data Engineer

Full-Time 60000 - 80000 £ / year (est.) Home office (partial)
Synthesia

At a Glance

  • Tasks: Develop and maintain data processing pipelines using machine learning techniques.
  • Company: Join Synthesia, the world’s #1 AI video generation platform and a newly minted Unicorn!
  • Benefits: Competitive salary, flexible working hours, and opportunities for professional growth.
  • Other info: Collaborate with top researchers and engineers in a fast-paced, innovative environment.
  • Why this job: Make a real impact in AI and video content creation while working with cutting-edge technology.
  • Qualifications: 3+ years of experience in data engineering and strong coding skills in Python.

The predicted salary is between 60000 - 80000 £ per year.

About Synthesia

Synthesia is the world’s #1 AI video generation platform. It is a video production studio in a browser, allowing users to create personalised videos by choosing an avatar and entering a script in one of 60 languages. Our mission is to turn cameras into code and make everyone a creator.

About The Role

We are looking for an experienced Machine Learning Data Engineer who loves dealing with large quantities of text and audio data. The successful candidate will be proficient in using machine learning techniques to build data processing pipelines and prepare ready-to-train datasets for large models. This role provides a unique opportunity to make a high-impact contribution at the intersection of AI, Machine Learning, and Large Data.

You will be someone who loves to code and build working systems, with experience in the software development life cycle from ideation through implementation, to testing and release. You will join a group of more than 40 Engineers in the R&D department and collaborate with multiple research teams across diverse areas.

What will you be doing?

  • Designing, developing, and maintaining data processing pipelines, utilising machine learning techniques to handle vast amounts of text and audio data while ensuring data quality and accessibility.
  • Leveraging your understanding of machine learning algorithms and workflows to prepare data effectively for usage in large scale models.
  • Using Big Data tools and frameworks to process, analyse, and derive insights from structured and unstructured data.
  • Collaborating with other ML Engineers and Researchers to understand their data requirements and provide them with ready-to-train datasets.
  • Monitoring the performance of data pipelines and machine learning models, troubleshooting data-related issues, and performing root cause analysis to implement strategic solutions.
  • Staying up-to-date with emerging technologies and tools in machine learning and data engineering to continually improve our data infrastructure.
  • Documenting data pipeline architecture and workflow, presenting findings to relevant stakeholders, and providing training as needed.

Who are you?

  • You have a background in Computer Science, Engineering, or a related field with 3+ years of experience.
  • Proven experience as a Data Engineer, or similar role, with a demonstrated history in designing and building scalable data pipelines using Machine Learning techniques.
  • Familiarity with audio data processing and voice technologies is highly desirable.
  • You have excellent coding skills in Python and are passionate about the software development side of things.
  • You have solid proficiency in Unix-like command line operations, including the creation and execution of both quick one-liners and complex bash scripts.
  • You put emphasis on documenting your work in a clear and concise manner.
  • Ability to work effectively in a fast-paced, agile environment.
  • You have excellent verbal and written communication skills and are passionate about what you do!

Nice to have…

  • Transformers, Huggingface, Whisper ASR.
  • Multi-threaded Python AWS framework.

Machine Learning Data Engineer employer: Synthesia

At Synthesia, we pride ourselves on being a pioneering employer in the AI video generation space, offering a dynamic and innovative work culture that fosters creativity and collaboration. As a Machine Learning Data Engineer, you'll have the opportunity to work alongside leading experts in the field, contribute to groundbreaking projects, and enjoy a range of benefits including flexible working arrangements and professional development opportunities. Join us in our mission to revolutionise media creation while being part of a fast-paced, supportive environment that values your contributions and growth.

Synthesia

Contact Details:

Synthesia Recruitment Team

StudySmarter Expert Advice🤫

We think this is how you could land Machine Learning Data Engineer

Tip Number 1

Network like a pro! Reach out to folks in the industry, especially those at Synthesia. LinkedIn is your best mate here – drop them a message, ask about their experiences, and express your interest in the Machine Learning Data Engineer role.

Tip Number 2

Show off your skills! If you’ve got a portfolio or GitHub with projects related to machine learning or data engineering, make sure to highlight that. It’s a great way to demonstrate your coding chops and passion for the field.

Tip Number 3

Prepare for the interview by brushing up on relevant technologies and tools mentioned in the job description. Be ready to discuss your experience with data pipelines and machine learning techniques, as well as any challenges you've faced and how you overcame them.

Tip Number 4

Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, it shows you’re genuinely interested in joining the Synthesia team.

We think you need these skills to ace Machine Learning Data Engineer

Machine Learning Techniques
Data Processing Pipelines
Text and Audio Data Handling
Big Data Tools and Frameworks
Python Programming
Unix Command Line Operations
Data Quality Assurance

Some tips for your application 🫡

Tailor Your CV:Make sure your CV is tailored to the Machine Learning Data Engineer role. Highlight relevant experience, especially with data processing pipelines and machine learning techniques. We want to see how your skills align with what we're looking for!

Craft a Compelling Cover Letter:Your cover letter is your chance to shine! Share your passion for AI and machine learning, and explain why you’re excited about joining Synthesia. Let us know how you can contribute to our mission of making video content creation accessible for everyone.

Showcase Your Projects:If you've worked on any cool projects related to machine learning or data engineering, don’t hold back! Include links to your GitHub or any relevant portfolios. We love seeing practical examples of your work and how you tackle challenges.

Apply Through Our Website:We encourage you to apply directly through our website. It’s the best way to ensure your application gets into the right hands. Plus, it shows us you’re serious about joining our team at Synthesia!

How to prepare for a job interview at Synthesia

Know Your Stuff

Make sure you brush up on your machine learning algorithms and data processing techniques. Be ready to discuss your experience with tools like Transformers, Whisper, and any relevant frameworks. Synthesia is looking for someone who can hit the ground running, so showing off your knowledge will definitely impress them.

Showcase Your Projects

Prepare to talk about specific projects where you've built scalable data pipelines or worked with audio data. Bring examples of your code or even a portfolio if you have one. This will help demonstrate your hands-on experience and passion for the role.

Ask Smart Questions

During the interview, don’t hesitate to ask insightful questions about their current projects or challenges they face in data processing. This shows that you're genuinely interested in the role and helps you understand how you can contribute to their mission.

Communicate Clearly

Since you'll be collaborating with various teams, it's crucial to demonstrate your communication skills. Practice explaining complex concepts in simple terms, as this will show that you can effectively share your ideas and findings with others.