At a Glance
- Tasks: Join our AI team to build and scale massive datasets for innovative text-to-speech products.
- Company: Speechify, a leading tech company transforming reading accessibility.
- Benefits: Competitive salary, remote work, and a supportive entrepreneurial culture.
- Other info: Work in a fully distributed team with excellent career growth opportunities.
- Why this job: Make a real impact in a transformative industry that helps millions with learning differences.
- Qualifications: 5+ years in software development, proficiency in Python, and cloud infrastructure experience.
The predicted salary is between 60000 - 80000 £ per year.
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Google Docs, news articles, websites – into audio, so they can read faster, read more, and remember more. Speechify’s text-to-speech reading products include its iOS app, Android App, Mac App, Chrome Extension, and Web App.
Google recently named Speechify the Chrome Extension of the Year and Apple named Speechify its 2025 Design Award winner for Inclusivity. Today, nearly 200 people around the globe work on Speechify in a 100% distributed setting – Speechify has no office. These include frontend and backend engineers, AI research scientists, and others from Amazon, Microsoft, and Google, leading PhD programs like Stanford, high growth startups like Stripe, Vercel, Bolt, and many founders of their own companies.
Overview: We're looking to hire for our Data side of our AI team at Speechify. This role is responsible for all aspects of data collection to support our model training operations. We are able to build high-quality datasets at petabyte-scale and low cost through a tight integration of infrastructure, engineering, and research work. We are looking for a skilled Software Engineer to join us.
What You’ll Do:
- Be scrappy to find new sources of audio data and bring it into our ingestion pipeline.
- Operate and extend the cloud infrastructure for our ingestion pipeline, currently running on GCP and managed with Terraform.
- Collaborate closely with our Scientists to shift the cost/throughput/quality frontier, delivering richer data at bigger scale and lower cost to power our next-generation models.
- Collaborate with others on the AI Team and Speechify Leadership to craft the AI Team’s dataset roadmap to power Speechify’s next-generation consumer and enterprise products.
An Ideal Candidate Should Have:
- BS/MS/PhD in Computer Science or a related field.
- 5+ years of industry experience in software development.
- Proficiency with bash/Python scripting in Linux environments.
- Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP).
- Experience with web crawlers, large-scale data processing workflows is a plus.
- Ability to handle multiple tasks and adapt to changing priorities.
- Strong communication skills, both written and verbal.
What we offer:
- A fast-growing environment where you can help shape the company and product.
- An entrepreneurial-minded team that supports risk, intuition, and hustle.
- A hands-off management approach so you can focus and do your best work.
- An opportunity to make a big impact in a transformative industry.
- Competitive salaries, a friendly and laid-back atmosphere, and a commitment to building a great asynchronous culture.
- Opportunity to work on a life-changing product that millions of people use.
- Build products that directly impact and support people with learning differences like dyslexia, ADD, low vision, concussions, autism, and more.
- Work in one of the fastest-growing sectors of tech, the intersection of artificial intelligence and audio.
Think you’re a good fit for this job? Tell us more about yourself and why you're interested in the role when you apply. And don’t forget to include links to your portfolio and LinkedIn.
Speechify is committed to a diverse and inclusive workplace. Speechify does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status.
Data Infrastructure Engineer: Ingest & Scale Massive Datasets in Leeds employer: Speechify
At Speechify, we pride ourselves on being an exceptional employer that fosters a fast-growing and entrepreneurial environment where innovation thrives. Our fully distributed team allows for a flexible work culture, empowering employees to make significant contributions to a transformative product that impacts millions of users. With competitive salaries, a commitment to inclusivity, and ample opportunities for personal and professional growth, joining Speechify means becoming part of a mission-driven company at the forefront of AI and audio technology.
StudySmarter Expert Advice🤫
We think this is how you could land Data Infrastructure Engineer: Ingest & Scale Massive Datasets in Leeds
✨Tip Number 1
Network like a pro! Reach out to folks in your industry on LinkedIn or at meetups. A friendly chat can lead to opportunities that aren’t even advertised yet.
✨Tip Number 2
Show off your skills! Create a portfolio or GitHub repo showcasing your projects. This gives potential employers a taste of what you can do and sets you apart from the crowd.
✨Tip Number 3
Prepare for interviews by practicing common questions and scenarios related to data infrastructure. The more you rehearse, the more confident you'll feel when it’s showtime!
✨Tip Number 4
Don’t forget to apply through our website! It’s the best way to ensure your application gets seen by the right people. Plus, we love hearing why you want to join our mission!
We think you need these skills to ace Data Infrastructure Engineer: Ingest & Scale Massive Datasets in Leeds
Some tips for your application 🫡
Show Your Passion:When you're writing your application, let your enthusiasm for the role shine through! We want to see why you're excited about joining our team and how you can contribute to our mission at Speechify.
Tailor Your CV:Make sure your CV is tailored to the job description. Highlight relevant experience and skills that match what we're looking for in a Data Infrastructure Engineer. This helps us see how you fit into our vision!
Be Clear and Concise:Keep your application clear and to the point. Use straightforward language and avoid jargon unless it's necessary. We appreciate clarity, and it makes it easier for us to understand your qualifications.
Include Links to Your Work:Don’t forget to include links to your portfolio or any relevant projects you've worked on. This gives us a better idea of your skills and what you can bring to the table. Apply through our website to make sure we get all your details!
How to prepare for a job interview at Speechify
✨Know Your Tech Stack
Familiarise yourself with the technologies mentioned in the job description, especially GCP, Terraform, and Docker. Be ready to discuss your experience with these tools and how you've used them in past projects.
✨Showcase Your Problem-Solving Skills
Prepare to talk about specific technical challenges you've faced in your previous roles. Use the STAR method (Situation, Task, Action, Result) to structure your answers and highlight your problem-solving abilities.
✨Understand the Company’s Mission
Research Speechify's mission to make reading accessible for everyone. Be prepared to discuss how your role as a Data Infrastructure Engineer can contribute to this goal and why it resonates with you personally.
✨Ask Insightful Questions
Prepare thoughtful questions about the team dynamics, the AI team's dataset roadmap, and how your role will impact the company's future. This shows your genuine interest in the position and helps you assess if it's the right fit for you.