Machine Learning Engineer (Speech/Audio Focus)
Remote | Full-Time |
We’re working with a fast-growing, purpose-driven technology company that’s transforming how people around the world engage with meaningful content through voice and sound. Their mission is rooted in accessibility, precision, and innovation—using AI to support deep learning and personal development at scale.
As their platform rapidly expands its global reach, they’re looking to bring on a skilled Machine Learning Engineer with deep experience in audio processing and speech recognition. This is a unique opportunity to build models that impact millions of users while working with a team that values clarity, impact, and modern ML practices.
What You’ll Be Doing
- Design, train, and fine-tune machine learning models focused on automatic speech recognition (ASR)
- Leverage large audio datasets to improve model accuracy, efficiency, and real-world usability
- Collaborate with engineers and product teams to integrate models directly into mobile and web applications
- Use tools like PyTorch and NVIDIA NeMo to develop robust speech recognition pipelines
- Schedule and manage GPU-based training workloads with SLURM
- Store and access training data through Amazon S3
- Apply preprocessing and augmentation techniques to optimize audio inputs
- Monitor system performance and implement continuous improvements based on real-world usage
- Stay informed on the latest trends in deep learning, ASR, and model deployment
What We’re Looking For
- 3+ years of experience in machine learning, with a strong focus on deep learning and audio-based models
- Strong command of Python and experience using PyTorch in production environments
- Experience with SLURM for distributed training workload scheduling
- Comfortable working with cloud-based data solutions like Amazon S3
- Solid grasp of audio signal processing and data augmentation best practices
- Ability to work with large, high-dimensional datasets
- Sharp problem-solving skills and attention to detail
- Strong written and verbal communication; collaborative working style
- Availability for remote meetings Monday through Thursday between 8am–12pm Eastern
Bonus Points For
- Experience developing or deploying ASR (speech recognition) systems
- Hands-on use of NVIDIA NeMo for speech model development
- Exposure to natural language processing (NLP) or language modeling
- Familiarity with cloud environments like AWS or GCP
- Appreciation for the cultural and phonetic intricacies of recited content (not required)
Why You’ll Love This Opportunity
-
Remote-first flexibility – work from anywhere
- A product with real-world impact and a global user base
- A lean, passionate team with a clear mission and shared values
- Work on challenges that combine machine learning, culture, and scale
- Competitive compensation with potential equity
- Annual retreats and a flexible time-off policy
This role is for an innovative client making waves in the AI and speech tech space. If you're excited to apply your expertise to a product with purpose, we’d love to hear from you.