S
Speechify
Backend
Software Engineer, Data Infrastructure & Acquisition
PythonDockerGCPInfrastructure-As-CodeBash
About the Position
The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading into audio. This role is responsible for all aspects of data collection to support model training operations, involving infrastructure, engineering, and research work.
Responsibilities
- Find new sources of audio data and bring it into ingestion pipeline
- Operate and extend the cloud infrastructure for ingestion pipeline
- Collaborate with AI Scientists to improve data at scale and lower cost
- Craft the AI Team’s dataset roadmap
Requirements
- BS/MS/PhD in Computer Science or related field
- 5+ years of industry experience in software development
- Proficiency with bash/Python scripting in Linux environments
- Proficiency in Docker and Infrastructure-as-Code
- Experience with major Cloud Providers (GCP preferred)
- Experience with web crawlers and large-scale data processing workflows
Benefits
- Competitive salaries
- Friendly and laid-back atmosphere
- Commitment to building a great asynchronous culture
- Opportunity to work on a life-changing product
- Fast-growing environment with entrepreneurial-minded team
- Flexible work in a remote setting
Software Engineer, Data Infrastructure & Acquisition