
This full-time Software Engineer role supports the Data Infrastructure and Acquisition team within Speechify's AI division, focusing on building petabyte-scale datasets for text-to-speech model training. Key responsibilities include sourcing new audio data, managing cloud ingestion pipelines on GCP using Terraform, and collaborating with scientists to optimize data quality, throughput, and cost. The position offers the appeal of working in a 100% distributed environment with a hands-off management style, allowing engineers to shape product direction in a fast-growing, entrepreneurial culture. The role provides the opportunity to make a significant impact on a transformative product that assists millions of users with learning differences.


