
This Data Scientist 3 role supports a critical NLP project on automatic tokenization and part-of-speech annotation of government language data. The position requires developing automated solutions that evaluate model performance against human-generated annotations, and extracting value from large, complex datasets. Key responsibilities include designing machine learning algorithms, performing statistical analysis, and translating technical findings into actionable recommendations for non-technical audiences. The role suits candidates who want to apply advanced data science skills to high-quality language processing in a mission-driven environment. An active TS/SCI security clearance is required, and the work draws on government data holdings to advance analytic capabilities.
