Data QA Engineer - Contract Role
Sanas
This job is no longer accepting applications
See open jobs at Sanas.See open jobs similar to "Data QA Engineer - Contract Role" General Catalyst.Quality Assurance
Bengaluru, Karnataka, India
Key Responsibilities:
- Conduct thorough validation of datasets used in model training and evaluation, focusing on transcription accuracy, metadata integrity, and audio quality.
- Review real customer calls and synthetic audio to detect data anomalies such as clipping, silence, incorrect speaker tags, or transcription mismatches.
- Reproduce and document data issues that impact model quality, enabling effective debugging and iteration by research teams.
- Curate, clean, and manage high-quality datasets from a variety of sources including customer calls, synthetic pipelines, and open-source corpora.
- Annotate and label audio with quality issues such as background noise, gender mismatches, speech overlap, silence, or segmentation errors.
- Collaborate with research and engineering teams to enhance data validation tools and scale automation within QA workflows.
- Ensure high standards of data hygiene, consistency, and reproducibility across all Data QA processes.
- Support data-related workflows such as data mining, extraction, transformation, and manipulation.
Must have qualifications:
- 2+ years of experience in Data QA, audio/transcription QA, or related quality assurance fields.
- Exceptional attention to detail with the ability to identify subtle inconsistencies and data quality issues.
- Hands-on experience with audio inspection tools like Audacity, Praat, or similar platforms.
- Familiarity with audio quality aspects such as clipping, background noise, channel imbalance, or a strong willingness to learn.
- Proficiency in handling structured data using tools like Excel, Google Sheets, CSV/JSON, and basic scripting in Python or Bash.
- Strong written communication skills for producing clear, actionable QA documentation and feedback.
- Knowledge of database languages (e.g., SQL) and experience working with DBMS tools like PostgreSQL.
- Demonstrated ability to collaborate effectively with ML researchers, product managers, and customer-facing teams.
This job is no longer accepting applications
See open jobs at Sanas.See open jobs similar to "Data QA Engineer - Contract Role" General Catalyst.