Senior Speech Scientist

Sanas

Bengaluru, Karnataka, India
Posted on Wednesday, January 24, 2024
Sanas is revolutionizing the way we communicate with the world’s first real-time algorithm designed to modulate accents, eliminate background noise, and magnify speech clarity. Pioneered by seasoned startup founders with a proven track record of creating and steering multiple unicorn companies, our groundbreaking GDP-shifting technology sets a gold standard. Our initial deployment is laser-focused on elevating the standards of customer experience centers. Testimonials from our partners reveal staggering double-digit improvements in mission-critical KPIs, coupled with boosts in CSAT and NPS. More than just a tool, our technology champions a bias-free workspace. This not only fosters a positive work environment but has also been instrumental in reducing employee attrition and curbing training expenditures.

Sanas is a 70-strong team, established in 2020. In this short span, we’ve successfully secured over $50 million in funding. Our innovations have been supported by the industry’s leading investors, including Insight Partners, Google Ventures, General Catalyst, and Quiet Capital, among others. Our reputation is further solidified by collaborations with numerous Fortune 100 companies. With Sanas, you’re not just adopting a product; you’re investing in the future of communication.

Sanas is seeking a detail-oriented and self-motivated individual to join the ML Analytics Org as a Sr. Speech Scientist. As a seasoned Speech Science professional, you will be responsible for building proofs of concept (PoCs) and solutions for ambiguous and complex problems in speech evaluation. You will work closely with the Linguists, Scientists, and Engineers on the team to deliver high-quality solutions that enable accurate speech evaluations at scale.

Key Responsibilities

  • Create working solutions and PoCs for ambiguous and complex problems in speech evaluation
  • Create solutions to analyze speech data at scale with heuristic-based automation
  • Create data visualizations to support PoCs and communicate a narrative
  • Write technical reports with actionable recommendations for improving model evaluation and performance, enabling data-driven decision making for leaders
  • Create PoCs based on different speech and linguistic features to automate various effort-intensive parts of existing evaluation processes
  • Conduct experiments with data sampling techniques to optimize evaluation test-sets for representativeness and improve evaluation time and cost
  • Create and maintain augmented datasets focused on different speech properties, including prosody, quality, legibility, etc.


Basic Qualifications

  • A master’s degree in Computational Linguistics with good hands-on exposure to Computer Science and Engineering
  • Native or near-native fluency in English with an understanding of accentual nuances
  • Excellent listening, comprehension, writing and presentation skills
  • Excellent organizational and analytical skills and attention to detail
  • Excellent Python or Java coding and Shell scripting skills
  • A 7+ year track record of R&D, publications in top-tier (preferably Q1) journals, and delivery of efficient NLU/NLP/NLG solutions for large-scale enterprises
  • 4+ YoE of data processing for different data modalities, visualization, and analysis
  • 4+ YoE of software development using different development methodologies, including Agile
  • 4+ YoE of version control and iterative software development
  • 4+ YoE of working with diverse, cross-functional, global teams


Preferred Qualifications

  • A PhD in Computational Linguistics or Computer Science
  • Understanding of IPA, SAMPA and other speech annotation frameworks
  • Knowledge of Databases and ETL pipelines
  • Experience in coaching junior team members on functional skills and best practices
  • A passion for problem solving, taking ownership, and delivering results


ML Analytics Organization at SANAS AI

  • SANAS AI's ML Analytics Organization is a dynamic hub of innovation, dedicated to reshaping analytics in speech technology. Comprising diverse professionals, including seasoned Applied Scientists, Speech Scientists, Computational Linguists, Data Linguists, and Software Developers, our team pioneers advanced evaluation metrics and frameworks for speech models. We empower large-scale speech evaluations with a self-serve platform. Additionally, we manage a robust internal data annotation platform that supports diverse data modalities. At the forefront of science initiatives, we drive accuracy, automation, and efficiency, ensuring objective and exhaustive evaluations of speech quality. Our commitment to advancing the field positions us as leaders in revolutionizing the landscape of speech technology.


A day in the life of Sarvana, Speech Scientist at ML Analytics Org

  • As a Speech Scientist on the ML Analytics team, Sarvana uses their in-depth experience in Natural Language Understanding to create solutions and proofs of concept (PoCs) for ambiguous and complex problems across different data modalities. Sarvana’s work involves applying SOTA techniques from academia and industry to internal use cases.
  • These days, Sarvana is working on a data sampling PoC to improve the recall of evaluation test-sets. Sarvana starts their day by analyzing text and speech data to narrow down a set of features, based on word length and amplitude, to run experiments on. Later, Sarvana reviews the experiment results with the wider team to get constructive feedback. Towards the end of the day, Sarvana does a doc review with stakeholders from the Science and ML-Ops teams for one of their PoCs that they now want to productionize.