Join our companies in their quest to drive powerful, positive, change that endures.

Senior Machine Learning Engineer- Data

Luma AI

Luma AI

Software Engineering
Palo Alto, CA, USA
Posted on Jun 26, 2024
As MLE on Luma's Data team you are responsible for raising the bar for our data quality. Data is the critical foundation of our products, and we are looking for individuals who can identify creative approaches to data and captioning and then implement solutions for processing at PB scale. Good candidates should have exceptional general python engineering skills alongside a combination of industry ML experience, Data experience, and passion for building AI products.

Responsibilities

  • Design data pipelines, including finding appropriate data sources, scraping, filtering, post-processing, de-duplicating, and versioning. The system should be robust and scalable for production use.
  • Design and implement frameworks to evaluate the effectiveness of our models and data. For example, set up the standards for an automated evaluation pipeline to run before any new model gets deployed into the API.
  • Work closely with others who might be data contributors or consumers or both to incorporate their data usage needs on a variety of tasks and domains.
  • Work with human labeling vendors to refine the procedure and guidelines to collect high-quality human annotation data.
  • Conduct open-ended research to improve the quality of collected data, including but not limited to, semi-supervised learning, human-in-the-loop machine learning and fine-tuning with human feedback.

Experience

  • 5+ years of relevant experience or demonstration of high impact projects as a Data Engineer, Machine Learning Engineer, or Data Scientist, dealing with large amounts of data on a daily basis.
  • Have a strong belief in the criticality of high-quality data and are highly motivated to work with the associated challenges.
  • Have experience working in large distributed systems.
  • Strong generalist python and pytorch skills
  • Experience using SQL, Spark, or other tools for processing large amounts of data.
  • Please note this role is not meant for recent grads.

Compensation

  • The pay range for this position in California is $180,000 - $250,000yr; however, base pay offered may vary depending on job-related knowledge, skills, candidate location, and experience. We also offer competitive equity packages in the form of stock options and a comprehensive benefits plan.
Your application is reviewed by real people.