Senior Data Scientist (NLP and LLMs), AI Research
Brainly
This job is no longer accepting applications
See open jobs at Brainly.See open jobs similar to "Senior Data Scientist (NLP and LLMs), AI Research" General Catalyst.NOTICE: ONLINE RECRUITMENT PROCESS
LOCATION: REMOTELY FROM SPAIN
We're looking for a person who would join AI Research Team.
It is dedicated to bridging the gap between machine learning and the business domain, acting as an incubator for new AI initiatives, feasibility studies on state-of-the-art solutions implemented in our domain, discovering new AI opportunities, supporting existing projects, and providing research capabilities to our leadership, collaborating with them using our data science and analytics expertise to add value and ensure Brainly's success.
Specifically, our AI strategy and roadmap are investing more and more in our capacity to best exploit modern LLMs for question answering, quality assurance, and other educational tasks, and build domain-specific layers around them (e.g. learners' personalization) which is one of our main areas of research focus.
Our AI Research team also acts as the Center of Excellence for the Data Science domain, owning the career ladder and qualifiers of the Data Scientist tracks, developing tools, standards, and conventions used by all of the data science practitioners, leading internal community initiatives (e.g. seminars, learning and sharing sessions), and providing technical mentorship and resources for career development.
You will have the chance to work with top-class scientists, engineers, and domain experts, and to drive the data science and research processes of our LLM-based product features end-to-end.
The ideal candidate is an enthusiast of the educational domain with a blend of coding, machine learning, and statistics skillset.
WHAT YOU'LL DO
-
Conduct dedicated research and experiments with the latest state-of-the-art of Machine Learning (including NLP, LLMs, Computer Vision, traditional statistical learning…) applied to Brainly data and education domain
Inform the CTO and other stakeholders about decisions related to AI strategy and roadmap
Provide suggestions for new utilization of AI (e.g. LLMs use cases) as part of Brainly product features or optimization of internal processes
Develop proof-of-concepts or prototypes that can be further engineered and productized
Train and share knowledge with the rest of ml practitioners on those advances
Develop reusable tools, and scientific and programmatic methodologies for rapid experimentation and evaluation of a variety of AI applications e.g. question answering, content quality assurance, language chains, embeddings, personalization, classification, tagging, entity extraction, summarization, paraphrasing, text cleaning, ranking/comparison, information retrieval, object detection
Partner and rapidly provide data science support and programmatic tools to the AI Operations team and other human subject matter experts to produce the ground truth datasets required to validate our hypothesis or train/calibrate our algorithms
Assess the behavior of the current state of Brainly technology used in production, or in development environments, and provide advanced insights about strengths, weaknesses, biases, content characteristics, users' intentions, and areas for improvement e.g. x-raying our internal GPT-based AnswerBot solution over different cohorts
Collaborate with other teams in the rest of the company to provide consulting, initial research, prototypes, and recommendations of which AI/ML techniques and industry practices to implement as part of the R&D of new or existing projects. Enable Brainly employees who are dealing with AI technologies to learn how to use them, what to expect from it, and what kind of custom QA layers are required on top
WHAT MAKES YOU THE PERFECT CANDIDATE
4+ years of experience with Deep Learning models for NLP and transformers architecture
4+ years of working experience in Python and the PyData stack or other numerical programming languages
Experience with analyzing and producing insights from digital product datasets using both qualitative and quantitative techniques
Experience with modern Cloud Computing (preferably AWS)
Fluent verbal and written English skills
Ability to think strategically, connecting the dots in the big picture, framing the right problem, balancing trade-offs, and producing actionable insights for our decision-makers
Ability to synthesize key messages and action items in the form of executive summaries (in different forms) and present complex ideas and technical findings to non-technical audiences or with the language of a C-level
Ability to convey complex analyses with the most efficient and intuitive visual interactions and data storytelling
WHAT WILL BLOW OUR MINDS
Experience with cooperating and communicating with the top management to inform decisions via data science
Experience with prompt engineering, fine-tuning, or evaluating LLMs in production
Experience with HuggingFace transformers or other deep learning models
Experience with text mining and text analytics
Experience with data engineering, ETL jobs, or feature engineering
Knowledge of at least some of the data engineering technologies such as Spark, DataBricks, Glue, EMR, Docker, Kubernetes, SQL, key-value stores, Redshift, Snowflake
Being familiar with ML technologies such as AWS SageMaker, Tensorflow Extended, PyTorch, Spark ML, scikit-learn, XGBoost, KubeFlow, Neptune, Flyte, MLFlow, or related frameworks
WHAT YOU GET BY JOINING BRAINLY
We want to see you grow along with us – you will have 800$ per year for personal development, extra time for attending conferences and workshops, and unlimited access to an online learning platform (courses from Coursera, Udacity, Udemy, Harvard ManageMentor, and many others!)
Health is important, which is why at Brainly, we fully cover private health & dental care packages for you and your family and provide you with a sport card (Andjoy)
You will also get an access to online individual psychological consultations with professionals in English via the Mental Health Helpline
Your personal concierge AskHenry will support you in your daily duties, eg. planning your dream vacation
You can join internal communities and contribute to charity, diversity and inclusion initiatives, take part in great internal events or represent Brainly at conferences or meet-ups.
We also provide stock options
By sending us your application you agree that Brainly sp. z o.o. will process your personal data to participate in this recruitment process. If you want to know more about how Brainly processes your personal data please click here.
ABOUT BRAINLY
Brainly is the #1 AI education tool in the world, with a vision to give every student in the world access to personalized learning, no matter their background or resources.
Brainly’s full-service AI Learning Companion™ is relied upon by more than 15 million daily users; students, parents and teachers for personalized, on-demand academic assistance. The platform provides world-class homework help, test prep and tutoring that is verified for accuracy and customized to each student based on their learning style.
Founded in 2009, Brainly operates in the US, Europe, Asia and Latin America, and is backed by Prosus, Point Nine Capital, General Catalyst, Runa Capital, Learn Capital and Kulczyk Investments.
Learn more at www.brainly.com.
This job is no longer accepting applications
See open jobs at Brainly.See open jobs similar to "Senior Data Scientist (NLP and LLMs), AI Research" General Catalyst.