Mid Data Scientist (NLP and Computer Vision), Production Engineering

Brainly

Brainly

Software Engineering, Data Science
Poland
Posted 6+ months ago

NOTICE: ONLINE RECRUITMENT PROCESS

LOCATION: REMOTELY FROM POLAND

SALARY: 17 500 - 35 000 PLN gross/monthly

ROLE OVERVIEW

The AI Department represents a major investment toward implementing an AI-powered Predictive Intervention platform. The platform will enable every student to receive tailored recommendations satisfying their current and future educational needs and provide successful and specific learning paths.

In order to get there, we are building the fundamental pillars of our AI strategy by having dedicated teams owning different systems of our technology portfolio and in particular we are heavily investing in natural language processing technologies.

The Answer Platform ML team’s vision is to build the necessary systems to provide accurate, high-quality, and tailored answers to all the questions our users face in their daily educational journey.

The team is responsible for providing the machine learning capabilities to:

  • Route textual questions and educational content to the appropriate service that provides the best answers.

  • Increase the coverage of answers our users are provided with.

  • Actively contribute to the answering services and individual components that build up our answering and search platform.

  • Improve and introduce new functionalities to classify images and extract visual information to improve retrieval precision and personalize the answering results.

  • Deploy, maintain, and own the models in production.

This is a highly technical individual contributor role where you will work alongside a technical lead and other individual contributors within the team. The team consists of 2 Machine Learning Engineers and 2 Data Scientists, and you will be contributing heavily to the data science capabilities of our answering platform and state-of-the-art NLP algorithms and LLMs using Python, HuggingFace transformers, PyTorch, Sagemaker, and other AWS cloud services. You will have partial ownership of the development of the models developed by the team. In contrast, the team at large has full ownership of all aspects of the model development lifecycle, from initial EDA to deployment and in-production monitoring.

You will also work closely with other teams to integrate and facilitate the consumption of the deployed models. In Brainly, individual contributors are also expected to contribute to their team’s project scoping and actively partake in stakeholder management, and communication.

The ideal candidate is an enthusiast of educational technologies with a product-oriented Data Science & Machine Learning background.

Are you able to work fast, with short feedback loops and in close collaboration with other team members and multiple external stakeholders? Do you have strong analytical thinking - ability to explore data and draw conclusions? Do you have capability to solve problems in an unconventional manner and not getting stuck at obstacles? Do you have a scientific mentality with the ability to ask the right questions and answer them? Are you able to develop, own, and maintain production services, exposing models to various consumers? Are you able to convey complex analyses with the most efficient and intuitive visual interactions and data storytelling? Are you able to stay up to date with the latest academic research and to implement state-of-the-art methods? Do you have hands-on approach and clear communications skills? Are you familiar with agile development and lean principles.? If you answered yes to these questions, you might just be the perfect candidate for this role!

WHAT YOU'LL DO

  • Researching and implementing novel techniques of text classification and text generation using modern approaches like LLMs.

  • Exploring and analyzing multi-dimensional and unstructured data in order to find patterns and relationships.

  • Working closely with the engineers in order to integrate models and algorithms with the larger system.

  • Contributing to the definition of the team roadmap and deliverables.

  • Effectively communicating results to direct stakeholders and the company at large.

  • Working closely with product teams to design new features or improve the functionalities and user experience of the platform.

  • Ensuring statistical and scientific rigor among the whole team.

REQUIRED

  • 3+ years experience with Deep Learning models for NLP or text analytics and/or similar experience in Computer Vision in production, or a comparable industry career, with machine learning, data mining, or statistical modeling.

  • Experience solving business problems using modern Machine Learning techniques.

  • Strong python coding skills and experience writing easily maintainable code and unit tests whenever appropriate.

  • Deep knowledge and understanding of theoretical foundations of modern Machine Learning, specifically Deep Neural Networks, either NLP/LLMs and/or Computer Vision.

  • Machine Learning frameworks such as: Tensorflow or PyTorch, AWS Sagemaker, scikit-learn, Transformers (Huggingface ecosystem).

  • Data analysis and visualization tools such as pandas, plotly, matplotlib or streamlit.

  • Fluency in English.

PREFERRED

  • Bachelor’s degree or above in STEM (science, technology, engineering, mathematics) or a similar field.

  • Experience working with Parameter efficient finetuning of LLMs.

  • Experience with modern Cloud Computing (preferably AWS) or equivalent deployment and infrastructure experience.

  • Experience working with deployment of ML models to production.

  • Experience working on both NLP and CV projects.

  • Background and experience with cloud computing (AWS or equivalent for other cloud providers): EC2, SageMaker, S3, EKS, ECR, Lambda, Batch, StepFunctions.

  • Experience with modern deployment of models, for example using torchserve.

  • Familiarity with basics in Data Engineering: SQL and NoSQL.

  • Familiarity with Docker containers and deployment of models as part of the full Model lifecycle.

WHAT YOU GET BY JOINING BRAINLY

  • We want to see you grow along with us – you will have 800$ per year for personal development, extra time for attending conferences and workshops, and unlimited access to an online learning platform (courses from Coursera, Udacity, Udemy, Harvard ManageMentor, Busuu, and many others!)

  • Health is important, which is why at Brainly, we fully cover private health & dental care packages for you and your family and provide you with a sport card (Multisport Plus

  • You will also get an access to online individual psychological consultations with professionals in English, Polish & Ukrainian via the Mental Health Helpline

  • Your personal concierge AskHenry will support you in your daily duties, eg. planning your dream vacation

  • You can join internal communities and contribute to charity, diversity and inclusion initiatives, take part in great internal events or represent Brainly at conferences or meet-ups

  • We also provide stock options

By sending us your application you agree that Brainly sp. z o.o. will process your personal data to participate in this recruitment process. If you want to know more about how Brainly processes your personal data please click here.

ABOUT BRAINLY

Brainly is the #1 AI education tool in the world, with a vision to give every student in the world access to personalized learning, no matter their background or resources.

Brainly’s full-service AI Learning Companion™ is relied upon by more than 15 million daily users; students, parents and teachers for personalized, on-demand academic assistance. The platform provides world-class homework help, test prep and tutoring that is verified for accuracy and customized to each student based on their learning style.

Founded in 2009, Brainly operates in the US, Europe, Asia and Latin America, and is backed by Prosus, Point Nine Capital, General Catalyst, Runa Capital, Learn Capital and Kulczyk Investments.

Learn more at www.brainly.com.