Senior Machine Learning Ops Engineer
airSlate
This job is no longer accepting applications
See open jobs at airSlate.See open jobs similar to "Senior Machine Learning Ops Engineer" General Catalyst.Software Engineering, Operations
Wrocław, Poland
What you'll be working on:
- Building the Foundation: Design, make, and automate the data backbone for our ML initiatives, creating robust pipelines for both model training and real-time inference using diverse structured and unstructured data.
- Productionizing Machine Learning: Take models from concept to reality by developing production-ready code, architecting serving infrastructure, and implementing sophisticated deployment strategies.
- ML as a service: Develop machine learning solutions to enhance reuse, eliminate unnecessary duplication of solutions within the company, and accelerate the delivery of machine learning applications.
- Automating the ML Lifecycle: Architect and implement our MLOps framework, including CI/CD/CT (Continuous Integration, Delivery, and Training) pipelines to automate the testing, deployment, and retraining of our models.
- Ensuring Reliability & Performance: Establish a comprehensive monitoring practice for our deployed models, tracking performance, health, and key metrics in real-time to ensure accuracy and uptime.
- Driving Reproducibility: Implement and manage the tools and processes for model versioning, artifact management, and experiment tracking to ensure all our ML work is reproducible and traceable.
- Managing Cloud Infrastructure: Own and optimize the AWS cloud resources that power our ML workloads, ensuring they are scalable, secure, and cost-effective.
- Collaborating for Impact: Partner closely with Data Scientists, Data Analytics, Product team, business leaders, and other engineers to translate complex business needs into scalable, end-to-end machine learning solutions.
What we expect from you:
- Fluent in Python.
- A minimum of 3 years of experience in an MLOps engineering role.
- Experience with AWS, particularly with AWS SageMaker.
- Knowledge of machine learning techniques and model lifecycle management.
- Experience with software engineering principles, including unit testing and version control (e.g., Git).
- Experience in monitoring machine learning models in production, with a focus on logging, metrics, and alerting.
- Knowledge and experience with Vector Stores.
- Familiarity with machine learning frameworks (e.g., TensorFlow, PyTorch) and data libraries (e.g., Pandas, NumPy, Apache Spark).
- Understanding and experience in using CI/CD/CT tools (e.g., GitHub Actions, Jenkins) and containerization technologies (e.g., Docker).
Nice to have:
- Experience withTerraform.
This job is no longer accepting applications
See open jobs at airSlate.See open jobs similar to "Senior Machine Learning Ops Engineer" General Catalyst.