Join our companies in their quest to drive powerful, positive, change that endures.

","datePosted":"2023-09-27T15:17:32.946Z","validThrough":"2023-11-28","employmentType":[],"hiringOrganization":{"@type":"Organization","name":"CHARM Therapeutics","description":"Harnessing the power of deep learning on 3D molecular configurations to deliver medicines of transformational efficacy to patients.","numberOfEmployees":59,"address":[{"address":{"@type":"PostalAddress","addressLocality":"United Kingdom"}},{"address":{"@type":"PostalAddress","addressLocality":"London, UK"}},{"address":{"@type":"PostalAddress","addressLocality":"Mile End, London, UK"}}],"sameAs":"","url":"","logo":"","memberOf":{"@type":"Organization","name":"General Catalyst","description":"Funds breakthrough ventures and backs exceptional entrepreneurs via capital, business building resources and strategic support.","logo":"","url":""},"keywords":"Biotechnology, Data and Analytics, DeepTech, Software"},"jobLocation":[{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"London, UK"}},{"@type":"Place","address":{"@type":"PostalAddress","addressLocality":"Cambridge, UK"}}]}

Senior Data Engineer

CHARM Therapeutics

CHARM Therapeutics

Data Science
London, UK · Cambridge, UK
Posted on Wednesday, September 27, 2023

Senior Data Engineer


CHARM Therapeutics is a biotech focused on delivering transformational medicines that will address difficult-to-drug targets in key oncogenic pathways. CHARM has developed the first highly accurate, high throughput protein-ligand co-folding algorithm (DragonFold), driven by end-to-end 3D deep learning. Our platform enables the rapid generation of highly differentiated clinical candidates, our ambition is to revolutionise drug discovery. 

CHARM Therapeutics has established a state-of-the-art R&D facility in Cambridge, UK.  We are seeking a Senior Data Engineer with a passion for innovation.  This role will be instrumental in enabling our oncology-focussed drug discovery pipeline through establishing and improving our data platform.  The successful candidate must have excellent interpersonal and cross-functional collaboration skills, and will join a multidisciplinary, highly collaborative team of scientists and engineers to invent novel medicines.



  • Supporting drug design cycles across multiple programs by ensuring data transformations are operating at all times
  • Architect and build a system for ingesting and organising internal and external data, including affinity data and structural data, enabling a drug discovery design cycles
  • Liaising with program teams to optimise data usage in drug discovery, including chemistry and biology assay data, as necessary to support project decisions
  • Coordinating external data curation
  • Continually looking for opportunities to improve the accessibility of project data across the portfolio



  • 3+ years of experience in a biotechnology environment
  • Strong data engineering skills
  • Domain knowledge of key data resources to support drug discovery
    • Familiarity with ChEMBL, GoSTAR or similar databases
  • SQL for relational database querying
  • Demonstrable ETL (Extract, Transform, Load) workflow experience,  standardisation and data wrangling skills in Python
  • Strong team player with an inclusive mindset
  • Proven track record of setting up and maintaining data analytics and visualisation systems
  • Knowledge of collaborative data science environments (e.g. Jupyter Notebook)


Highly desirable skills

  • MSc or higher degree in drug discovery, biochemistry, pharmacology, or medicinal chemistry
  • Knowledge of ontologies
  • Hands on experience of manipulating bioactivity data
  • Familiarity with Electronic Lab Notebooks
  • Familiarity with Linux and command line tools
  • Knowledge of TIBCO Spotfire