About the role
Who you are
- Ph.D. in Computer Science, High-Performance Computing, or a related field
- 3–5 years of hands-on experience, preferably in the private sector, working on one or more of the following:
  - Probabilistic or causal modeling
  - Large-scale graph algorithms
  - Graph neural networks
- Experience in processing and curating multi-modal data—including large-scale omics, clinical datasets, and scientific literature
- Proficiency in running analyses and training machine learning or deep learning models in high-performance computing (HPC) environments, particularly those using GPUs
- Strong collaboration mindset, with the ability to identify problems and communicate technical concepts clearly to both technical and non-technical stakeholders
- Demonstrated ability to dive deep into technically complex problems and a track record of driving initiatives through to completion
- Familiarity with advanced AI concepts, including:
  - Generative AI (LLMs, Biological Foundation Models)
  - Probabilistic Graphical Models (e.g., Bayesian Networks, Markov Networks, deep learning extensions)
  - Causal inference (e.g., do-calculus, recent developments in causal discovery)
- Experience with cloud platforms such as Google Cloud Platform (GCP) or AWS for data storage and compute
- Working knowledge of graph databases and graph data structures
- Basic understanding of molecular biology concepts, particularly the central dogma (DNA, RNA, protein), and related high-throughput technologies such as RNA-seq, epigenomics, single-cell and spatial omics
- Strong publication record in peer-reviewed venues (e.g., NeurIPS, ICML, ICLR, CVPR, ECCV, ICCV)
- Willingness to travel up to 25% for conferences, customer engagements, team offsites, or internal meetings
What the job involves
- We are looking for an experienced and innovative Machine Learning Engineer to drive causal inference capabilities across complex biological systems using multi-modal datasets—including omics data, clinical information, and physics-based simulations
- In this role, you will design and build causal machine learning systems that enable a deeper understanding of biological mechanisms and accelerate scientific discovery
- You will bring expertise in probabilistic graphical models, large-scale graph algorithms, and deep learning techniques for causal discovery, and collaborate closely within a high-performing, interdisciplinary team of drug discovery scientists, computational chemists, physicists, AI researchers, bioinformaticians, and software engineers
- Develop robust, scalable software systems that enable large-scale causal reasoning
- Design and implement algorithms to advance understanding of causality in complex biological systems
- Apply advanced graph-based reasoning techniques—including Graph Neural Networks, Probabilistic Graphical Models, and LLMs—for querying and inference over large-scale causal biomedical knowledge graphs constructed from simulation, omics data, and literature
- Identify, ingest, and curate relevant data sources. Own data quality control, validation, and integration workflows
- Research and prototype novel bioinformatics and deep learning approaches to interpret human genetic variants, gene regulation mechanisms, gene expression dynamics, and disease pathways using diverse multimodal data (e.g., clinical phenotypes, medical records, multi-omics, single-cell data, proteomics, genomics)
- Communicate complex ideas effectively across audiences, including internal collaborators, external stakeholders, and clients—tailoring technical depth as needed
- Contribute to the scientific community through patent filings, peer-reviewed publications, white papers, and conference presentations
Benefits
- Stock options
- Unlimited vacation (PTO) and flexible work culture
- Company-wide winter and summer breaks to unwind
- Best-in-class health plans: medical, dental and orthodontics, and vision
- Family planning and fertility benefits
- Military leave
- 100% paid parental leave
- 401k with company matching
- Education stipends
- Financial wellness resources
About SandboxAQ
In the tech and data intelligence worlds, a sandbox is where innovation is born. It’s a place where the brightest free-thinking minds from across disciplines come together to reimagine what’s possible. A collaborative environment where the whole is infinitely greater than the sum of the parts.
At SandboxAQ, this forward-looking vision is core to everything we do. It’s how we became who we are and it’s how we know our solutions can shift the way your business competes in tomorrow’s marketplace. As the world enters the third quantum revolution, AI + Quantum software will address significant business and scientific challenges. SandboxAQ is a B2B company delivering AI solutions that address some of the world's greatest challenges. The company’s Large Quantitative Models (LQMs) deliver critical advances in life sciences, financial services, navigation, cyber and other sectors. The company emerged from Alphabet Inc. as an independent, growth capital-backed company in 2022, funded by leading investors including T. Rowe Price, Eric Schmidt, Breyer Capital, Guggenheim Partners, Marc Benioff, Thomas Tull, Paladin Capital Group, and others.