ProCyon
ProCyon is a groundbreaking foundation model for modeling, generating, and predicting protein phenotypes.
Unified Clinical Vocabulary Embeddings
Unified Clinical Vocabulary Embeddings for Advancing Precision Medicine
Madrigal
Multimodal AI Predicts Clinical Outcomes of Drug Combinations from Preclinical Data
KGARevion
Knowledge Graph Based Agent for Complex, Knowledge-Intensive QA in Medicine
PocketFlow
Generalized Protein Pocket Generation with Prior-Informed Flow Matching
Therapeutics Data Commons 2.0
Multimodal Foundation for Therapeutic Science
SPECTRA
Evaluating Generalizability of Artificial Intelligence Models for Molecular Datasets
PDGrapher
Combinatorial Prediction of Therapeutic Perturbations Using Causally-Inspired Neural Networks
TxGNN
A Foundation Model for Clinician Centered Drug Repurposing
TimeX
Encoding Time-Series Explanations through Self-Supervised Model Behavior Consistency
Raincoat
Domain Adaptation for Time Series Under Feature and Label Shifts
SHEPHERD
Deep Learning for Diagnosing Patients with Rare Genetic Diseases
metapaths
Similarity Search in Heterogeneous Knowledge Graphs via Meta Paths
Mutual Interactors
Phenotype Discovery in Molecular Interaction Networks
Raindrop
Graph-Guided Network for Irregularly Sampled Multivariate Time Series
Therapeutics Data Commons
Machine Learning Datasets and Tasks for Drug Discovery and Development
DeepPurpose
Deep learning library for drug-target interaction prediction and applications to drug repurposing and virtual screening
Graph Query Embeddings
Method for embedding logical queries on knowledge graphs
Network-Guided Matrix Completion
Method for probabilistic prediction and imputation of interactions using prior knowledge
Multi-BioNER
Deep multi-task learning for cross-type biomedical named entity recognition
Latest News
Dec 2024: Foundation Model for Protein Phenotypes
Dec 2024: Unified Clinical Vocabulary Embeddings
New paper: A unified resource provides a new representation of clinical knowledge by unifying medical vocabularies. (1) Phenotype risk score analysis across 4.57 million patients, (2) Inter-institutional clinician panels evaluate alignment with clinical knowledge across 90 diseases and 3,000 clinical codes.
Dec 2024: SPECTRA in Nature Machine Intelligence
Are biomedical AI models truly as smart as they seem? SPECTRA is a framework that evaluates models by considering the full spectrum of cross-split overlap: train-test similarity. SPECTRA reveals gaps in benchmarks for molecular sequence data across 19 models, including LLMs, GNNs, diffusion models, and conv nets.
Nov 2024: Ayush Noori Selected as a Rhodes Scholar
Congratulations to Ayush Noori on being named a Rhodes Scholar! Such an incredible achievement!
Nov 2024: PocketGen in Nature Machine Intelligence
Nov 2024: Biomedical AI Agents in Cell
Oct 2024: Activity Cliffs in Molecular Properties
Oct 2024: Knowledge Graph Agent for Medical Reasoning
New paper introducing a knowledge graph agent for complex, knowledge-intensive medical reasoning.
Sep 2024: Three Papers Accepted to NeurIPS
Exciting projects include a unified multi-task time series model, a flow-matching approach for generating protein pockets using geometric priors, and a tokenization method that produces invariant molecular representations for integration into large language models.
Sep 2024: TxGNN Published in Nature Medicine
Graph foundation model for drug repurposing published in Nature Medicine. [Harvard Gazette] [Harvard Medicine News] [Forbes] [NVIDIA] [Kempner Institute] [Harvard Crimson]
Aug 2024: Graph AI in Medicine
Excited to share a new perspective on Graph Artificial Intelligence in Medicine in Annual Reviews.
Aug 2024: How Proteins Behave in Context
Harvard Medicine News on our new AI tool that captures how proteins behave in context. Kempner Institute on how context matters for foundation models in biology.
Jul 2024: PINNACLE in Nature Methods
PINNACLE contextual AI model is published in Nature Methods. Paper. Research Briefing. Project website.
Jul 2024: Digital Twins as Global Health and Disease Models of Individuals
Paper on digitial twins outlining strategies to leverage molecular and computational techniques to construct dynamic digital twins on the scale of populations to individuals.
Jul 2024: Graph Diffusion Convolutions at ICML
Graph diffusion convolution is a geometric deep learning architecture that aggregates information from higher-order network neighbors through a generalized graph diffusion to enhance model robustness to noisy and incomplete datasets. Paper at ICML.
Jul 2024: Three Papers: TrialBench, 3D Structure Design, LLM Editing
Jun 2024: TDC-2: Multimodal Foundation for Therapeutics
The Commons 2.0 (TDC-2) is an overhaul of Therapeutic Data Commons to catalyze research in multimodal models for drug discovery by unifying single-cell biology of diseases, biochemistry of molecules, and effects of drugs through multimodal datasets, AI-powered API endpoints, new tasks and benchmarks. Our paper.
May 2024: Broad MIA: Protein Language Models
Check out our Broad’s seminars on Multimodal protein language models for deciphering protein function.
May 2024: On Knowing a Gene in Cell Systems
Apr 2024: Biomedical AI Agents
We envision ‘AI scientists’ as systems capable of skeptical learning and reasoning that empower biomedical research through collaborative agents that integrate machine learning tools with experimental platforms.
Tweets