scCIPHER - Contextual Deep Learning on Single-Cell Knowledge Graphs for Precision Medicine in Neurological Disorders - Zitnik Lab

scCIPHER - Contextual Deep Learning on Single-Cell Knowledge Graphs for Precision Medicine in Neurological Disorders

Neurological disorders are the leading driver of global disability and cause 16.8% of global mortality. Unfortunately, most lack disease-modifying treatments or cures. To address disease complexity and heterogeneity in neurological disease, we developed scCIPHER, an AI approach for Contextually Informed Precision HEalthcaRe using deep learning on single-cell knowledge graphs.

We constructed the Neurological Disease Knowledge Graph (NeuroKG), a neurobiological knowledge graph with 132K nodes and 3.98 million edges, by integrating 20 high-quality primary data sources with single-cell RNA-sequencing data from 3.37 million cells across 106 regions of the adult human brain. Next, we pre-trained a heterogeneous graph transformer on NeuroKG to create scCIPHER.

We leverage scCIPHER to make precision medicine-based predictions in neurological disorders across patient phenotyping, therapeutic response prediction, and causal gene discovery tasks, with validation in large-scale patient cohorts.

Publication

This is an ongoing research project.

Code Availability

Pytorch implementation of scCIPHER is available in the GitHub repository.

Authors

We are grateful to our collaborators, including Noa Dagan (Clalit Research Institute), Valentina Giunchiglia (Harvard Medical School), the Khurana Laboratory (Brigham and Women’s Hospital), and the Church Laboratory (Wyss Institute for Biologically Inspired Engineering).

Latest News

Jul 2025: Launching CUREBench

Launched CUREBench, the first competition in AI reasoning for therapeutics. Colocated with NeurIPS 2025. Start at https://curebench.ai.

Jul 2025: Launching TxAgent Evaluation Portal

Launched TxAgent evaluation portal, our global evaluation of AI for drug decision-making and therapeutic reasoning. Participate in TxAgent evaluations! [TxAgent project]

Jul 2025: SPATIA Model of Spatial Cell Phenotypes

New preprint on SPATIA: a multimodal model for pediction and generation of spatial cell phenotypes.

Jul 2025: AI-Enabled Drug Discovery Reaches Clinical Milestone

New piece in Nature Medicine on AI-enabled drug discovery reaching a clinical milestone.

Jun 2025: Knowledge Tracing for Biomedical AI Education

New preprint on biologically inspired architecture for knowledge tracing. The study on the use of generative AI in education with prospective evaluation of knowledge tracing in the classroom.

Jun 2025: Few shot learning for rare disease diagnosis

New paper in npj Digital Medicine: Few shot learning for phenotype-driven diagnosis of patients with rare genetic diseases.

Jun 2025: One Patient, Many Contexts: Scaling Medical AI

New preprint: One patient, many contexts: Scaling medical AI through contextual intelligence

Jun 2025: ToolUniverse - 211+ Tools for "AI Scientist" Agents

ToolUniverse now offers access to over 211 cutting-edge biological and medical tools, all integrated with Model Context Protocol (MCP). Any “AI Scientist” agent can tap into these tools for biomedical research. [Tutorial] [ToolUniverse] [TxAgent]

May 2025: What Perturbation Can Reverse Disease Effects?

In press at Nature Biomedical Engineering: PDGrapher AI predicts chemicals to reverse disease phenotypic effects — with applications to drug target identification.

May 2025: Decision Transformers for Cell Reprogramming

New preprint: Decision transformers for generating reach-avoid policies in sequential decision making — with applications from robotics to cell reprogramming.

May 2025: COMPASS: Immunotherapy Outcome Prediction

New preprint introducing COMPASS: A generalizable AI that predicts immunotherapy outcomes across cancers and treatments. [Project website] [Code]

Apr 2025: ATOMICA and TxAgent on the Kempner Blog

Check out the Kempner Deeper Learning posts describing our latest ATOMICA and TxAgent AI models.

Apr 2025: ATOMICA - A Universal Model of Molecular Interactions

New preprint introducing ATOMICA, a universal model of intermolecular interactions across proteins, small molecules, ions, peptides, RNA, and DNA. [Kempner Institute]

Mar 2025: On Biomedical AI in Harvard Gazette

Read about AI in medicine in the latest Harvard Gazette and New York Times.

Mar 2025: TxAgent: AI Agent for Therapeutic Reasoning

TxAgent is an AI agent for therapeutic reasoning that consolidates 211 tools from trusted sources, including all US FDA-approved drugs since 1939 and validated clinical insights. [Project website] [TxAgent] [ToolUniverse]

Mar 2025: Multimodal AI predicts clinical outcomes of drug combinations from preclinical data

New paper: Multimodal AI approach for designing combination therapies with improved predictive accuracy and clinical relevance. [Project website]

Mar 2025: KGARevion: AI Agent for Knowledge-Intensive Biomedical QA

KGARevion is an AI agent designed for complex biomedical QA that integrates the non-codified knowledge of LLMs with the structured, codified knowledge found in knowledge graphs. [ICLR 2025 publication]

Feb 2025: MedTok: Unlocking Medical Codes for GenAI

Meet MedTok, a multimodal medical code tokenizer that transforms how AI understands structured medical data. By integrating textual descriptions and relational contexts, MedTok enhances tokenization for transformer-based models—powering everything from EHR foundation models to medical QA. [Project website]

Feb 2025: What If You Could Rewrite Biology? Meet CLEF

What if we could anticipate molecular and medical changes before they happen? Introducing CLEF, an approach for counterfactual generation in biological and medical sequence models. [Project website]

Feb 2025: Digital Twins as Global Health and Disease Models

New paper on the role of digital twins as global health and disease learning models for preventive and personalized medicine.

Tweets

Tweets by marinkazitnik

Zitnik Lab · Artificial Intelligence in Medicine and Science · Harvard · Department of Biomedical Informatics