CONCERT predicts spatial perturbation responses across tissue niches

Spatial perturbation experiments measure gene expression after targeted edits to provide mechanistic insight and therapeutic hypotheses, but current models cannot simulate how perturbation effects propagate across tissue microenvironments.

We present CONCERT, a virtual cell model for predicting spatial perturbations. CONCERT is a niche-aware generative counterfactual model that embeds the perturbation context and learns spatial kernels using a Gaussian process variational autoencoder to capture the dispersion of perturbation effects across tissue. We define three prediction tasks in spatial perturbation transcriptomics, patch, border, and niche, and benchmark CONCERT on perturb-map lung datasets. CONCERT outperforms seven state-of-the-art models, reducing E-distance by up to 33.77% (patch), 26.05% (border), and 33.74% (niche) versus the next best, with mean absolute error down by up to 23.28% and Pearson correlation up by up to 9.10%.

In a dextran sodium sulfate-induced colitis time course, CONCERT reconstructs spatial gene expression at time points not experimentally measured, enabling coherent longitudinal comparisons from unpaired tissue sections. The model resolves inter-mouse heterogeneity and recovers consistent temporal declines in inflammation-associated genes across tissue regions, revealing shifts in spatial dominance over recovery. In ischemic stroke, CONCERT predicts perturbation effects beyond experimental constraints, including variable lesion sizes and a 3D formulation across brain sections, and captures spatial patterns for lesion-core and perilesion markers. CONCERT supports practical workflows, such as resolution enhancement and in-painting of damaged tissue, while preserving biologically plausible perturbation responses. These capabilities establish CONCERT as an approach for simulating spatial perturbation effects and modeling niche-dependent responses.

CONCERT: Niche-Aware Virtual Cell Model

CONCERT is a niche-aware generative counterfactual model that predicts how a perturbation’s effect disperses across tissue and alters local gene expression. It takes a tissue’s spatial transcriptomics slide(s) as input—spot/cell gene expression, 2D/3D coordinates, and attributes such as perturbation identity, disease state, dose, lesion size, or time—and outputs predicted post-perturbation expression (rGEX) at user-selected locations, including unseen or imputed spots.

The perturbation module disentangles spot attributes into compact latent codes: categorical variables (e.g., CRISPR knockout, niche, slide ID) are embedded; continuous variables (e.g., time or lesion size) pass through small MLPs, enabling smooth interpolation and “what-if” edits by swapping only the perturbation code while holding cell state and position fixed. The spatial module then propagates the encoded perturbation through tissue with a perturbation-specific Gaussian-process kernel that has learnable, anisotropic length-scales and spot-specific cutoffs. This induces non-local, directionally aware dispersion that respects boundaries and niches rather than copying nearest neighbors. The generation module is a variational decoder that combines latent cell state and the propagated perturbation field to reconstruct rGEX; evaluating it at new coordinates or times enables resolution enhancement, in-painting, temporal interpolation, and 3D propagation across sections.

CONCERT (i) performs counterfactual prediction for patch, border, niche, and cross-niche tasks; (ii) densifies slides and in-paints damaged regions while preserving plausible responses; (iii) interpolates time and lesion size; and (iv) supports 2D and 3D kernels for multi-section tissue. Predictions come with calibrated uncertainty (≈95% coverage in-distribution).

Patch Predictions Within the Same Niche

Given a patch of spots in source state A, predict their rGEX after perturbing them to target state B observed nearby within the same niche. This probes whether a model can reproduce local responses without crossing tissue interfaces.

Across 12 conditions (4 slides × 3 patch sizes), CONCERT achieved the best E-distance in 9/12, closely approaching or surpassing the spatial kNN “upper bound” on multiple slides. Relative to the next-best method, E-distance dropped by up to 33.77%, with consistent gains in MAE and PCC. Average rank was 2.33, near the spatial kNN’s 2.08.

Predictions at Tissue Boundaries

Perturb spots along the edge of a patch and predict their rGEX, a boundary-aware setting with sharp transitions at tissue interfaces.

CONCERT topped 8/12 conditions and reduced E-distance by up to 26.05% versus the next-best model, with an average rank of 2.50, again approaching the spatial kNN bound (2.41). Improvements held across MAE and PCC.

Perturb the Niche, Predict the Core

Perturb the surrounding niche (border) from A to B and predict rGEX inside the core patch—requiring models to capture non-local propagation from neighbors. Only methods that model microenvironmental influence can perform this task.

Only CONCERT and one spatialized baseline could run this task; CONCERT was best in all 9/9 conditions, lowering E-distance by up to 33.74% and achieving the top average rank (1.0) across slides and patch sizes.

Patch Predictions Across Niches

Perturb a patch in state A toward state B that is observed in a different niche, forcing extrapolation across microenvironments.

CONCERT ranked first in 7/12 conditions and reduced E-distance by up to ~24% (with additional significant gains across MAE and PCC), yielding the best overall average rank (2.91) across all tests. Uncertainty calibration was near nominal 95% coverage.

In Vivo Case Studies: Mouse Colitis and Stroke

We asked whether CONCERT can support causal “what-if” reasoning in vivo beyond what experiments can measure or control. Two case studies test distinct axes:

  • colitis time course: reconstruct unmeasured time points and enable longitudinal comparisons across different mice; and
  • ischemic stroke: simulate location and size-specific lesions in 2D and extend dispersion to 3D across brain sections.

In colitis time course, CONCERT reconstructed spatial rGEX at day 30, an unseen day 50, and day 73 for each mouse, revealing consistent regional declines in inflammation markers (Clca4b, Ido1, Il1b) that were obscured by inter-mouse variability in the raw data. In stroke, CONCERT simulated multi-region ischemia and lesion-size effects on healthy slides, recovering known gene-level patterns (core-maximal Gm42418 and peri-lesion Lcn2/Spp1) and capturing z-axis propagation in a 3D kernel across sections.

These studies demonstrate a practical virtual cell/tissue workflow: fill in missing spatial data, run counterfactual interventions at user-chosen locations, sizes, and times, and compare conditions that are otherwise unpaired or infeasible to acquire. The same engine also supports resolution enhancement and in-painting of damaged tissue while preserving biologically plausible responses.

Publication

CONCERT predicts spatial perturbation responses across tissue niches
Xiang Lin, Zhenglun Kong, Ghosh Soumya, Manolis Kellis, Marinka Zitnik
In Review 2025 [bioRxiv]

@article{lin2025concert,
  title={CONCERT predicts spatial perturbation responses across tissue niches},
  author={Lin, Xiang and Kong, Zhenglun and Ghosh, Soumya and Kellis, Manolis and Zitnik, Marinka},
  journal={In Review},
  url={#},
  year={2025}
}

Code and Data Availability

Pytorch implementation of CONCERT is available in the GitHub repository.

Authors

Latest News

Aug 2025:   CUREBench - Reasoning for Therapeutics

Update from CUREBench: 650+ entrants, 100+ teams and 500+ submissions. Thank you to the CUREBench community. Working on AI for drug discovery and reasoning in medicine? New teams welcome. Tasks, rules, and leaderboard: https://curebench.ai.

Aug 2025:   Drug Discovery Workshop at NeurIPS 2025

Excited to organize a NeurIPS workshop on Virtual Cells and Digital Instruments. Submit your papers.

Aug 2025:   AI for Science Workshop at NeurIPS

Excited to organize a NeurIPS workshop on AI for Science. This is our 6th workshop in the AI for Science series. Submit your papers.

Jul 2025:   Launching CUREBench

Launched CUREBench, the first competition in AI reasoning for therapeutics. Colocated with NeurIPS 2025. Start at https://curebench.ai.

Jul 2025:   Launching TxAgent Evaluation Portal

Launched TxAgent evaluation portal, our global evaluation of AI for drug decision-making and therapeutic reasoning. Participate in TxAgent evaluations! [TxAgent project]

Jul 2025:   SPATIA Model of Spatial Cell Phenotypes

Jul 2025:   AI-Enabled Drug Discovery Reaches Clinical Milestone

Jun 2025:   Knowledge Tracing for Biomedical AI Education

New preprint on biologically inspired architecture for knowledge tracing. The study on the use of generative AI in education with prospective evaluation of knowledge tracing in the classroom.

Jun 2025:   Few shot learning for rare disease diagnosis

Jun 2025:   One Patient, Many Contexts: Scaling Medical AI

Jun 2025:   ToolUniverse - 211+ Tools for "AI Scientist" Agents

ToolUniverse now offers access to over 211 cutting-edge biological and medical tools, all integrated with Model Context Protocol (MCP). Any “AI Scientist” agent can tap into these tools for biomedical research. [Tutorial] [ToolUniverse] [TxAgent]

May 2025:   What Perturbation Can Reverse Disease Effects?

In press at Nature Biomedical Engineering: PDGrapher AI predicts chemicals to reverse disease phenotypic effects — with applications to drug target identification.

May 2025:   Decision Transformers for Cell Reprogramming

New preprint: Decision transformers for generating reach-avoid policies in sequential decision making — with applications from robotics to cell reprogramming.

May 2025:   COMPASS: Immunotherapy Outcome Prediction

Apr 2025:   ATOMICA and TxAgent on the Kempner Blog

Check out the Kempner Deeper Learning posts describing our latest ATOMICA and TxAgent AI models.

Apr 2025:   ATOMICA - A Universal Model of Molecular Interactions

Mar 2025:   On Biomedical AI in Harvard Gazette

Read about AI in medicine in the latest Harvard Gazette and New York Times.

Mar 2025:   TxAgent: AI Agent for Therapeutic Reasoning

TxAgent is an AI agent for therapeutic reasoning that consolidates 211 tools from trusted sources, including all US FDA-approved drugs since 1939 and validated clinical insights. [Project website] [TxAgent] [ToolUniverse]

Mar 2025:   Multimodal AI predicts clinical outcomes of drug combinations from preclinical data

Mar 2025:   KGARevion: AI Agent for Knowledge-Intensive Biomedical QA

KGARevion is an AI agent designed for complex biomedical QA that integrates the non-codified knowledge of LLMs with the structured, codified knowledge found in knowledge graphs. [ICLR 2025 publication]

Zitnik Lab  ·  Artificial Intelligence in Medicine and Science  ·  Harvard  ·  Department of Biomedical Informatics