Esm Tools¶

Configuration File: esm_tools.json Tool Type: Local Tools Count: 10

This page contains all tools defined in the esm_tools.json configuration file.

Available Tools¶

ESM_describe_sae_feature (Type: ESMTool)¶

Label a single SAE feature_id with its dominant biological category by aggregating UniProt featur…

ESM_explain_variant_mechanism (Type: ESMTool)¶

One-call composite for variant mechanism: runs ESMC-6B SAE variant disruption + describe_sae_feat…

ESM_fold_protein (Type: ESMTool)¶

Predict protein 3D structure from sequence using ESM3, returning pTM (predicted TM-score), per-re…

ESM_generate_protein_sequence (Type: ESMTool)¶

Generate or complete a protein sequence using ESM3, EvolutionaryScale’s generative protein langua…

ESM_get_protein_embedding (Type: ESMTool)¶

Get protein sequence embeddings from EvolutionaryScale ESMC (ESM Cambrian) models via the Forge A…

ESM_get_region_sae_features (Type: ESMTool)¶

Aggregate ESMC-6B SAE features over a contiguous residue range to characterize the region’s biolo…

ESM_get_sae_features (Type: ESMTool)¶

Run a protein sequence through an ESMC Sparse Autoencoder (SAE) and return sparse feature activat…

ESM_score_sequence (Type: ESMTool)¶

Score a protein sequence using ESMC logits to compute per-residue log-probabilities and mean pseu…

ESM_score_variant_sae_batch (Type: ESMTool)¶

Score many missense variants against one reference protein using ESMC-6B SAE. More Forge-efficien…

ESM_score_variant_sae_disruption (Type: ESMTool)¶

Composite SAE-based variant scoring. Given a protein sequence and a missense variant (position + …

ESM_score_variant_sae_disruption tool specification

Tool Information:

Name: ESM_score_variant_sae_disruption
Type: ESMTool
Description: Composite SAE-based variant scoring. Given a protein sequence and a missense variant (position + ref_aa + alt_aa), runs ESMC-6B SAE on both reference and mutant sequences, computes per-feature activation deltas summed over a window centered on the mutation site, and returns ranked top-K features LOST and GAINED in the mutant. This is the convenience layer over ESM_get_sae_features — use this for one-shot variant interpretation; use ESM_get_sae_features directly only when you need raw per-residue features for non-variant analyses. Validates ref_aa matches the position in the supplied sequence (returns clear error if not, so you can detect transcript / isoform mismatches). Cost: 2 Forge API credits (1 ref + 1 mut). Latency: ~3-6s for typical-length human proteins. Prerequisites: same as ESM_get_sae_features (pip install ‘esm @ git+https://github.com/evolutionaryscale/esm@ee891c52’, ESM_API_KEY env var). License: SAE outputs governed by Cambrian Inference License — non-commercial / academic only.

Parameters:

sequence (string) (required) Reference protein sequence (canonical isoform). Single-letter codes, no gaps. The mutant sequence is built internally by substituting alt_aa at position.
position (integer) (required) 1-indexed mutation position. The amino acid at sequence[position-1] must equal ref_aa; otherwise the tool returns an error (the wrong sequence was supplied or the variant notation references a different isoform).
ref_aa (string) (required) Reference amino acid at the mutation position, single-letter code (e.g. ‘R’). Tool validates this matches the supplied sequence.
alt_aa (string) (required) Mutant amino acid, single-letter code (e.g. ‘H’ for R175H).
window (integer) (optional) Residue window radius around the mutation. Per-feature activations are summed across this window before computing the delta. Default 8 = +/- 8 residues = 17-residue window.
top_k_features (integer) (optional) Number of top LOST and top GAINED features to return. Default 10.
model (string) (optional) ESMC backbone (default esmc-6b-2024-12).
sae_model (string) (optional) SAE checkpoint (default layer-60 6B SAE).

Example Usage:

query = {
    "name": "ESM_score_variant_sae_disruption",
    "arguments": {
        "sequence": "example_value",
        "position": 10,
        "ref_aa": "example_value",
        "alt_aa": "example_value"
    }
}
result = tu.run(query)