Ncbi Sra Tools

Configuration File: ncbi_sra_tools.json Tool Type: Local Tools Count: 4

This page contains all tools defined in the ncbi_sra_tools.json configuration file.

Available Tools

NCBI_SRA_get_download_urls (Type: NCBISRATool)

Get FTP and cloud storage URLs for downloading FASTQ sequencing data from SRA runs. Returns multi…

NCBI_SRA_get_download_urls tool specification

Tool Information:

  • Name: NCBI_SRA_get_download_urls

  • Type: NCBISRATool

  • Description: Get FTP and cloud storage URLs for downloading FASTQ sequencing data from SRA runs. Returns multiple access methods: NCBI FTP URLs for direct .sra file download, AWS S3 URLs for cloud-based access (faster for large-scale analysis), and NCBI web URLs for browser access. Note: Downloaded .sra files require SRA Toolkit (fastq-dump or fasterq-dump) to convert to standard FASTQ format. Essential for programmatic access to petabytes of public sequencing data. Works with SRR (NCBI), ERR (ENA), and DRR (DDBJ) accessions.

Parameters:

  • operation (unknown) (required) Operation type (fixed: get_download_urls)

  • accessions (unknown) (required) SRA run accession(s) (e.g., ‘SRR000001’, ‘ERR000001’, ‘DRR000001’). Can be single accession or array of accessions.

Example Usage:

query = {
    "name": "NCBI_SRA_get_download_urls",
    "arguments": {
        "operation": "example_value",
        "accessions": "example_value"
    }
}
result = tu.run(query)

NCBI_SRA_get_run_info (Type: NCBISRATool)

Get detailed metadata for SRA run accessions including sequencing platform (ILLUMINA, Oxford Nano…

NCBI_SRA_get_run_info tool specification

Tool Information:

  • Name: NCBI_SRA_get_run_info

  • Type: NCBISRATool

  • Description: Get detailed metadata for SRA run accessions including sequencing platform (ILLUMINA, Oxford Nanopore), instrument model, library strategy (RNA-Seq, WGS), library source (genomic, transcriptomic), library layout (paired/single), sample information, study details, organism, and data statistics (total spots, bases). Essential for understanding experimental design and data characteristics before downloading large FASTQ files. Use SRA run accessions (SRR000001, ERR000001, DRR000001) from NCBI_SRA_search_runs or known accessions.

Parameters:

  • operation (unknown) (required) Operation type (fixed: get_run_info)

  • accessions (unknown) (required) SRA run accession(s) (e.g., ‘SRR000001’, ‘ERR000001’, ‘DRR000001’). Can be single accession or array of accessions. Use SRR for NCBI, ERR for ENA, DRR for DDBJ.

Example Usage:

query = {
    "name": "NCBI_SRA_get_run_info",
    "arguments": {
        "operation": "example_value",
        "accessions": "example_value"
    }
}
result = tu.run(query)

NCBI_SRA_search_runs (Type: NCBISRATool)

Search NCBI Sequence Read Archive (SRA) for NGS/RNA-seq sequencing runs by study accession, organ…

NCBI_SRA_search_runs tool specification

Tool Information:

  • Name: NCBI_SRA_search_runs

  • Type: NCBISRATool

  • Description: Search NCBI Sequence Read Archive (SRA) for NGS/RNA-seq sequencing runs by study accession, organism, sequencing strategy (RNA-Seq, WGS, ChIP-Seq), platform (ILLUMINA, OXFORD_NANOPORE), or library source. Returns SRA UIDs which can be used with other SRA tools to retrieve metadata and download links. Essential for discovering high-throughput sequencing data from public repositories. Use this to find relevant sequencing experiments before accessing the actual FASTQ data files.

Parameters:

  • operation (unknown) (required) Operation type (fixed: search)

  • study (string) (optional) Study accession to search (e.g., ‘SRP000001’, ‘ERP000001’, ‘DRP000001’). Searches in [Study] field.

  • organism (string) (optional) Organism name to filter (e.g., ‘Homo sapiens’, ‘Mus musculus’, ‘SARS-CoV-2’). Searches in [Organism] field.

  • strategy (string) (optional) Sequencing strategy to filter. Common values: ‘RNA-Seq’ (transcriptomics), ‘WGS’ (whole genome), ‘WXS’ (exome), ‘ChIP-Seq’ (chromatin), ‘ATAC-Seq’ (accessibility), ‘Bisulfite-Seq’ (methylation), ‘AMPLICON’ (targeted), ‘Hi-C’ (3D genome)

  • platform (string) (optional) Sequencing platform to filter. Common values: ‘ILLUMINA’, ‘OXFORD_NANOPORE’, ‘PACBIO_SMRT’, ‘ION_TORRENT’, ‘BGISEQ’

  • source (string) (optional) Library source to filter. Common values: ‘GENOMIC’ (DNA), ‘TRANSCRIPTOMIC’ (RNA), ‘METAGENOMIC’, ‘METATRANSCRIPTOMIC’, ‘SYNTHETIC’, ‘VIRAL_RNA’

  • query (string) (optional) Free-form search query if not using structured filters. Use NCBI query syntax (e.g., ‘(Homo sapiens[Organism]) AND (RNA-Seq[Strategy])’)

  • limit (integer) (optional) Maximum number of results to return (default: 20, max: 100)

  • sort (string) (optional) Sort order for results (default: relevance)

Example Usage:

query = {
    "name": "NCBI_SRA_search_runs",
    "arguments": {
        "operation": "example_value"
    }
}
result = tu.run(query)