Rcsb Advanced Search Tools¶
Configuration File: rcsb_advanced_search_tools.json
Tool Type: Local
Tools Count: 2
This page contains all tools defined in the rcsb_advanced_search_tools.json configuration file.
Available Tools¶
RCSBAdvSearch_search_by_motif (Type: RCSBAdvancedSearchTool)¶
Search RCSB PDB for protein structures containing a specific sequence motif pattern. Supports PRO…
RCSBAdvSearch_search_by_motif tool specification
Tool Information:
Name:
RCSBAdvSearch_search_by_motifType:
RCSBAdvancedSearchToolDescription: Search RCSB PDB for protein structures containing a specific sequence motif pattern. Supports PROSITE patterns, regular expressions, and simple patterns. Useful for finding structures with specific functional motifs like zinc fingers, kinase domains, or any sequence pattern of interest. Returns polymer entity identifiers (PDB_ID + entity number).
Parameters:
pattern(string) (required) Sequence motif pattern. PROSITE format examples: ‘C-x(2,4)-C-x(3)-[LIVMFYWC]-x(8)-H-x(3,5)-H’ (zinc finger), ‘G-x(4)-GK-[ST]’ (P-loop NTP binding). Regex format: ‘C.{2,4}C.{3}[LIVMFYWC].{8}H.{3,5}H’. Simple format: ‘CXXCXXXH’ (X=any amino acid).pattern_type(string) (optional) Pattern format: ‘prosite’ (default, PROSITE-style), ‘regex’ (regular expression), ‘simple’ (X=any).sequence_type(string) (optional) Sequence type: ‘protein’ (default) or ‘dna’ or ‘rna’.rows(integer) (optional) Number of results to return (default: 10, max: 50).
Example Usage:
query = {
"name": "RCSBAdvSearch_search_by_motif",
"arguments": {
"pattern": "example_value"
}
}
result = tu.run(query)
RCSBAdvSearch_search_structures (Type: RCSBAdvancedSearchTool)¶
Advanced search of RCSB Protein Data Bank structures by combining multiple attribute filters. Fil…
RCSBAdvSearch_search_structures tool specification
Tool Information:
Name:
RCSBAdvSearch_search_structuresType:
RCSBAdvancedSearchToolDescription: Advanced search of RCSB Protein Data Bank structures by combining multiple attribute filters. Filter by organism (species), resolution, experimental method (X-RAY DIFFRACTION, ELECTRON MICROSCOPY, SOLUTION NMR), polymer description, and deposition date. Also supports free-text search combined with attribute filters. Use this when you need to find PDB structures matching specific criteria (e.g., all human cryo-EM structures below 3 Angstrom resolution). Returns PDB IDs sorted by resolution or date.
Parameters:
query([‘string’, ‘null’]) (optional) Free-text search query. Examples: ‘insulin receptor’, ‘CRISPR-Cas9’, ‘COVID spike protein’. Can be combined with other attribute filters.organism([‘string’, ‘null’]) (optional) Filter by source organism (exact match). Examples: ‘Homo sapiens’, ‘Mus musculus’, ‘Escherichia coli’, ‘Saccharomyces cerevisiae’, ‘SARS-CoV-2’.max_resolution([‘number’, ‘null’]) (optional) Maximum resolution in Angstroms. Lower values mean higher resolution. Examples: 1.5 (very high), 2.0 (high), 3.0 (moderate), 4.0 (low). Only applies to X-ray and cryo-EM structures.experimental_method([‘string’, ‘null’]) (optional) Filter by experimental method. Must be EXACT value: ‘X-RAY DIFFRACTION’, ‘ELECTRON MICROSCOPY’, ‘SOLUTION NMR’, ‘NEUTRON DIFFRACTION’, ‘ELECTRON CRYSTALLOGRAPHY’.polymer_description([‘string’, ‘null’]) (optional) Filter by polymer entity description (contains words). Examples: ‘kinase’, ‘hemoglobin’, ‘antibody’, ‘DNA polymerase’.min_deposition_date([‘string’, ‘null’]) (optional) Only structures deposited after this date. Format: YYYY-MM-DD. Example: ‘2023-01-01’ for structures deposited in 2023 or later.rows(integer) (optional) Number of results to return (default: 10, max: 50).sort_by([‘string’, ‘null’]) (optional) Sort results by: ‘resolution’ (best resolution first, default), ‘date’ (newest first), ‘weight’ (smallest first).
Example Usage:
query = {
"name": "RCSBAdvSearch_search_structures",
"arguments": {
}
}
result = tu.run(query)