Core Tools

  • Configuration File: core_tools.json

  • Tool Type: Local

  • Tools Count: 2

This page contains all tools defined in the core_tools.json configuration file.

Available Tools

CORE_get_fulltext_snippets (Type: CorePDFSnippetsTool)

Tool Information:

  • Name: CORE_get_fulltext_snippets

  • Type: CorePDFSnippetsTool

  • Description: Fetch an open-access PDF (commonly returned by CORE_search_papers) and return bounded text snippets around the provided terms. Because the tool downloads and scans the PDF locally, it serves as a full-text search fallback when index-based search misses keywords that appear only in the body text. Extraction backends: PyMuPDF (fitz), pypdf, or markitdown; the fastest available backend is selected automatically unless extractor is set.
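The automatic backend choice described above can be pictured as a simple availability check. The following is a minimal sketch, not the tool's actual implementation: the function name pick_extractor and the preference order (fitz, then pypdf, then markitdown) are assumptions, since the description only says that the fastest available backend is used.

```python
import importlib.util

# Assumed preference order; the description only says "fastest available".
CANDIDATE_MODULES = ["fitz", "pypdf", "markitdown"]


def pick_extractor(requested="auto"):
    """Return the backend name to use, or None if no backend is installed.

    Hypothetical helper: an explicit request is returned unchanged,
    while "auto" picks the first importable candidate module.
    """
    if requested != "auto":
        return requested
    for module_name in CANDIDATE_MODULES:
        # find_spec returns None when a top-level module is not installed.
        if importlib.util.find_spec(module_name) is not None:
            return module_name
    return None


print(pick_extractor())         # depends on which packages are installed
print(pick_extractor("pypdf"))  # an explicit choice bypasses detection
```

Setting extractor explicitly is useful when results need to be reproducible across machines that have different backends installed.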

Parameters:

  • pdf_url (string) (optional) Direct PDF URL to download (preferred). CORE_search_papers returns this as the url field in many results.

  • url (string) (optional) Alias for pdf_url for convenience when piping CORE_search_papers outputs.

  • terms (array) (required) Terms to search for in the extracted PDF text (case-insensitive).

  • window_chars (integer) (optional) Context window size (characters) before and after each match.

  • max_snippets_per_term (integer) (optional) Maximum number of snippets returned per term.

  • max_total_chars (integer) (optional) Hard cap on total characters returned across all snippets.

  • extractor (string) (optional) PDF text extraction backend: auto (default), fitz (PyMuPDF), pypdf, or markitdown.

  • timeout (integer) (optional) Download timeout in seconds, capped at 55 to stay under typical MCP call deadlines.

  • max_pdf_bytes (integer) (optional) Maximum PDF size (bytes) allowed for download/scan. Prevents timeouts on huge PDFs.

  • max_pages (integer) (optional) Maximum number of PDF pages to scan when using fitz/pypdf extractors.

  • max_text_chars (integer) (optional) Maximum number of extracted text characters to scan for term matches (limits CPU/memory).
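To illustrate how terms, window_chars, max_snippets_per_term, and max_total_chars interact, here is a minimal sketch of bounded snippet extraction over already-extracted text. The function find_snippets, its default values, and the shape of the returned records are assumptions made for illustration; the tool's real defaults and output format are not stated on this page.

```python
import re


def find_snippets(text, terms, window_chars=200,
                  max_snippets_per_term=3, max_total_chars=2000):
    """Collect case-insensitive matches with surrounding context.

    Hypothetical helper; the default values here are illustrative only.
    """
    snippets = []
    total = 0
    for term in terms:
        if not term:
            continue
        # re.escape treats the term as literal text, not as a regex.
        pattern = re.compile(re.escape(term), re.IGNORECASE)
        for count, match in enumerate(pattern.finditer(text)):
            if count >= max_snippets_per_term:
                break
            lo = max(0, match.start() - window_chars)
            hi = min(len(text), match.end() + window_chars)
            snippet = text[lo:hi]
            # Stop entirely once the overall character budget would be exceeded.
            if total + len(snippet) > max_total_chars:
                return snippets
            snippets.append({"term": term, "snippet": snippet})
            total += len(snippet)
    return snippets


sample = "Alpha beta GAMMA beta alpha"
print(find_snippets(sample, ["beta"], window_chars=2, max_snippets_per_term=1))
```

In this sketch the per-term limit bounds how many matches each term can contribute, while the total character cap bounds the size of the whole response regardless of how many terms are supplied.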

Example Usage:

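# Assumes `tu` is an already-initialized tool runner that exposes run().
query = {
    "name": "CORE_get_fulltext_snippets",
    "arguments": {
        # Placeholder URL; replace with a real PDF URL, e.g. the url field of a CORE_search_papers result.
        "pdf_url": "https://example.org/paper.pdf",
        "terms": ["item1", "item2"]
    }
}
result = tu.run(query)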

CORE_search_papers (Type: CoreTool)

Tool Information:

  • Name: CORE_search_papers

  • Type: CoreTool

  • Description: Search for open access academic papers using CORE API. CORE aggregates OA repository and journal content; queries can match metadata and (for many records) indexed full text, but coverage varies by source. Use CORE_get_fulltext_snippets to search inside a returned PDF URL when exact body-text terms matter.

Parameters:

  • query (string) (optional) Search query for CORE papers. Use keywords separated by spaces to refine your search.

  • search (string) (optional) Alias for query.

  • q (string) (optional) Alias for query.

  • limit (integer) (optional) Maximum number of papers to retrieve from CORE.

  • page_size (integer) (optional) Alias for limit.

  • max_results (integer) (optional) Alias for limit.

  • year_from (integer) (optional) Start year for the publication date filter (e.g., 2020). Limits the search to papers published in or after this year.

  • year_to (integer) (optional) End year for the publication date filter (e.g., 2024). Limits the search to papers published in or before this year.

  • language (string) (optional) Language filter for papers (e.g., ‘en’, ‘es’, ‘fr’). Limits the search to papers in a specific language.
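Because several parameters are aliases of one another, it can help to see them collapsed into canonical names. The sketch below is a hypothetical client-side helper, not part of the tool: build_search_arguments, the rule that a canonical name wins over its aliases, and the year-range check are all assumptions added for illustration.

```python
# Alias names from the parameter list, mapped to their canonical parameter.
ALIASES = {
    "search": "query",
    "q": "query",
    "page_size": "limit",
    "max_results": "limit",
}


def build_search_arguments(**kwargs):
    """Collapse aliases into canonical names and sanity-check the year range.

    Hypothetical helper; assumes the canonical name takes precedence
    when both it and one of its aliases are supplied.
    """
    supplied = {k: v for k, v in kwargs.items() if v is not None}
    # Start from the non-alias parameters, then let aliases fill any gaps.
    args = {k: v for k, v in supplied.items() if k not in ALIASES}
    for alias, canonical in ALIASES.items():
        if alias in supplied and canonical not in args:
            args[canonical] = supplied[alias]
    year_from, year_to = args.get("year_from"), args.get("year_to")
    if year_from is not None and year_to is not None and year_from > year_to:
        raise ValueError("year_from must not be later than year_to")
    return args


arguments = build_search_arguments(q="crispr", max_results=5, year_from=2020)
print({"name": "CORE_search_papers", "arguments": arguments})
```

Passing only canonical names (query, limit) keeps calls unambiguous; the aliases mainly exist for convenience when arguments come from other tools or callers.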

Example Usage:

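# Assumes `tu` is an already-initialized tool runner that exposes run().
query = {
    "name": "CORE_search_papers",
    "arguments": {
        "query": "open access publishing",  # illustrative search terms
        "limit": 5
    }
}
result = tu.run(query)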
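The two tools are meant to be chained: search first, then scan the returned PDFs for exact body-text terms. The sketch below shows one way to build the follow-up queries. It assumes that search results can be treated as a list of dictionaries carrying a url field, as the pdf_url parameter description suggests; build_snippet_queries, max_papers, and the sample records are hypothetical.

```python
def build_snippet_queries(papers, terms, max_papers=3):
    """Build CORE_get_fulltext_snippets queries for papers that have a URL.

    Hypothetical helper; `papers` is assumed to be a list of dicts
    with an optional "url" key.
    """
    queries = []
    for paper in papers:
        url = paper.get("url")
        if not url:
            continue  # nothing to download for this record
        queries.append({
            "name": "CORE_get_fulltext_snippets",
            # "url" is the documented alias for pdf_url.
            "arguments": {"url": url, "terms": list(terms)},
        })
        if len(queries) >= max_papers:
            break
    return queries


# Made-up records standing in for search results.
papers = [
    {"title": "Paper A", "url": "https://example.org/a.pdf"},
    {"title": "Paper B"},
    {"title": "Paper C", "url": "https://example.org/c.pdf"},
]
for query in build_snippet_queries(papers, ["item1", "item2"]):
    print(query)
```

Each generated query can then be passed to tu.run(query) in the same way as the examples above. Capping the number of papers keeps the total download time within a predictable bound.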