tooluniverse.odphp_tool module¶
- class tooluniverse.odphp_tool.BaseTool(tool_config)[source][source]¶
Bases:
object
- classmethod get_default_config_file()[source][source]¶
Get the path to the default configuration file for this tool type.
This method uses a robust path resolution strategy that works across different installation scenarios:
Installed packages: Uses importlib.resources for proper package resource access
Development mode: Falls back to file-based path resolution
Legacy Python: Handles both importlib.resources and the importlib_resources backport
Override this method in subclasses to specify a custom defaults file.
- Returns:
Path or resource object pointing to the defaults file
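A minimal sketch of the resolution order described above, for orientation only; the package name ("tooluniverse.data") and file name ("odphp_defaults.json") below are placeholders, not the module's actual defaults:

    from pathlib import Path

    def resolve_defaults_file(package="tooluniverse.data", filename="odphp_defaults.json"):
        """Illustrative resolution order: installed package -> backport -> source tree."""
        try:
            # Installed packages: the importlib.resources API (Python 3.9+)
            from importlib.resources import files
            return files(package) / filename
        except ImportError:
            pass
        try:
            # Legacy Python: the importlib_resources backport
            from importlib_resources import files
            return files(package) / filename
        except ImportError:
            # Development mode: fall back to file-based path resolution
            return Path(__file__).resolve().parent / "data" / filename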
- tooluniverse.odphp_tool.register_tool(tool_type_name=None, config=None)[source][source]¶
Decorator to automatically register tool classes and their configs.
- Usage:
@register_tool("CustomToolName", config={…})
class MyTool:
    pass
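A slightly fuller sketch of the same pattern; the tool type name, the config keys, and the run(arguments) entry point below are illustrative assumptions, not part of this module's documented API:

    from tooluniverse.odphp_tool import BaseTool, register_tool

    # Register a hypothetical tool class together with a default config.
    @register_tool("ODPHPExampleTool", config={"description": "Example ODPHP helper"})
    class ODPHPExampleTool(BaseTool):
        def run(self, arguments):
            # Assumed entry point; echo the arguments back for illustration.
            return {"echo": arguments}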
- class tooluniverse.odphp_tool.BeautifulSoup(markup: str | bytes | IO[str] | IO[bytes] = '', features: str | Sequence[str] | None = None, builder: TreeBuilder | Type[TreeBuilder] | None = None, parse_only: SoupStrainer | None = None, from_encoding: str | None = None, exclude_encodings: Iterable[str] | None = None, element_classes: Dict[Type[PageElement], Type[PageElement]] | None = None, **kwargs: Any)[source][source]¶
Bases:
Tag
A data structure representing a parsed HTML or XML document.
Most of the methods you'll call on a BeautifulSoup object are inherited from PageElement or Tag.
Internally, this class defines the basic interface called by the tree builders when converting an HTML/XML document into a data structure. The interface abstracts away the differences between parsers. To write a new tree builder, you'll need to understand these methods as a whole.
- These methods will be called by the BeautifulSoup constructor:
reset()
feed(markup)
- The tree builder may call these methods from its feed() implementation:
handle_starttag(name, attrs) # See note about return value
handle_endtag(name)
handle_data(data) # Appends to the current data node
endData(containerClass) # Ends the current data node
No matter how complicated the underlying parser is, you should be able to build a tree using "start tag" events, "end tag" events, "data" events, and "done with data" events.
If you encounter an empty-element tag (aka a self-closing tag, like HTML's <br> tag), call handle_starttag and then handle_endtag.
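For orientation, a minimal parse with the standard library's html.parser builder; internally the builder drives exactly the start-tag, end-tag, and data events listed above, including the paired events for the empty-element <br> tag:

    from bs4 import BeautifulSoup

    soup = BeautifulSoup("<p>Hello<br>world</p>", "html.parser")
    print(soup.p.get_text())                       # Helloworld
    print([tag.name for tag in soup.find_all()])   # ['p', 'br']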
- ROOT_TAG_NAME: str = '[document]'[source]¶
Since BeautifulSoup subclasses Tag, it's possible to treat it as a Tag with a Tag.name. However, this name makes it clear the BeautifulSoup object isn't a real markup tag.
- DEFAULT_BUILDER_FEATURES: Sequence[str] = ['html', 'fast'][source]¶
If the end-user gives no indication which tree builder they want, look for one with these features.
- ASCII_SPACES: str = ' \n\t\x0c\r'[source]¶
A string containing all ASCII whitespace characters, used during parsing to detect data chunks that seem "empty".
- original_encoding: str | None[source]¶
Beautiful Soup's best guess as to the character encoding of the original document.
- declared_html_encoding: str | None[source]¶
The character encoding, if any, that was explicitly defined in the original document. This may or may not match
BeautifulSoup.original_encoding
.
- contains_replacement_characters: bool[source]¶
This is True if the markup that was parsed contains U+FFFD REPLACEMENT_CHARACTER characters which were not present in the original markup. These mark character sequences that could not be represented in Unicode.
- __init__(markup: str | bytes | IO[str] | IO[bytes] = '', features: str | Sequence[str] | None = None, builder: TreeBuilder | Type[TreeBuilder] | None = None, parse_only: SoupStrainer | None = None, from_encoding: str | None = None, exclude_encodings: Iterable[str] | None = None, element_classes: Dict[Type[PageElement], Type[PageElement]] | None = None, **kwargs: Any)[source][source]¶
Constructor.
- Parameters:
markup – A string or a file-like object representing markup to be parsed.
features – Desirable features of the parser to be used. This may be the name of a specific parser ("lxml", "lxml-xml", "html.parser", or "html5lib") or it may be the type of markup to be used ("html", "html5", "xml"). It's recommended that you name a specific parser, so that Beautiful Soup gives you the same results across platforms and virtual environments.
builder – A TreeBuilder subclass to instantiate (or instance to use) instead of looking one up based on features. You only need to use this if you've implemented a custom TreeBuilder.
parse_only – A SoupStrainer. Only parts of the document matching the SoupStrainer will be considered. This is useful when parsing part of a document that would otherwise be too large to fit into memory.
from_encoding – A string indicating the encoding of the document to be parsed. Pass this in if Beautiful Soup is guessing wrongly about the document's encoding.
exclude_encodings – A list of strings indicating encodings known to be wrong. Pass this in if you don't know the document's encoding but you know Beautiful Soup's guess is wrong.
element_classes – A dictionary mapping BeautifulSoup classes like Tag and NavigableString to other classes you'd like to be instantiated instead as the parse tree is built. This is useful for subclassing Tag or NavigableString to modify default behavior.
kwargs – For backwards compatibility purposes, the constructor accepts certain keyword arguments used in Beautiful Soup 3. None of these arguments do anything in Beautiful Soup 4; they will result in a warning and then be ignored.
Apart from this, any keyword arguments passed into the BeautifulSoup constructor are propagated to the TreeBuilder constructor. This makes it possible to configure a TreeBuilder by passing in arguments, not just by saying which one to use.
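A small constructor example combining several of the parameters above; the latin-1 markup is contrived for illustration:

    from bs4 import BeautifulSoup

    html_bytes = "<html><body><p>café</p></body></html>".encode("latin-1")

    # Name a specific parser for reproducible results, and override the
    # encoding guess when the document's encoding is already known.
    soup = BeautifulSoup(html_bytes, features="html.parser", from_encoding="latin-1")
    print(soup.p.string)            # café
    print(soup.original_encoding)   # typically "latin-1", the encoding supplied above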
- copy_self() BeautifulSoup [source][source]¶
Create a new BeautifulSoup object with the same TreeBuilder, but not associated with any markup.
This is the first step of the deepcopy process.
- reset() None [source][source]¶
Reset this object to a state as though it had never parsed any markup.
- new_tag(name: str, namespace: str | None = None, nsprefix: str | None = None, attrs: Mapping[str | NamespacedAttribute, _RawAttributeValue] | None = None, sourceline: int | None = None, sourcepos: int | None = None, string: str | None = None, **kwattrs: str) Tag [source][source]¶
Create a new Tag associated with this BeautifulSoup object.
- Parameters:
name – The name of the new Tag.
namespace – The URI of the new Tag's XML namespace, if any.
nsprefix – The prefix for the new Tag's XML namespace, if any.
attrs – A dictionary of this Tag's attribute values; can be used instead of `kwattrs` for attributes like "class" that are reserved words in Python.
sourceline – The line number where this tag was (purportedly) found in its source document.
sourcepos – The character position within `sourceline` where this tag was (purportedly) found.
string – String content for the new Tag, if any.
kwattrs – Keyword arguments for the new Tag's attribute values.
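A short usage example; the attribute names and URL are illustrative:

    from bs4 import BeautifulSoup

    soup = BeautifulSoup("<div></div>", "html.parser")

    # attrs= covers attribute names that cannot be passed as Python keyword
    # arguments (reserved words like "class", or hyphenated names).
    link = soup.new_tag("a", href="https://example.org", attrs={"data-kind": "external"})
    link.string = "example"
    soup.div.append(link)
    print(soup.div.a["href"])   # https://example.org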
- string_container(base_class: Type[NavigableString] | None = None) Type[NavigableString] [source][source]¶
Find the class that should be instantiated to hold a given kind of string.
This may be a built-in Beautiful Soup class or a custom class passed in to the BeautifulSoup constructor.
- new_string(s: str, subclass: Type[NavigableString] | None = None) NavigableString [source][source]¶
Create a new NavigableString associated with this BeautifulSoup object.
- Parameters:
s – The string content of the NavigableString.
subclass – The subclass of NavigableString, if any, to use. If a document is being processed, an appropriate subclass for the current location in the document will be determined automatically.
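For example, requesting a specific NavigableString subclass such as Comment changes how the string is rendered:

    from bs4 import BeautifulSoup, Comment

    soup = BeautifulSoup("<p></p>", "html.parser")
    soup.p.append(soup.new_string("plain text"))
    soup.p.append(soup.new_string("a comment", Comment))
    print(soup.p)   # <p>plain text<!--a comment--></p>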
- insert_before(*args: PageElement | str) List[PageElement] [source][source]¶
This method is part of the PageElement API, but BeautifulSoup doesn't implement it because there is nothing before or after it in the parse tree.
- insert_after(*args: PageElement | str) List[PageElement] [source][source]¶
This method is part of the PageElement API, but BeautifulSoup doesn't implement it because there is nothing before or after it in the parse tree.
- decode(indent_level: int | None = None, eventual_encoding: str = 'utf-8', formatter: Formatter | str = 'minimal', iterator: Iterator[PageElement] | None = None, **kwargs: Any) str [source][source]¶
- Returns a string representation of the parse tree as a full HTML or XML document.
- Parameters:
indent_level – Each line of the rendering will be indented this many levels. (The `formatter` decides what a "level" means, in terms of spaces or other characters output.) This is used internally in recursive calls while pretty-printing.
eventual_encoding – The encoding of the final document. If this is None, the document will be a Unicode string.
formatter – Either a Formatter object, or a string naming one of the standard formatters.
iterator – The iterator to use when navigating over the parse tree. This is only used by Tag.decode_contents and you probably won't need to use it.
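A brief illustration of the formatter parameter: "minimal" (the default) escapes only what HTML requires, while "html" also converts non-ASCII characters to named entities.

    from bs4 import BeautifulSoup

    soup = BeautifulSoup("<p>café &amp; more</p>", "html.parser")
    print(soup.decode(formatter="minimal"))   # <p>café &amp; more</p>
    print(soup.decode(formatter="html"))      # <p>caf&eacute; &amp; more</p>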
- class tooluniverse.odphp_tool.ODPHPRESTTool(tool_config)[source][source]¶
Bases:
BaseTool
Base class for ODPHP (MyHealthfinder) REST API tools.
- class tooluniverse.odphp_tool.ODPHPMyHealthfinder(tool_config)[source][source]¶
Bases:
ODPHPRESTTool
Search for demographic-specific health recommendations (MyHealthfinder).
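A hypothetical invocation, shown only to indicate intent: the config fields and the run(arguments) entry point follow common ToolUniverse conventions, and the age/sex argument names mirror the public MyHealthfinder query parameters; none of this is confirmed by this page.

    from tooluniverse.odphp_tool import ODPHPMyHealthfinder

    # Assumed config schema and entry point, for illustration only.
    tool_config = {"name": "ODPHP_myhealthfinder", "description": "Demographic-specific health recommendations"}
    tool = ODPHPMyHealthfinder(tool_config)
    result = tool.run({"age": 35, "sex": "female"})  # assumed argument names
    print(result)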
- class tooluniverse.odphp_tool.ODPHPItemList(tool_config)[source][source]¶
Bases:
ODPHPRESTTool
Retrieve list of topics or categories.
- class tooluniverse.odphp_tool.ODPHPTopicSearch(tool_config)[source][source]¶
Bases:
ODPHPRESTTool
Search for health topics by ID, category, or keyword.
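Hypothetical usage of the two classes above, under the same assumptions (a run(arguments) entry point and argument names modeled on the public itemlist/topicsearch endpoints; none confirmed by this page):

    from tooluniverse.odphp_tool import ODPHPItemList, ODPHPTopicSearch

    topics = ODPHPItemList({"name": "ODPHP_itemlist"}).run({"type": "topic"})                     # assumed
    results = ODPHPTopicSearch({"name": "ODPHP_topicsearch"}).run({"keyword": "folic acid"})      # assumed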
- class tooluniverse.odphp_tool.ODPHPOutlinkFetch(tool_config)[source][source]¶
Bases:
BaseTool
Fetch article pages referenced by AccessibleVersion / RelatedItems.Url and return readable text.
- HTML: extracts main/article/body text; strips nav/aside/footer/script/style.
- PDF or non-HTML: returns metadata + URL so the agent can surface it.
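A rough sketch of the HTML branch described above, assuming a plain requests + BeautifulSoup pipeline; the element choices mirror the description, but the real implementation may differ:

    import requests
    from bs4 import BeautifulSoup

    def fetch_readable_text(url: str) -> str:
        """Illustrative only: download a page and return its main readable text."""
        html = requests.get(url, timeout=30).text
        soup = BeautifulSoup(html, "html.parser")
        # Strip navigation and boilerplate elements.
        for junk in soup(["nav", "aside", "footer", "script", "style"]):
            junk.decompose()
        # Prefer <main>/<article>, then fall back to <body>.
        main = soup.find("main") or soup.find("article") or soup.body or soup
        return main.get_text(separator="\n", strip=True)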