ac_core.utils package

Submodules

ac_core.utils.html_parse_helper module

ac_core.utils.html_parse_helper.parse_start_end(soup: BeautifulSoup) → Tuple[int, int]
ac_core.utils.html_parse_helper.parse_url(soup: BeautifulSoup) → str

Module contents

ac_core.utils.get_direct_children_text(tag: Tag) → str

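get_direct_children_text collects the text nodes that are direct children of the given tag and returns them as a single string. Text inside nested tags is not included.

For example, given the tag <h2>A - Hello world <a href="...">Editorial</a></h2>, it returns “A - Hello world ” (with the trailing space); “Editorial” is left out because it belongs to the nested <a> tag.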
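The behaviour described above can be illustrated with a small standalone sketch. This is not the ac_core implementation, which takes a BeautifulSoup Tag and whose source is not reproduced here. It rebuilds the same idea with the standard library's html.parser so that it runs without extra dependencies. The names DirectTextCollector and direct_children_text are hypothetical and exist only for this illustration.

```python
from html.parser import HTMLParser


class DirectTextCollector(HTMLParser):
    """Collect text that sits directly inside the outermost element.

    Hypothetical helper for illustration only; it is not part of ac_core.
    """

    def __init__(self):
        super().__init__()
        self.depth = 0   # number of currently open elements
        self.parts = []  # text fragments seen at depth 1

    def handle_starttag(self, tag, attrs):
        # Simplification: void elements such as <br> written without a
        # closing slash would throw this counter off.
        self.depth += 1

    def handle_endtag(self, tag):
        self.depth -= 1

    def handle_data(self, data):
        # Depth 1 means the text is a direct child of the outermost element.
        if self.depth == 1:
            self.parts.append(data)


def direct_children_text(html: str) -> str:
    """Return the concatenated direct-child text of the outermost element."""
    collector = DirectTextCollector()
    collector.feed(html)
    collector.close()
    return "".join(collector.parts)


result = direct_children_text(
    '<h2>A - Hello world <a href="...">Editorial</a></h2>'
)
print(repr(result))  # 'A - Hello world '
```

The nested link text is skipped because it is seen at depth 2, while the heading's own text, including its trailing space, is seen at depth 1.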

ac_core.utils.remove_prefix(s: str, prefix: str) → str
ac_core.utils.remove_suffix(s: str, suffix: str) → str
ac_core.utils.time_str_2_timestamp(s: str) → int