brdata

module core.crawler


class Crawler

Base class for crawlers.

If you want to create a new crawler, you should inherit from this class. It provides some useful methods for crawling.

Args:

method __init__

__init__(url: str)

method get_page

get_page(url: str = None, path: str = <class 'str'>, **kwargs) → str

Get a page from a given url. This is just a wrapper around get_response method.


method get_page_soup

get_page_soup(
    url: str = None,
    enable_cache: bool = True,
    **kwargs
)  BeautifulSoup

Get a BeautifulSoup object from a given url. This is just a wrapper around get_page method.


method get_response

get_response(url: str = None, path: str = <class 'str'>, **kwargs) → Response

Get a response from a given url.

Args:

Returns:


method join_url

join_url(*args: str)  str

Join url parts.

Args:

Returns:


This file was automatically generated via lazydocs.