socio4health.utils.extractor_utils.run_standard_spider#

socio4health.utils.extractor_utils.run_standard_spider(url, depth, down_ext, key_words)[source]#

Run the Scrapy spider to extract data from the given URL .

Parameters:

url (str) – The URL to start crawling from.
depth (int) – The depth of the crawl.
down_ext (list) – List of file extensions to download.
key_words (list) – List of keywords to filter the crawled data.

Returns:

True if spider completed successfully, False otherwise

Return type: