socio4health.utils.extractor_utils.run_standard_spider

socio4health.utils.extractor_utils.run_standard_spider(url, depth, down_ext, key_words)

Run the Scrapy spider to extract data from the given URL.

Parameters:
  • url (str) – The URL to start crawling from.

  • depth (int) – The depth of the crawl.

  • down_ext (list) – List of file extensions to download.

  • key_words (list) – List of keywords to filter the crawled data.

Return type:

None
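
Example:

The following is a minimal usage sketch, not taken from the library's own documentation. The helper names `build_spider_args` and `crawl`, the example URL, and the choice to write extensions with a leading dot are all assumptions made for illustration. Only the function path and its four parameter names come from the signature above. The helper checks the argument types listed under Parameters before anything is handed to the spider.

```python
from urllib.parse import urlparse


def build_spider_args(url, depth, down_ext, key_words):
    """Hypothetical helper: check the four documented arguments and
    return them as a keyword dictionary."""
    parsed = urlparse(url)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        raise ValueError(f"url must be an absolute http(s) URL, got {url!r}")
    if not isinstance(depth, int) or depth < 0:
        raise ValueError("depth must be a non-negative int")
    for name, value in (("down_ext", down_ext), ("key_words", key_words)):
        if not isinstance(value, list) or not all(isinstance(v, str) for v in value):
            raise ValueError(f"{name} must be a list of str")
    return {
        "url": url,
        "depth": depth,
        "down_ext": down_ext,
        "key_words": key_words,
    }


def crawl(args):
    """Hypothetical wrapper around the documented function."""
    # Imported lazily so the validation above can be used without
    # socio4health (and Scrapy) installed.
    from socio4health.utils.extractor_utils import run_standard_spider

    # The function is documented as returning None, so nothing is returned here.
    run_standard_spider(**args)


# Placeholder values; the leading dot in the extensions is an assumption.
args = build_spider_args(
    url="https://example.org/data",
    depth=1,
    down_ext=[".csv", ".zip"],
    key_words=["census"],
)

# Uncomment to start the crawl (requires socio4health and network access):
# crawl(args)
```

Because the return type is None, the call is made for its side effects only. Where the spider writes its results is not stated on this page.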