socio4health.Extractor.extract#

Extractor.extract()[source]#

Extracts data from the specified input path, either by scraping online sources or processing local files.

This method determines the operation mode based on the input path:

  • If the input path is a URL, it performs online scraping to find downloadable files.

  • If the input path is a local directory, it processes files directly from that directory.

Returns:

List of Dask DataFrames containing the extracted data.

Return type:

list of dask.dataframe.DataFrame

Raises:

ValueError – If extraction fails due to an invalid input path, missing column specifications for fixed-width files, or if no valid data files are found after processing.