socio4health.Extractor.extract#
- Extractor.extract()[source]#
Extracts data from the specified input path, either by scraping online sources or processing local files.
This method determines the operation mode based on the input path:
If the input path is a
URL
, it performs online scraping to find downloadable files.If the input path is a local directory, it processes files directly from that directory.
- Returns:
List of Dask DataFrames containing the extracted data.
- Return type:
list of dask.dataframe.DataFrame
- Raises:
ValueError – If extraction fails due to an invalid input path, missing column specifications for fixed-width files, or if no valid data files are found after processing.