socio4health.Harmonizer.data_selector
- Harmonizer.data_selector(ddfs: List[DataFrame]) → List[DataFrame]
Select rows from Dask DataFrames according to the parameters set on the Harmonizer instance (key column, key values, categories, and extra columns).
- Parameters:
ddfs (list of dask.dataframe.DataFrame) – List of Dask DataFrames to filter.
- Returns:
List of Dask DataFrames, each filtered according to the key column, key values, categories, and extra columns.
- Return type:
list of dask.dataframe.DataFrame
- Raises:
KeyError – If the key column is not found in a DataFrame.
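
The row selection and the KeyError condition described above can be illustrated with a minimal sketch. This is not the library's implementation: the function `select_rows` and its parameters `key_col` and `key_val` are hypothetical names chosen for illustration, and the example uses plain pandas DataFrames so that it runs without a Dask installation (Dask DataFrames support the same `isin` and boolean-indexing calls). Filtering by categories and extra columns is not modeled here.

```python
from typing import List

import pandas as pd


def select_rows(
    dfs: List[pd.DataFrame], key_col: str, key_val: List[str]
) -> List[pd.DataFrame]:
    """Hypothetical stand-in for the row-selection step (not socio4health code)."""
    filtered = []
    for df in dfs:
        # Mirror the documented failure mode: a missing key column is an error.
        if key_col not in df.columns:
            raise KeyError(f"Key column '{key_col}' not found in DataFrame")
        # Keep only the rows whose key column holds one of the requested values.
        filtered.append(df[df[key_col].isin(key_val)])
    return filtered


# Illustrative data only; the column names are assumptions.
frames = [
    pd.DataFrame({"region": ["A", "B", "C"], "value": [1, 2, 3]}),
    pd.DataFrame({"region": ["B", "D"], "value": [4, 5]}),
]
selected = select_rows(frames, key_col="region", key_val=["A", "B"])
```

In this sketch each input DataFrame yields exactly one output DataFrame, so the returned list has the same length and order as the input list.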