socio4health.Harmonizer.data_selector

Harmonizer.data_selector(ddfs: List[DataFrame]) → List[DataFrame]

Select rows from Dask DataFrames based on the instance parameters.

Parameters:

ddfs (list of dask.dataframe.DataFrame) – List of Dask DataFrames to filter.

Returns:

List of Dask DataFrames, one per input, filtered according to the instance's key column, key values, categories, and extra columns.

Return type:

list of dask.dataframe.DataFrame

Raises:

KeyError – If the key column is not found in a DataFrame.