socio4health.Harmonizer.s4h_data_selector#

Harmonizer.s4h_data_selector(ddfs: List[DataFrame]) List[DataFrame][source]#

Select rows from Dask DataFrames based on the instance parameters.

Parameters:

ddfs (list of dask.dataframe.DataFrame) – List of Dask DataFrames to filter.

Returns:

List of filtered Dask DataFrames according to the key_col, key_val, categories, and extra_cols.

Return type:

list of dask.dataframe.DataFrame

Raises:

KeyError – If the key_col is not found in a DataFrame.