socio4health.Harmonizer.s4h_data_selector#
- Harmonizer.s4h_data_selector(ddfs: List[DataFrame]) List[DataFrame] [source]#
Select rows from Dask DataFrames based on the instance parameters.
- Parameters:
ddfs (list of dask.dataframe.DataFrame) – List of Dask DataFrames to filter.
- Returns:
List of filtered Dask DataFrames according to the
key_col
,key_val
,categories
, andextra_cols
.- Return type:
list of dask.dataframe.DataFrame
- Raises:
KeyError – If the
key_col
is not found in a DataFrame.