socio4health.Harmonizer.get_available_columns#

static Harmonizer.get_available_columns(ddf_or_ddfs: DataFrame | List[DataFrame]) List[str][source]#

Get a list of unique column names from a single Dask DataFrame or a list of Dask DataFrames.

Parameters:

ddf_or_ddfs – or list of dask.dataframe.DataFrame A single Dask DataFrame or a list of Dask DataFrames to extract column names from.

Returns:

Sorted list of unique column names across all provided Dask DataFrames.

Return type:

list of str

Raises:

TypeError – If the input is not a Dask DataFrame or a list of Dask DataFrames.