socio4health.Harmonizer.s4h_get_available_columns#

static Harmonizer.s4h_get_available_columns(df_or_dfs: DataFrame | DataFrame | List[DataFrame | DataFrame]) List[str][source]#

Get a list of unique column names from a single DataFrame or a list of DataFrames. Supports both Dask and pandas DataFrames.

Parameters:

df_or_dfs (Union[dask.dataframe.DataFrame, pandas.DataFrame,) – List[Union[dask.dataframe.DataFrame, pandas.DataFrame]]] A single DataFrame or a list of DataFrames to extract column names from. Can be Dask DataFrames, pandas DataFrames, or a mix of both.

Returns:

Sorted list of unique column names across all provided DataFrames.

Return type:

list of str

Raises:

TypeError – If the input is not a DataFrame or a list of DataFrames.