socio4health.Harmonizer.s4h_join_data#

Harmonizer.s4h_join_data(ddfs: List[DataFrame]) DataFrame[source]#

Join multiple Dask DataFrames on a specified key_col, removing duplicate columns.

Parameters:

ddfs (list of dask.dataframe.DataFrame) – List of Dask DataFrames to join.

Returns:

Merged DataFrame with duplicate columns removed.

Return type:

pandas.DataFrame