socio4health.Harmonizer.join_data

Harmonizer.join_data(ddfs: List[DataFrame]) → DataFrame

Join multiple Dask DataFrames on a specified key column, removing duplicate columns.

Parameters:

ddfs (list of dask.dataframe.DataFrame) – List of Dask DataFrames to join.

Returns:

Merged DataFrame with duplicate columns removed.

Return type:

pandas.DataFrame
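
Example:

The snippet below is a minimal sketch of the general technique described above (merge several frames on a shared key and keep only the first copy of any repeated column). It is not the Harmonizer implementation. The helper name join_on_key, its key and how parameters, and the sample column names are all hypothetical; the signature above does not show how the key column is supplied, so passing it as an argument is an assumption. The helper only uses .columns, .drop(columns=...) and .merge(...), which exist on both pandas and Dask DataFrames; the demonstration uses small pandas frames so it runs without a Dask cluster.

```python
from functools import reduce

import pandas as pd


def join_on_key(frames, key, how="inner"):
    """Hypothetical helper: merge frames on `key`, keeping the first copy of repeated columns."""

    def merge_pair(left, right):
        # Columns already present on the left (other than the key) are
        # dropped from the right so the merge does not produce _x/_y pairs.
        dupes = [c for c in right.columns if c in left.columns and c != key]
        return left.merge(right.drop(columns=dupes), on=key, how=how)

    # Fold the list left to right: ((f0 + f1) + f2) + ...
    return reduce(merge_pair, frames)


# Illustrative data only; "id" plays the role of the key column.
households = pd.DataFrame({"id": [1, 2, 3], "region": ["N", "S", "E"]})
income = pd.DataFrame(
    {"id": [1, 2, 3], "region": ["N", "S", "E"], "income": [10, 20, 30]}
)
health = pd.DataFrame({"id": [1, 2, 3], "insured": [True, False, True]})

merged = join_on_key([households, income, health], key="id")
print(list(merged.columns))
```

In this sketch the repeated region column from the second frame is discarded before merging, so the result has one region column rather than suffixed copies. Whether join_data resolves duplicates the same way is not stated in this reference.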