Convert a subset of rows to columns in a PySpark DataFrame

Time:04-28

Suppose we have the following DataFrame df:

Id PlaceCod  Val
1  1         0 
1  2         3
2  2         4 
2  1         5
3  1         6

How can I convert this DF to this one:

Id Store Warehouse
1  0     3
2  5     4
3  6     null

I tried df.pivot(f.col("PlaceCod")), but I got the error message 'DataFrame has no pivot attribute'.

Answer:

As posted by @Emma in the comments:

from pyspark.sql import functions as F
df.groupby('Id').pivot('PlaceCod').agg(F.first('Val'))

Using the above solution my problem was solved!
