Home > Software engineering >  Pandas - Calculate Mean and Variance
Pandas - Calculate Mean and Variance

Time:10-07

For a current project, I would like to calculate both the mean and variance for a group of values.

My existing code calculates the mean through .agg('mean'). I tried to add , 'var' inside the bracket, which however yielded an error:

f"numpy operations are not valid with " pandas.errors.UnsupportedFunctionCall: numpy operations are not valid with groupby. Use .groupby(...).mean() instead

Is there any smart tweak to make the code below work?

newdf = df.groupby(['stock_symbol', 'quarter'])['rating_recommend', 'rating_outlook'].agg('mean')

CodePudding user response:

add 'var' for variance in the parenthesis.


newdf = (df.groupby(['stock_symbol', 'quarter'])['rating_recommend', 'rating_outlook']
         .agg('mean', 'var'))
  • Related