I want to generate pairs of lineplots where one of them is used as a benchmark.
I can generate a plot like this with the code below.
however, I wish I could have 6 pair plots with Arkhangelsk as the benchmark line in each plot instead. for example, one of them will be like this:
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
data = {'year': ['1998','1998','1998','1998','1998','1998','1998','1999','1999','1999','1999','1999','1999','1999'],'region':['Adygea','Altai Krai','Amur Oblast','Arkhangelsk','Astrakhan','Bashkortostan','Belgorod','Adygea','Altai Krai','Amur Oblast','Arkhangelsk','Astrakhan','Bashkortostan','Belgorod'], 'sales':[8.8, 19.2,21.2, 10.6,18,17.5,23, 10, 17.8, 20.5, 12.6, 19.9, 16, 21]}
df1 = pd.DataFrame(data)
plt.figure(figsize=(12, 6))
palette1 = {c:'#079b51' if c=='Astrakhan' else 'grey' for c in df1['region'].unique()}
sns.lineplot(x= 'year', y='sales', data=df1,hue='region', palette=palette1) # or sns.relplot
plt.legend(bbox_to_anchor=(1.05, 1), loc='upper left')
My code is reproducible.
I have tried the following, which apparently does not work. I am not sure how to loop through each of my 6 regions to compare the Astkrakhan line plot to. It should probably contain a condition like not equal/equal to 'Astrakhan' ? thank you.
fig, axs = plt.subplots(2,3, figsize=(18,15))
for i in enumerate(df1.region.unique()):
sns.lineplot(x= 'year', y='sales', data=df1,ax = axs[i])
and this, which brings up the ValueError: Could not interpret value Adygea
for parameter y
.
df2 = df1.pivot(index='year', columns='region', values='sales') # converting the rows into columns
df_a = df2[['Arkhangelsk']]
df_r = df2.loc[:, ~df2.columns.isin(['Arkhangelsk'])] ## all other columns
fig, axes = plt.subplots(2, 3)
for col, ax in zip(df2.columns, axes.ravel()):
sns.lineplot(x = "year", y = col, data = df_a, ax = ax, linestyle="--")
sns.lineplot(x = "year", y = col, data = df_r, ax = ax)
CodePudding user response: