I'm trying to do some data analysis with python and pandas on a power consumption dataset.
However when I plot the data I get that stright line from 5-1-2007
to 13-1-2007
but I have no missing values in my dataset which is a weird behavior as I made sure that my dataset in clean.
Anyone had similar issue? or can explain this behavior?
Thank you.
EDIT: Here is what the data looks like in that range EDIT 2 : Here is the link to the original dataset (before cleaning) if that might help:
CodePudding user response:
How does the data between 2007-01-01 and 2007-01-15 look like? (use df[(df['Date_Time'] >= '2007-01-01 ') & (df['Date_Time'] <= '2007-01-15')]
).
If no data is missing it could be that the dataset has been manipulated and the missing datapoints were interpolated (see
CodePudding user response:
Fact is that when there is data on the x (Datetime)axis, then if there is no data on the y axis, then the rendering continues anyway. Is especially noticeable on financial data on weekends and holidays or when there are gaps. Here this problem is described