enter image description hereI'm working on Colab and trying to separate out a test set of the last 2 months of data but I'm facing this error (ValueError: Both dates must have the same UTC offset), I know the error is because the start date of the set is in BST and the end date is in GMT.
latest_df = df.loc['Sat 01 Oct 2022 12:00:03 AM BST':'Thu 01 Dec 2022 10:02:02 AM GMT']
latest_df.head()
I tried to convert the time manually on the excel of the dataset but it will take a long time to convert all dates because it is a big data.
CodePudding user response:
You can use the pytz library to convert the dates to the same timezone. Here's an example:
import pytz
# Set the timezone for the start and end dates
start_tz = pytz.timezone('Europe/London')
end_tz = pytz.timezone('Europe/London')
# Convert the start and end dates to the same timezone
start_date = start_tz.localize(df['Sat 01 Oct 2022 12:00:03 AM BST'])
end_date = end_tz.localize(df['Thu 01 Dec 2022 10:02:02 AM GMT'])
# Select the rows between the start and end dates
latest_df = df.loc[start_date:end_date]
latest_df.head()
CodePudding user response:
You can simply convert your start_date timezone instead of converting whole data.