Home > Blockchain >  Pandas Indexing Creates ERROR ( Result = self._data[key] ): HEELP
Pandas Indexing Creates ERROR ( Result = self._data[key] ): HEELP

Time:12-07

Error when using Pandas and indexing. why? I even asked Derek Banas and he wasn't sure why didn't work so please help Errors at bottom in 'quote format'

This my code:

import numpy as np 

import pandas as pd
from pandas_datareader import data as web
import matplotlib.pyplot as plt 
import matplotlib.dates as mdates
%matplotlib inline

import datetime as dt #For defining dates
import mplfinance as mpf # Matplotlib finance

import time
import yfinance as yf;
# Used to get data from a directory
import os
from os import listdir
from os.path import isfile, join
pd.set_option('display.max_columns', None)
pd.set_option('display.max_rows', 5);
#Statsmodels is a great library we can use to run regressions.
import statsmodels.api as sm
# Seaborn extends the capabilities of Matplotlib
import seaborn as sns
# Used for calculating regressions
from statsmodels.tsa.ar_model import AutoReg, ar_select_order

msft= yf.download(tickers='MSFT', period = '1mo',interval = '5m');
x = msft.index; close = msft.index['Adj Close'], high = msft['High']; low = msft['Low']; openprice=msft['Open'];
#print(x); print(high);

print(msft) 

Error is:

x = msft.index; close = msft.index['Adj Close'];

---- " [my addrs]...\pandas\core\indexes\extension.py, line 238 in getitem

result = self._data[key] Following a tutorial https://youtu.be/boouvnzw-G8?t=582

I'm using Eclipse on windows. Why doesn't Python work when I index whatever's in the video? Appreciate it, I'm a lowly student and cannot figure why my copy-paste of the github repositry code won't work. I even asked Derek Banas and he wasn't sure why didn't work so please help

CodePudding user response:

It seems like a formatting issue.. Change this line

x = msft.index; close = msft.index['Adj Close'], high = msft['High']; low = msft['Low']; openprice=msft['Open'];

to this:

x = msft.index
# THIS LINE CAUSES THE ERROR
# close = msft.index['Adj Close']
close = msft['Adj Close']
high = msft['High'] 
low = msft['Low']
openprice=msft['Open']

Tip 1: No need for ; anywhere. Remove them.

Tip 2: One statement per line.

CodePudding user response:

The problem is that msft.index is a an index not a column of your DataFrame. Whereas you are trying to access it as if it is a column (msft.index[<ColumnName>]).
It should be simply msft[<ColumnName>]. See below for details:

Run:

msft= yf.download(tickers='MSFT', period = '1mo',interval = '5m');
type(msft.index)

Output:

pandas.core.indexes.datetimes.DatetimeIndex

Run:

msft.head()

Output:


                            Open       High         Low        Close        Adj Close   Volume
Datetime                        
2021-11-04 09:30:00-04:00   332.890015  333.239990  329.640015  330.549988  330.549988  1619634
2021-11-04 09:35:00-04:00   330.000000  330.589996  329.859985  330.390015  330.390015  966856
2021-11-04 09:40:00-04:00   330.500000  332.299988  330.450104  332.269989  332.269989  760918
2021-11-04 09:45:00-04:00   332.250000  333.100006  332.170013  332.989990  332.989990  526600
2021-11-04 09:50:00-04:00   332.750000  333.179993  332.510010  333.174988  333.174988  498996

You need to access it this way (get rid of .index):

Run:

msft['Adj Close']

Output:

Datetime
2021-11-04 09:30:00-04:00    330.549988
2021-11-04 09:35:00-04:00    330.390015
2021-11-04 09:40:00-04:00    332.269989
2021-11-04 09:45:00-04:00    332.989990
2021-11-04 09:50:00-04:00    333.174988
                                ...    
2021-12-03 15:35:00-05:00    320.594604
2021-12-03 15:40:00-05:00    320.304993
2021-12-03 15:45:00-05:00    320.317505
2021-12-03 15:50:00-05:00    321.679993
2021-12-03 15:55:00-05:00    323.149994
Name: Adj Close, Length: 1603, dtype: float64

I'm guessing this error was the result of a copy/paste typo (because the video does not have it as you do with .index).

I would strongly echo the advise of @WesleyJonCheek : "One statement per line"

  • Related