I have a Dataframe with 45 columns and 11k rows. This dataframe consist of players. Columns displaying their name, player_id, rating, height etc. Pretend you have the name of a player in the dataframe, or their ID, and you want to access the entire row of that player. You want to see all the information of that individual, but you only have one unique identifyer.
I tried using df.loc[[id_number]]
, but that only takes me to the index of the dataframe, which does not correspond to player_id.
Hopefully I explained it well enough. If you have any questions, please post them below.
CodePudding user response:
df.loc[df['column_name'] == some_value]
Related to https://stackoverflow.com/a/17071908/17487637
CodePudding user response:
You can try applying a mask:
df[df.playerId == id_number]
Assuming playerId
is the name of the column containing the player ids.
CodePudding user response:
As far as I have understood, you want to query a dataframe based on one unique identifier. Let's suppose you only have player name.
df[df.player_name==playername]
Here playername is the variable where you will store your desired player name.
CodePudding user response:
I use this in my code to find what i need in excel file. Mayby will be helpful for you.
search = input('\t\t Find row: ')
xlsxfile = pd.read_excel('your_filename.xlsx', engine='openpyxl')
df = xlsxfile[xlsxfile['name_of_your_column' ].str.contains(search , na=False)]
dx = (df[['name', 'player_id','height']])
print(dx)
You will need some third party module for this like openpyxl.