I'm trying to read one column from a csv (with header 'Peptide Sequence'). However, this gives me the error as in the title. I know this probably has something to do with the encoding, which I know very little about. Is there a quick workaround for this?
import pandas as pd
file = r'C:\...\thpdb.csv'
df = pd.read_csv(file, usecols=['Peptide Sequence'])
print(df)
CodePudding user response:
read_csv takes an encoding argument to deal with files in different formats, "ISO-8859-1" should work for you. See here:
import pandas as pd
file = r'C:\...\thpdb.csv'
df = pd.read_csv(file, usecols=['Peptide Sequence'], encoding = "ISO-8859-1")
print(df)