I have a pandas dataframe
import pandas as pd
s = pd.DataFrame({'ioc' : ['ESP', 'CYP', 'USA', 'ESP', 'NED']})
and I want to return
out = pd.DataFrame(
{'ioc' : ['ESP', 'CYP', 'USA', 'ESP', 'NED'],
'countryName' : ['Spain', 'Cyprus', 'United States of America',
'Spain', 'Netherlands']})
CodePudding user response:
import pycountry
def foo(a):
country = pycountry.countries.get(alpha_3=a)
return country.name if country else "NA"
s.ioc.apply(foo)
0 Spain
1 Cyprus
2 United States
3 Spain
4 NA
Name: ioc, dtype: object
Another option
import country_converter as coco
cc = coco.CountryConverter()
cc.convert(names=s.ioc, to='name')
# ['Spain', 'Cyprus', 'United States', 'Spain', 'not found']
CodePudding user response:
ioc = pd.read_html('https://en.wikipedia.org/wiki/List_of_IOC_country_codes')[0]
ioc = ioc.assign(Code=ioc['Code'].str[-3:]).set_index('Code')['National Olympic Committee']
s['countryName'] = s['ioc'].map(ioc)
print(s)
# Output:
ioc countryName
0 ESP Spain
1 CYP Cyprus
2 USA United States
3 ESP Spain
4 NED Netherlands