I have my data in this format :
Harvard | MIT |
---|---|
David | Troy |
Siri | Charlie |
Troy | David |
Alexa | Cortana |
Cortana | Man |
Animal | David |
and I want my results to be in :
Harvard | MIT | Output |
---|---|---|
David | Troy | David |
Harvard | MIT | Troy |
David | Troy | Cortana |
Siri | Charlie | |
Troy | David | |
Alexa | Cortana | |
Cortana | Man |
I don't care the order I get the name, just I need the list of person who are attending both institutions. I have them stored in same csv file but different column.
Python is my preferred language. Or I have git bash Installed on my windows either. I need a solution.
CodePudding user response:
The following line of code should solve your problem.
df['Output'] = df['Harvard'].loc[df['Harvard'].isin(df.MIT.values.tolist())]
You can sort the Output-column afterwards, if you want to.