`Following series, contains result as string of lists with values either PASS or FAIL. Input:-
result |
---|
"['PASS','FAIL']" |
"['PASS','FAIL','PASS','FAIL']" |
"['FAIL','FAIL']" |
Output:
result |
---|
1 |
1 |
0 |
If any row has at-least one PASS as value then return 1 else return 0 Input:-
result |
---|
"['PASS','FAIL']" |
"['PASS','FAIL','PASS','FAIL']" |
"['FAIL','FAIL']" |
CodePudding user response:
A simple and fast approach, use a regex with str.contains
:
# if your want a robust check
df['result'] = df['result'].str.contains(r'\bPASS\b').astype(int)
# or if you're sure there are only PASS/FAIL
df['result'] = df['result'].str.contains('PASS').astype(int)
CodePudding user response:
If there are lists use in
statement:
df['result'] = [int('PASS' in x) for x in df['result']]
#alternative solution
df['result'] = df['result'].apply(lambda x: 'PASS' in x).astype(int)
If strings use Series.str.contains
:
df['result'] = df['result'].str.contains('PASS').astype(int)