loading semi structured data to pandas-CodePudding

I have data that looks like this (from jq)

script_runtime{application="app1",runtime="1651394161"} 1651394161
folder_put_time{application="app1",runtime="1651394161"} 22
folder_get_time{application="app1",runtime="1651394161"} 128.544
folder_ls_time{application="app1",runtime="1651394161"} 3.868
folder_ls_count{application="app1",runtime="1651394161"} 5046

The dataframe should allow manipulation of each row to this:

script_runtime,app1,1651394161,1651394161
folder_put_time,app1,1651394161,22

Its in a textfile. How can I easily load it into pandas for data manipulation?

CodePudding user response：

Load the .txt using pd.read_csv(), specifying a space as the separator (

If you want to drop the first column which contains the bracketed value:
df = df.iloc[: , 1:]

Full code:

df = pd.read_csv("textfile.txt", header=None, delimiter=r"\s ")

df['function'] = df[0].str.split("{",expand=True)[0]
df['application'] = df[0].str.split("\"",expand=True)[1]
df['runtime'] = df[0].str.split("\"",expand=True)[3]

df = df.iloc[: , 1:]