Home > front end >  Combining dictionaries with a particular key: Python
Combining dictionaries with a particular key: Python

Time:01-05

I have the following dictionaries generated using for loop and I would to combine and dump them into one YAML file.

dict1

{'name': 'test',
 'version': '0011 * *? *',

 'datasets': [{
   'dataset': 'dataset_a',
   'match_on': ['code'],
   'columns': {'include': ['aa',
     'bb',
     'cc']}}]
}

dict2

{'name': 'test',
 'version': '0011 * *? *',

 'datasets': [{
   'dataset': 'dataset_b',
   'match_on': ['code'],
   'columns': {'include': ['aaa',
     'bbb',
     'ccc']}}]
}

The dictionaries have common values on name and version keys. So, I would like to put it as common but they have differences on the datasets key. The desired output look like the following

combined_dict

{'name': 'test',
 'version': '0011 * *? *',

 'datasets': [{
   'dataset': 'dataset_a',
   'match_on': ['code'],
   'columns': {'include': ['aa',
     'bb',
     'cc']}}]

'datasets': [{
   'dataset': 'dataset_b',
   'match_on': ['code'],
   'columns': {'include': ['aaa',
     'bbb',
     'ccc']}}]
}

To achieve this, I have tried the following.

combined_dict= {**dict1,**dict2}

with open('combined_dict.yml', 'w') as outfile:
    yaml.dump(updated_dict , outfile, sort_keys=False)

CodePudding user response:

As said in the comment, you cannot do what you want since the key in the dictionary has to be unique. What you can do is add an element to the list associated with the key "datasets", in order to obtain something like this:

{
    "name": "test",
    "version": "0011 * *? *",
    "datasets": [
        {
            "dataset": "dataset_a",
            "match_on": ["code"],
            "columns": {"include": ["aa", "bb", "cc"]},
        },
        {
            "dataset": "dataset_b",
            "match_on": ["code"],
            "columns": {"include": ["aaa", "bbb", "ccc"]},
        },
    ],
}

This is the code to do that:

combined_dict = dict1.copy()
combined_dict["datasets"].extend(dict2["datasets"])

If you want to modify in-place dict1, just replace combined_dict with dict1

  • Related