Home > Software engineering >  AWS S3 object filter to NOT match Prefix in Python script
AWS S3 object filter to NOT match Prefix in Python script

Time:12-18

When iterating over S3 objects using Python/boto3, I see that there's a filter method. But can you apply a NOT condition?

I want to just get the top level objects, not objects in folders (they have a prefix). I am currently doing this and it works:

import boto3

s3 = boto3.resource('s3')
bucket = cfg['s3']['bucket_name']

for obj in s3.Bucket(bucket).objects.all():
    if not re.match('folder_name.*', obj.key):

I see support for a filter like this:

for obj in s3.Bucket(bucket).objects.filter(Prefix=folder_name):

I'm asking is there a way to say Prefix != folder_name?

CodePudding user response:

If you just want a list of objects at without a shared prefix, specify the delimiter to the filter, and boto3 will filter away the shared prefixes:

s3 = boto3.resource('s3')
for obj in s3.Bucket(bucket).objects.filter(Delimiter='/'):
    print(obj.key)
  • Related