Home > Software design >  Using BeautifulSoup to count xml elements in a function
Using BeautifulSoup to count xml elements in a function

Time:10-21

I often use len(find_all("some_element") to count the number of entities in a xml file. I tried to build a function, but it doesn't work/ it always give me "None".

The XML file:

<parent>
 <some>
   <child>text</child>
   <child>text</child>
   <child>text</child>
 </some>
</parent>

my python code:

def return_len(para1,para2): # doesn't work
    if bool(suppe.para1): # the element isn't always present in the xml
        return len(suppe.para1.find_all(para2))

def return_len1(): # does work
    if bool(suppe.some):
        return len(suppe.some.find_all("child"))

print(return_len("some","child")) # doesnt work
print(return_len1()) # does work

How must i modify my function return_len to get working / what did i wrong?

CodePudding user response:

You can do like this.

from bs4 import BeautifulSoup

s = """<parent>
 <some>
   <child>text</child>
   <child>text</child>
   <child>text</child>
 </some>
</parent>    
"""

soup = BeautifulSoup(s, 'xml')

def return_len(para1,para2,soup):
    print(f'No. of <{para2}> tags inside <{para1}> tag.')
    temp = soup.find(para1)
    if temp:
        return len(temp.find_all(para2))

print(return_len('some', 'child', soup))
print(return_len('parent', 'some', soup))

No. of <child> tags inside <some> tag.
3
No. of <some> tags inside <parent> tag.
1

CodePudding user response:

Without any external library - see the below

import xml.etree.ElementTree as ET


xml = '''<parent>
 <some>
   <child>text</child>
   <child>text</child>
   <child>text</child>
 </some>
</parent>'''

root = ET.fromstring(xml)
print(f'Number of child elements is {len(root.findall(".//child"))}') 

output

Number of child elements is 3
  • Related