- I am using
findAll
method ofBeautifulSoup
and trying to fetch all the values of particular tagDocumentIndex
. - While using it, I am getting the output as
[<DocumentIndex>3646</DocumentIndex>, <DocumentIndex>3650</DocumentIndex>, <DocumentIndex>3649</DocumentIndex>]
- Code, gstr_xml is available here
lstr_soup = BeautifulSoup(gstr_xml, features="xml")
lstr_folder_index = lstr_soup.findAll('DocumentIndex')
print(lstr_folder_index)
- How can I get the output just as
[3646, 3650, 3649]
CodePudding user response:
Each value in the list is a <class 'bs4.element.Tag'>
, which you can call .text
on to retrieve just the text value.
print([x.text for x in lstr_folder_index])
# Output:
['3646', '3650', '3649']