Oct-18-2023, 11:22 AM
This format is confusing me.
What is the best way to parse this into individual components with BeautifulSoup?
HTML:
<td class="small">
<b>
[Amend]
</b>
<b>
[Cover]
</b>
Material Amendment to Form ATS-N (Rule 304(a)(2)(i)(A))
<br/>
Acc-no: 0001609177-23-000017 (34 Act) Size: 3 KB
</td>
Desired results:
[Amend]
[Cover]
Material Amendment to Form ATS-N (Rule 304(a)(2)(i)(A))
Acc-no: 0001609177-23-000017 (34 Act) Size: 3 KB
I have tried:
td.get_text(strip=True).split('\n')
which results in a list of length 1:
['[Amend][Cover]Material Amendment to Form ATS-N (Rule 304(a)(2)(i)(A))Acc-no: 0001609177-23-000017\xa0(34 Act)\xa0 Size: 3 KB']
I have also tried numerous other methods, none of which gave the desired result.
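
One possible approach, offered as a minimal sketch rather than a definitive answer: get_text(strip=True) strips each text node and then joins them with an empty separator, so there is no '\n' left to split on. Iterating over td.stripped_strings yields each non-empty text node separately. The snippet below assumes the html.parser backend, and it assumes the \xa0 characters in the output above come from &nbsp; entities in the real page, so the sample markup includes them; the variable names html and parts are made up for illustration.

```python
from bs4 import BeautifulSoup

# Sample markup copied from the post; the &nbsp; entities are an assumption
# based on the \xa0 characters that show up in the get_text() output.
html = """<td class="small">
<b>
[Amend]
</b>
<b>
[Cover]
</b>
Material Amendment to Form ATS-N (Rule 304(a)(2)(i)(A))
<br/>
Acc-no: 0001609177-23-000017&nbsp;(34 Act)&nbsp; Size: 3 KB
</td>"""

soup = BeautifulSoup(html, "html.parser")
td = soup.find("td", class_="small")

# stripped_strings yields every non-empty text node with surrounding
# whitespace removed. " ".join(s.split()) then collapses internal runs of
# whitespace, including non-breaking spaces, into single ordinary spaces.
parts = [" ".join(s.split()) for s in td.stripped_strings]

for part in parts:
    print(part)
```

If the original list-of-lines shape is preferred, td.get_text('\n', strip=True).split('\n') should give the same four pieces, since the separator is inserted between the stripped text nodes; the non-breaking spaces would still need to be replaced separately in that case.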