Python Forum
Why doesn't my spider find body text?
#2
Update: I also checked NY Times and FOX, and the spider didn't find the body text there either, so this looks like a systematic issue, with those few CNN articles being the outliers (for example this one).

Does anyone have any idea why this might be, and why the CNN one might be different?

Edit: On CBS the spider also found the body text in every article (for example here), which confuses me even more.


Messages In This Thread
RE: Why doesn't my spider find body text? - by sigalizer - Oct-30-2019, 01:35 PM

