Python Forum
Search the entire web
#4
What do you consider part of the "entire web"? Not everything can be crawled or indexed, and not everything is accessible over HTTP.

You really only need the requests module to fetch a page. Finding all the links in that page is easier with Beautiful Soup (the package name is bs4). And unless you have infinite time, you probably want to store an indexed version of each page in some sort of database, which would be another package.
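
To make that concrete, here is a rough sketch of the three pieces working together: requests fetches a page, bs4 pulls out the links, and a database keeps the text. Everything named here is my own assumption for illustration (the `pages` table, the helper names, the `max_pages` cap), and I picked sqlite3 from the standard library only because it needs no setup; a real index would want something with full-text search. It also ignores robots.txt and rate limiting, so treat it as a toy.

```python
import sqlite3
from collections import deque
from urllib.parse import urldefrag, urljoin

import requests
from bs4 import BeautifulSoup


def extract_links(html, base_url):
    """Return the set of absolute http(s) links found in an HTML string."""
    soup = BeautifulSoup(html, "html.parser")
    links = set()
    for anchor in soup.find_all("a", href=True):
        # Resolve relative hrefs against the page URL and drop any #fragment.
        url, _fragment = urldefrag(urljoin(base_url, anchor["href"]))
        if url.startswith(("http://", "https://")):
            links.add(url)
    return links


def open_index(path=":memory:"):
    """Open (or create) the hypothetical page store; in-memory by default."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS pages (url TEXT PRIMARY KEY, body TEXT)"
    )
    return conn


def store_page(conn, url, text):
    """Save the text of one page, overwriting any earlier copy of that URL."""
    conn.execute(
        "INSERT OR REPLACE INTO pages (url, body) VALUES (?, ?)", (url, text)
    )
    conn.commit()


def crawl(start_url, conn, max_pages=10):
    """Breadth-first crawl from start_url, stopping after max_pages fetches."""
    queue = deque([start_url])
    seen = {start_url}
    fetched = 0
    while queue and fetched < max_pages:
        url = queue.popleft()
        try:
            response = requests.get(url, timeout=10)
            response.raise_for_status()
        except requests.RequestException:
            continue  # unreachable or failing pages are simply skipped
        fetched += 1
        soup = BeautifulSoup(response.text, "html.parser")
        store_page(conn, url, soup.get_text(" ", strip=True))
        for link in extract_links(response.text, url):
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return fetched
```

Calling something like `crawl("https://example.com/", open_index("index.db"))` would fetch at most ten pages. The cap is the whole point: without it the queue just keeps growing, which is why "the entire web" is not a realistic target for a script like this.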


Messages In This Thread
Search the entire web - by DT909 - Dec-15-2017, 05:22 PM
RE: Search the entire web - by Larz60+ - Dec-15-2017, 05:25 PM
RE: Search the entire web - by DT909 - Dec-15-2017, 05:58 PM
RE: Search the entire web - by buran - Dec-15-2017, 06:59 PM
RE: Search the entire web - by nilamo - Dec-15-2017, 06:09 PM
