urlib - to use or not to use ( for web scraping )?


snippsat

Administrators

#19

Oct-01-2018, 05:06 AM

(Sep-30-2018, 11:33 PM)Truman Wrote: https://www.amazon.com/Web-Scraping-Pyth...1491910291
The whole book is based on urlib library.

The book isn't based on urllib; it only uses urllib to fetch the source of a web page.
Requests is of course the recommended way to do that now.

Take the example on page 20 and rewrite it to use Requests.
With bs4 you should also specify a parser; lxml is the recommended one.

from urllib.request import urlopen
from bs4 import BeautifulSoup

html = urlopen("http://www.pythonscraping.com/pages/page3.html")
bsObj = BeautifulSoup(html)
for child in bsObj.find("table",{"id":"giftList"}).children:
    print(child)

The same example with Requests and lxml as the parser:

import requests
from bs4 import BeautifulSoup

html = requests.get("http://www.pythonscraping.com/pages/page3.html")
soup = BeautifulSoup(html.content, 'lxml')
for child in soup.find("table",{"id":"giftList"}).children:
    print(child)

Now you know how "from urllib.request import urlopen" can be replaced by Requests;
this applies to over 90% of the examples in the book.
The parsing with BeautifulSoup in the book is fine as it is.
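
If you want something a bit more defensive than the snippets above, here is a minimal sketch of the same swap. It is only one way to structure it, not how the book does it. The function names fetch_html and gift_rows, the 10-second timeout, and the choice to collect the table rows with find_all("tr") instead of iterating .children are my own assumptions for illustration.

```python
import requests
from bs4 import BeautifulSoup

URL = "http://www.pythonscraping.com/pages/page3.html"


def fetch_html(url, timeout=10):
    """Download a page and return its raw bytes (hypothetical helper)."""
    response = requests.get(url, timeout=timeout)
    response.raise_for_status()  # raise an exception on 4xx/5xx responses
    return response.content


def gift_rows(html, parser="lxml"):
    """Return the <tr> tags inside the table with id="giftList".

    Returns an empty list when the table is missing, instead of
    failing with AttributeError on None.
    """
    soup = BeautifulSoup(html, parser)
    table = soup.find("table", {"id": "giftList"})
    if table is None:
        return []
    return table.find_all("tr")


if __name__ == "__main__":
    for row in gift_rows(fetch_html(URL)):
        print(row)
```

Keeping the download and the parsing in separate functions means the parsing part can be tried on a plain HTML string without any network access, and only fetch_html has to change if you switch HTTP libraries again.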
