Scraping a webpage with BS4

**Larz60+** · (This post was last modified: Jan-30-2019, 12:48 AM by Larz60+.)

Quote:what does the [-1] refer to in the url.split line

The [-1] is an index to the last item in the list returned by split, so it gets the file name.
You have to be careful with this though, and look for anything that trails the file name, such as a query string like '?ei=wvJQXPryB6m2ggfQx76wAg&q', which I didn't do here as it didn't appear that it would be an issue.
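
If trailing attributes ever do become an issue, one possible way to handle them is to let the standard library pull the path out of the URL before taking the last piece. This is only a sketch: the URL and the helper name filename_from_url are made up for illustration and are not part of the original code.

```python
import posixpath
from urllib.parse import urlsplit


def filename_from_url(url):
    """Return the last path component of url, ignoring query string and fragment."""
    # urlsplit separates the path from anything after '?' or '#'
    path = urlsplit(url).path
    # basename keeps only what follows the final '/'
    return posixpath.basename(path)


# Hypothetical URL, used only to show the difference between the two approaches
url = "https://example.com/files/report.pdf?ei=abc123&q=1"

naive = url.split('/')[-1]      # last item of the split list, query string included
clean = filename_from_url(url)  # file name only

print(naive)
print(clean)
```

With this example URL, the plain split keeps the '?ei=...' part attached to the name, while the helper returns just the file name.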

Quote:I don't totally follow the code. I received an error on the first line when trying to run the code. What does the 'n' refer to?

Please note that I stated the code was untested. If that line is used, it should read:

    for n, link in enumerate(soup.find_all('a')):

but since n is not needed, it should be removed, and the line should be:

    for link in soup.find_all('a'):

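What enumerate does is return a counter along with each item: on every pass of the loop, n holds the current iteration number (starting at 0) and link holds the current item.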
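
A quick way to see this for yourself, without needing a page to scrape, is to run enumerate over a plain list. The list below is a made-up stand-in for what soup.find_all('a') would return, so treat it as an illustration only.

```python
# Hypothetical stand-in for the links found on a page
links = ["index.html", "about.html", "data.csv"]

# enumerate yields (counter, item) pairs, counting from 0 by default
pairs = list(enumerate(links))

for n, link in enumerate(links):
    print(n, link)

# If the counter is not needed, loop over the items directly
plain = [link for link in links]
```

Here pairs holds a tuple for each item with its position first, which is what gets unpacked into n and link in the for loop.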
