Image Scraper (BeautifulSoup) stopped working, need help seeing why

woodmister · Jan-12-2021, 04:10 PM

Man, that's awesome. I love the concurrent.futures part! I went ahead and bastardized your code, though, so that it asks me for a folder name and then prefixes each file name with it. This is what I ended up with:

import concurrent.futures
import os

import requests
from bs4 import BeautifulSoup

url = input("Website: ")
folder = input("Folder: ")

def read_url(url):
    """Return the <a> tags that wrap each thread image."""
    # Browser-like User-Agent for the page request
    headers = {'User-Agent': 'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/86.0.4240.75 Safari/537.36'}
    response = requests.get(url, headers=headers)
    soup = BeautifulSoup(response.content, 'lxml')
    img_all = soup.select('div.thread_image_box > a')
    return img_all

def img_multi(img_link):
    """Download one image into the chosen folder."""
    # Prefix the file name with the folder name, e.g. "cats - 123.jpg"
    name = folder + ' - ' + os.path.basename(img_link)
    print('Download -->', name)
    dlpath = os.path.join(folder, name)
    with open(dlpath, 'wb') as f_out:
        f_out.write(requests.get(img_link).content)

if __name__ == '__main__':
    img_all = read_url(url)
    # Create the target folder if it does not exist yet
    os.makedirs(folder, exist_ok=True)
    # ThreadPoolExecutor | ProcessPoolExecutor
    with concurrent.futures.ThreadPoolExecutor(max_workers=8) as executor:
        for img in img_all:
            img_link = img.get('href')
            executor.submit(img_multi, img_link)
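
One thing to keep in mind with the submit loop above: executor.submit returns a Future, and an exception raised inside the worker is stored on that Future instead of being printed, so a failed download can go unnoticed. Here is a minimal sketch of one way to surface failures. The helper name run_all is hypothetical (it is not part of the code above), and this is just one possible pattern, not the only one:

```python
import concurrent.futures


def run_all(func, items, max_workers=8):
    """Run func(item) in a thread pool; return (succeeded, failed).

    run_all is a hypothetical helper name used for illustration.
    """
    succeeded, failed = [], []
    with concurrent.futures.ThreadPoolExecutor(max_workers=max_workers) as executor:
        # Map each Future back to the item it was submitted for
        futures = {executor.submit(func, item): item for item in items}
        for future in concurrent.futures.as_completed(futures):
            item = futures[future]
            try:
                future.result()  # re-raises any exception from the worker
            except Exception as exc:
                failed.append((item, exc))
            else:
                succeeded.append(item)
    return succeeded, failed
```

Assuming the names from the script above, it could be called as ok, bad = run_all(img_multi, [img.get('href') for img in img_all]), and then each (link, error) pair in bad can be printed to see which downloads failed and why.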

Took me far longer than you could probably imagine to figure out how to actually get it to ask for a name and then use it. But it's there! Thanks for sharing your code!
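
A small caveat on the naming step: taking the basename of the raw href works when the link is a plain absolute URL, but a query string would end up in the file name, and a protocol-relative link (starting with //) cannot be fetched as-is. Below is a rough sketch of a more defensive version. The function build_name and its parameters are hypothetical names for illustration, and whether the target site actually serves such links is an assumption:

```python
import posixpath
from urllib.parse import urljoin, urlsplit


def build_name(folder, page_url, href):
    """Return (absolute_url, file_name) for one image link.

    build_name is a hypothetical helper, not part of the original script.
    """
    # Resolve relative or protocol-relative hrefs against the page URL
    absolute = urljoin(page_url, href)
    # Use only the path component so a query string never reaches the file name
    base = posixpath.basename(urlsplit(absolute).path)
    return absolute, f"{folder} - {base}"
```

The returned URL is what would be passed to requests.get, and the returned name is what would be joined onto the folder with os.path.join.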
