MaxRetryError while scraping a website multiple times

kawasso · Aug-27-2019, 09:48 PM

The code that I am using for this is below:

from bs4 import BeautifulSoup
import requests, io
import pandas as pd
from selenium import webdriver
import time

################## NOTE THAT THIS CODE WORKS FOR 1 LINK AT A TIME, FOR MORE THAN ONE IT FAILS
############# error: MaxRetryError: HTTPConnectionPool(host='127.0.0.1', port=57192): Max retries exceeded with url: /session/dcba731d4173518f03b593a17afe111c/url (Caused by NewConnectionError('<urllib3.connection.HTTPConnection object at 0x0000001DD4212470>: Failed to establish a new connection: [WinError 10061] No connection could be made because the target machine actively refused it'))

driver = webdriver.Chrome(executable_path=r"myfolder\chromedriver.exe")
uchar=['a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z','<','>',',']
timestamp = pd.datetime.today().strftime('%Y%m%d-&H&M&S')

links_df = pd.read_excel(r'myfolder\myfile.xlsx', sheetname='Hoja1')
links_Df = links_df[(links_df['Country'] == 'PT')]

results = pd.DataFrame(columns=['ISIN', 'N Shares', 'Link'])

for ISIN in links_df.ISIN:
    link='https://www.bolsadelisboa.com.pt/pt-pt/products/equities/' + ISIN + '-XLIS/market-information'
    driver.get(link)
    soup = BeautifulSoup(driver.page_source, 'html.parser')
    driver.quit()
    r=soup.find_all("strong")[14]
    dirtyresult=str(r)
    for x in uchar:
        cleanresult=dirtyresult.replace(x,"").replace("<strong>","").replace("</strong>","")
    time.sleep(30)
    
results = results.append({'ISIN': ISIN, 'N Shares': cleanresult, 'Link': link}, ignore_index=True)
print(ISIN +": " + cleanresult)
    
results.to_csv(r'myfolder\output' + timestamp + '.csv', index=False)

print('Finish')

Possibly Related Threads…
Thread		Author	Replies	Views	Last Post
	Web-scraping, multiple webpages	Pabloty92	1	1,293	Dec-28-2022, 02:09 PM Last Post: Yoriz
	web scraping for new additions/modifed website?	kingoman123	4	2,266	Apr-14-2022, 04:46 PM Last Post: snippsat
	Scraping lender data from Ren Ren Dai website using Python. I will pay for that 200$	Hafedh_2021	1	2,766	May-18-2021, 08:41 PM Last Post: snippsat
	Scraping all website text using Python	MKMKMKMK	1	2,105	Nov-26-2020, 10:35 PM Last Post: Larz60+
	scraping multiple pages from table	bandar	1	2,726	Jun-27-2020, 10:43 PM Last Post: Larz60+
	Scraping a Website (HELP)	LearnPython2	1	1,777	May-08-2020, 03:20 PM Last Post: Larz60+
	Scraping Multiple Pages	mbadatanut	1	4,249	May-08-2020, 02:30 AM Last Post: Larz60+
	scraping from a website that hides source code	PIWI_Protein	1	1,985	Mar-27-2020, 05:08 PM Last Post: Larz60+
	Scraping not moving to the next pages in a website	jithin123	0	1,989	Mar-23-2020, 06:10 PM Last Post: jithin123
	Scraping from multiple URLS to print in a single line.	jb89	4	3,387	Jan-29-2020, 06:12 AM Last Post: perfringo

MaxRetryError while scraping a website multiple times

User Panel Messages

Announcements