Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
email scraper_help
#1
This is a video I watched on you tube. It work ok from him, not for me.
In this code I have comments telling you what  I know about this code.
I am using selenium, I have never used this mod. be for. here it is:
# I know what this does, I just don't know if it is right 
from selenium import webdriver
#I know this is right
import re
#this is where the problem is,webdriver.chrome() is the problem.
driver = webdriver.chrome()
#This open thr url
driver,get("http://www.networksecuritybybluedog.com/")
#This is a var that make the page sourse the same as doc
doc = driver.page_sourse
#Now I know that emails = re.findall is looking for everything that
#match this r'[\w\.-]+@[\w\.-]+' I am not sure how this works. To me
#it looks like r+read all that match this [\w\.-]+@[\w\.-]. It looks
# to me that this is sayin write every thing from the space befor 
# @ write to the next space everything after@
emails = re.findall(r'[\w\.-]+@[\w\.-]+', doc)
#this is a for loop email keep looping untill there no more emails
for email in emails:
# this is print the var email
    print(email)
Error:
Traceback (most recent call last):   File "C:\Users\renny and kite\Desktop\email_scraper\test_one\test_one\test_one .py", line 4, in <module>     driver = webdriver.chrome() TypeError: 'module' object is not callable Press any key to continue . . .
I have a felling this is 3.5 python

I hope some one can help me understand this code better
Thank you Think
Reply
#2
i believe its Chrome() 

and

http://stackoverflow.com/questions/82559...h-selenium
Recommended Tutorials:
Reply
#3
never mind, it looks like I have to install chrome. We will see how hard this is going be.
thank you
Reply
#4
Have to capitalize chrome() and set path.
driver = webdriver.Chrome(executable_path="C:/path_to/chromedriver.exe")
Reply
#5
Thank you snippsat
Reply
#6
It seems like you're trying to use Selenium to scrape emails from a website, but you're encountering an error related to the webdriver initialization. The error message indicates that you're trying to call a module as if it were a function, which is causing the TypeError.

The correct way to initialize a webdriver for Chrome in Selenium is to use webdriver.Chrome() (with a capital "C" in Chrome). Here's a corrected version of your code with explanations:


# Import the required module correctly
from selenium import webdriver
import re

# Initialize the Chrome webdriver instance
driver = webdriver.Chrome()  # Use 'Chrome' instead of 'chrome'

# Open the URL
driver.get("http://www.networksecuritybybluedog.com/")

# Get the page source
doc = driver.page_source

# Find all email addresses using regular expression
emails = re.findall(r'[\w\.-]+@[\w\.-]+', doc)

# Loop through the emails and print them
for email in emails:
    print(email)

# Close the browser window when done
driver.quit()
Make sure you have the Selenium library and the appropriate web driver executable (in this case, ChromeDriver) installed and properly configured. You should also remember to close the browser window using driver.quit() after you're done to free up system resources.

Additionally, ensure that you have the Chrome web driver executable installed and added to your system's PATH. You can download ChromeDriver from the official site: https://sites.google.com/chromium.org/driver/
Reply
#7
@Gaurav_Kumar look at date of Thread and the answer given,before answer.
This Thread was solved in 2016.
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Django send email - email form Remek953 2 2,248 Sep-18-2020, 07:07 AM
Last Post: Remek953

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020