Bottom Page

Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
 Python3 + BeautifulSoup4 + lxml (HTML -> CSV) - How to loop to next HTML/new CSV Row
#1
Hello Python Web Scrapers,

This is what I am currently up against and was hoping somebody could point me in the right direction.

Python3 + BeautifulSoup4 + lxml (HTML -> CSV):

How to loop to the next HTML URL and save as new CSV Row in the existing .csv that the current code scrapes from.

For instance: How would I make this URL do the above

Next HTML URL: https://law.justia.com/cases/federal/app...66/308423/

Python3 Code:

from urllib.request import urlopen
from bs4 import BeautifulSoup
html = urlopen("http://law.justia.com/cases/federal/appellate-courts/F2/999/663/308588/")
bsObj = BeautifulSoup(html.read())
allOpinion = bsObj.findAll(id="opinion")
import requests
from bs4 import BeautifulSoup

url = "http://law.justia.com/cases/federal/appellate-courts/F2/999/663/308588/"
allTitle = bsObj.findAll({"title"})
allURL = url

print(allOpinion)
print(allTitle)
print(allURL)

import csv
csvRow = [allOpinion,allTitle,allURL]
csvfile = "current_F2_opinion_with_tags_current.csv"
with open(csvfile, "a") as fp:
    wr = csv.writer(fp, dialect='excel')
    wr.writerow(csvRow)

print(allOpinion[0].get_text(),url)
 
import csv
csvRow = [allOpinion[0].get_text(),allTitle[0].get_text(),allURL]
csvfile = "current_F2_opinion_without_tags_current.csv"
with open(csvfile, "a") as fp:
    wr = csv.writer(fp, dialect='excel')
    wr.writerow(csvRow)

Thank you!

Best Regards,

Brandon Kastning

P.S. - Everyone be safe!
apollo likes this post
Quote

Top Page

Possibly Related Threads...
Thread Author Replies Views Last Post
  HTML loading process windows11 1 74 Apr-01-2020, 04:45 PM
Last Post: Larz60+
  how does a html form work exactly? mp3909 2 116 Apr-01-2020, 04:02 PM
Last Post: mp3909
  Selenium cant get elements from HTML(Rookie) Troop 1 85 Mar-31-2020, 03:37 AM
Last Post: Larz60+
  extrat data from a button html windows11 1 106 Mar-24-2020, 03:39 PM
Last Post: Larz60+
  Pandas tuple list returning html string shansaran 0 85 Mar-23-2020, 08:44 PM
Last Post: shansaran
  Python3 + BeautifulSoup4 + lxml (HTML -> CSV) - How to write 3 Columns to MariaDB? BrandonKastning 21 240 Mar-23-2020, 05:51 PM
Last Post: ndc85430
  How to POST html data to be handled by a route endpoint nikos 1 132 Mar-07-2020, 03:14 PM
Last Post: nikos
  How to get the href value of a specific word in the html code julio2000 2 169 Mar-05-2020, 07:50 PM
Last Post: julio2000
  BeautifulSoup4 plugin help Lathem01 2 174 Feb-16-2020, 11:56 AM
Last Post: snippsat
  Parse data from downloaded html nikos48 7 304 Jan-26-2020, 03:35 PM
Last Post: nikos48

Forum Jump:


Users browsing this thread: 1 Guest(s)