Dec-15-2018, 10:54 PM
This works here:
from urllib.request import urlopen
from bs4 import BeautifulSoup as bsoup

url = 'http://www.cfbstats.com/2018/team/234/index.html'
ofile = urlopen(url)
soup = bsoup(ofile, "html.parser", from_encoding='utf-8')

# Grab the schedule table and collect the text of every row
table = soup.find("table", attrs={"class": "team-schedule"})
datasets = []
mytable = table.find_all("tr")  # [1:] would skip the header row
for row in mytable:
    text = str(row.get_text()).split('\n')
    datasets.append(text)

# Print date, opponent and attendance in 30-character columns,
# leaving out the final row of the table
_len = len(datasets)
for x in range(_len - 1):
    t = datasets[x]
    print((t[1] + '\t' + t[2] + '\t' + t[5]).expandtabs(30))
Output:
Date                          Opponent                      Attendance
09/03/18                      Virginia Tech                 75,237
09/08/18                      Samford                       72,239
09/15/18                      @ 17 Syracuse                 37,457
09/22/18                      Northern Ill.                 65,633
09/29/18                      @ Louisville                  52,798
10/06/18                      @ Miami (Fla.)                65,490
10/20/18                      Wake Forest                   67,274
10/27/18                      2 Clemson                     68,403
11/03/18                      @ North Carolina St.          57,600
11/10/18                      @ 3 Notre Dame                77,622
11/17/18                      Boston College                57,274
11/24/18                      10 Florida                    71,953
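If you'd rather not rely on tabs for the alignment, here is a rough sketch of the same column layout using format specifiers, plus a way to turn the attendance strings into numbers. The helper names format_row and attendance_to_int are made up for this example, and the three rows are just copied from the output above. For fields shorter than the column width this should line up the same way as expandtabs(30).

```python
# Rows copied from the output above: (date, opponent, attendance).
rows = [
    ("09/03/18", "Virginia Tech", "75,237"),
    ("09/08/18", "Samford", "72,239"),
    ("09/15/18", "@ 17 Syracuse", "37,457"),
]


def format_row(date, opponent, attendance, width=30):
    """Left-align the first two columns in fixed-width fields (hypothetical helper)."""
    return f"{date:<{width}}{opponent:<{width}}{attendance}"


def attendance_to_int(text):
    """Strip the thousands separator so the value can be summed or sorted."""
    return int(text.replace(",", ""))


for row in rows:
    print(format_row(*row))

# Sum the attendance of the sample rows
total = sum(attendance_to_int(r[2]) for r in rows)
print("Total attendance:", total)
```

Once the attendance is an int you can also sort the games by it, e.g. sorted(rows, key=lambda r: attendance_to_int(r[2])).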