Python Forum
Extracting data from a website
#1
Hello,

I have a very basic problem and would be happy if someone could help me out.

I would like to extract price data of a stock from a website. So far I have used the following code to retrieve the data from the website.

import pandas as pd

url = 'https://www.ariva.de/varta-aktie/kurs'
df = pd.read_html(url)
print(df)
Output:
[    0   1   2   3   4
0 NaN NaN X-DAX 14.064 +0,47% Dow Jones 31.056 +0,05% ... NaN NaN,
          0       1       2           3         4       5
0     X-DAX  14.064  +0,47%   Dow Jones    31.056  +0,05%
1  L-TecDAX   3.281  +0,79%  Dollarkurs  1222.000  -0,35%,
          0    1       2        3
0       NaN  NaN     NaN      NaN
1  124,20 €  NaN  -1,51%  -1,90 €,
        Handelsplatz   Letzter  Unnamed: 2 Änderung  Änderung.1    Vortag  \
0          Tradegate  124,20 €         NaN   -1,51%         NaN  126,10 €
1             Gettex  124,50 €         NaN   -1,66%         NaN  126,60 €
2            Quotrix  124,40 €         NaN   -1,58%         NaN  126,40 €
3             L&S RT  124,45 €         NaN   -1,27%         NaN  126,05 €
4    HypoVereinsbank  122,05 €         NaN   -3,21%         NaN  126,10 €
5              Xetra  126,90 €         NaN   -0,08%         NaN  127,00 €
6          Stuttgart  124,30 €         NaN   -1,43%         NaN  126,10 €
7          Frankfurt  121,00 €         NaN   -4,20%         NaN  126,30 €
8            Hamburg  121,00 €         NaN   -4,35%         NaN  126,50 €
9            München  125,80 €         NaN   -0,32%         NaN  126,20 €
10            Berlin  125,80 €         NaN   -0,16%         NaN  126,00 €
11        Düsseldorf  125,40 €         NaN   -0,24%         NaN  125,70 €
12  Nasdaq OTC Other  146,34 $         NaN   +8,40%         NaN  135,00 $
13          Hannover  128,50 €         NaN   +2,23%         NaN  125,70 €
14              Wien  126,50 €         NaN   +1,36%         NaN  124,80 €
Now I want to create a variable that holds only the first price for "Xetra" (= 126,90 €) and nothing else. However, I do not know how to reference that value in the table in this specific context.

Any help would be highly appreciated!

Many thanks,

Tim
#2
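Something like this.
>>> import pandas as pd
>>>
>>> url = 'https://www.ariva.de/varta-aktie/kurs'
>>> # read_html() returns a list of DataFrames, one per table found on the page
>>> df = pd.read_html(url)
>>> # Get the right table from the HTML (the one with the Handelsplatz column)
>>> df = df[3]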
>>> df_new = df.iloc[[5]]
>>> df_new
  Handelsplatz   Letzter  Unnamed: 2  ...      Zeit  Unnamed: 10  Unnamed: 11
5        Xetra  126,90 €         NaN  ...  08.01.21          NaN  Hist. Kurse

[1 rows x 12 columns]

>>> value = df_new['Letzter'].values[0]
>>> value
'126,90\xa0€'
>>> print(value)
126,90 €
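A possible follow-up to the lookup above, as a rough sketch only: `iloc[[5]]` depends on Xetra staying in row 5, so selecting the row by the value in `Handelsplatz` may be more robust, and the string `'126,90\xa0€'` can be turned into a number if it is needed for calculations. The small DataFrame below is a stand-in built from values shown in this thread (it is not fetched from the site), and `last_price` is a hypothetical helper name, not part of pandas.

```python
import pandas as pd

# Stand-in for the table at index 3 of pd.read_html(url); values copied
# from the output posted in this thread (assumption: same column names).
table = pd.DataFrame({
    "Handelsplatz": ["Tradegate", "Xetra", "Stuttgart"],
    "Letzter": ["124,20\xa0€", "126,90\xa0€", "124,30\xa0€"],
})


def last_price(table, venue):
    """Return the 'Letzter' price of one venue as a float (hypothetical helper)."""
    # Select by label instead of by row position
    matches = table.loc[table["Handelsplatz"] == venue, "Letzter"]
    if matches.empty:
        raise KeyError(f"venue not found: {venue}")
    text = matches.iloc[0]
    # Drop the non-breaking space and the currency symbol
    number = text.replace("\xa0", " ").split()[0]
    # German number format: '.' groups thousands, ',' is the decimal mark
    return float(number.replace(".", "").replace(",", "."))


xetra = last_price(table, "Xetra")
print(xetra)
```

With the real page, `table` would be `pd.read_html(url)[3]` as in the post above, assuming the site still puts the trading venues in that table.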
#3
(Jan-08-2021, 11:14 PM)snippsat Wrote: Something like this.

Many thanks snippsat, really helpful to know!

