Thank you
I wrote this code:
Why this ?
this is the result.
I wrote this code:
from urllib.request import urlopen from bs4 import BeautifulSoup import re html = urlopen('http://www.orchidspecies.com/indexe-ep.htm') bs = BeautifulSoup(html, 'html.parser') images = bs.find_all('a') for image in images: prova=image.text if re.search('Earina sigmoidea', prova): #print(image.text) print(image.get('href'))But the result is not entirely correct, becouse the name 'Earina sigmoidea' in the page there is only one.
Why this ?
this is the result.
Output:earinaaestivalis.htm
earsigmoidea.htm