Aug-25-2018, 08:46 AM
Using
xpath()
method of ElementTree you could query all td elements without span child like this:from lxml import html html_text = """<html> <head> <title>foo</title> </head> <body> <tr> <td><span class="number blue">xx</span></td> <td>001</td> <td>002</td> </tr>> </body> </html>""" et = html.fromstring(html_text) spans = et.xpath('//tr/td/span[@class="number blue"]') print(spans[0].text) for e in et.xpath('//tr/td[not(span)]'): print(e.text)