I post here the entire table structure to perfectly visualize what I try to scrape.
I want to extract the phone, email, website, main activity (li element text without the div)
UPDATE: I forgot to mention that i ran into error because sometimes there is no email or website available vice versa, and code does not understand and breakes the entire cycle. I think there should be some error control somehow.
I want to extract the phone, email, website, main activity (li element text without the div)
UPDATE: I forgot to mention that i ran into error because sometimes there is no email or website available vice versa, and code does not understand and breakes the entire cycle. I think there should be some error control somehow.
<table class="table-info"> <tbody> <tr> <td class="col-1"> <div class="col-1-text">Business name</div> </td> <td class="col-2"> <div class="col-2-text">Company XYZ </div> </td> </tr> <tr> <td class="col-1"> <div class="col-1-text">Register code:</div> </td> <td class="col-2"> <div class="col-2-text">112233558</div> </td> </tr> <tr> <td class="col-1"> <div class="col-1-text">Operating address:</div> </td> <td class="col-2"> <div class="col-2-text"><a target="googlemaps" href="https://www.google.com/maps/place/Some-location" class="link-location">Some location strt. 233</a></div> </td> </tr> <tr> <td class="col-1"> <div class="col-1-text">Legal address</div> </td> <td class="col-2"> <div class="col-2-text"> <a class="link-location" href="https://www.google.com/maps/place/Some-location" target="_new">Some location </a> </div> </td> </tr> <tr> <td class="col-1"> <div class="col-1-text">VAT No:</div> </td> <td class="col-2"> <div class="col-2-text"><a href="javascript:void(0)" onclick="return getVAT(this, '12345678')">Get VAT liability</a></div> </td> </tr> <tr> <td class="col-1"> <div class="col-1-text">Age:</div> </td> <td class="col-2"> <div class="col-2-text">1 year 3 months</div> </td> </tr> <tr> <td class="col-1"> <div class="col-1-text">Founded:</div> </td> <td class="col-2"> <div class="col-2-text">20/09/2019</div> </td> </tr> <tr> <td class="col-1"> <div class="col-1-text">Capital:</div> </td> <td class="col-2"> <div class="col-2-text">2000 USD</div> </td> </tr> <tr> <td colspan="2" class="sep"></td> </tr> <tr> <td class="col-1"> <div class="col-1-text">Phone:</div> </td> <td class="col-2"> <div class="col-2-text">123456789</div> </td> </tr> <tr> <td class="col-1"> <div class="col-1-text">E-mail:</div> </td> <td class="col-2"> <div class="col-2-text"><a href="mailto:[email protected]">[email protected]</a></div> </td> </tr> <tr> <td class="col-1"><div class="col-1-text">Website:</div></td> <td class="col-2"><div class="col-2-text"><a href="http://www.somecompany.com" target="_blank">www.somecompany.com</a></div></td> </tr> <tr> <td colspan="2" class="sep"></td> </tr> <tr> <td class="col-1"> <div class="col-1-text">Representatives:</div> </td> <td class="col-2"> <div class="col-2-text"> <div class="box-message"> <p class="desc">To access information, please</p> <p> <a href="#" onclick="return loginClicked(this, '#');" class="btn btn-small btn-purple link-login">Log in</a> </p> </div> </div> </td> </tr> <tr> <td colspan="2" class="sep"></td> </tr> <tr> <td class="col-1"> <div class="col-1-text"> Main activity: <span class="tip info" title="" data-original-title="Activities are classified according to EMTAK 2008"></span> </div> </td> <td class="col-2"> <div class="col-2-text" id="activity_top5ffe2eab23d13"> <ul> <li> Computer consultancy activities <div class="main_activities_top_link_wrapper"> <a href="https://www.somesite.com/" target="_blank" onclick="ga('send', 'event', 'check', 'top_btn', 'Anonym');" class="btn btn-simple btn-open-graph"> <span>Open TOP 20</span> </a> </div> </li> </ul> </div> </td> </tr> </tbody> </table>