Python Forum
parsing local xml files to csv
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
parsing local xml files to csv
#5
Hello;
Thank you very much for your help and your efforts.
I started doing something with your solutions and examples.
Trying to adapt slowly to your example
from bs4 import BeautifulSoup
lst = []
soup = BeautifulSoup(open('NPL_2008_LFS_v01_M_v01_A_ILOVAR.xml', encoding='utf-8'), 'xml')
title = soup.find('titl')
producer = soup.find('producer')
print(title.text.strip())
print(producer.attrs.get('affiliation'))
data = soup.find('dataDscr')
vars = data.find_all('var')
for var in vars:
    ID=var.attrs.get('ID')
    name=var.attrs.get('name')
    files=var.attrs.get('files')
    dcml=var.attrs.get('dcml')
    intrvl=var.attrs.get('intrvl')
    labl=var.find('labl').text.strip()
    sumStat=[i.text.strip() for i in var.find_all('sumStat')]
    print(title.text.strip(),producer.text.strip(),ID,name,files,dcml,intrvl,labl,sumStat)
    lst.append((title.text.strip(),producer.text.strip(),ID,name,files,dcml,intrvl,labl,sumStat))
    
I couldn't do it even though I tried a lot.So I want to ask 2 things here.
1-is it possible to write text of type for sumstat
# All sumStat
>>> [i.text.strip() for i in var.find_all('sumStat')]
as below or otherwise
['vald':'76208', 'invd':'0', 'min':'1001', 'max':'1800', 'mean':'1412.79', 'stdev':'231.955']


2.How do I combine category tags?

<catgry>
        <catValu>
          1
        </catValu>
        <labl>
          Eastern
        </labl>
        <catStat type="freq">
          16926
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          2
        </catValu>
        <labl>
          Central
        </labl>
        <catStat type="freq">
          31316
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          3
        </catValu>
        <labl>
          Western
        </labl>
        <catStat type="freq">
          13527
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          4
        </catValu>
        <labl>
          Mid-Western
        </labl>
        <catStat type="freq">
          8060
        </catStat>
      </catgry>
      <catgry>
        <catValu>
          5
        </catValu>
        <labl>
          Far-Western
        </labl>
        <catStat type="freq">
          6379
        </catStat>
</catgry>
Like the example below
['1-Eastern','freq':'16926'] | ['2-Central','freq':'31316'] | ['3-Western','freq':'13527'] |......

I would like your help.
Best regards
Reply


Messages In This Thread
parsing local xml files to csv - by erdem_ustunmu - Feb-23-2019, 11:30 AM
RE: parsing local xml files to csv - by snippsat - Feb-23-2019, 12:22 PM
RE: parsing local xml files to csv - by snippsat - Feb-23-2019, 10:41 PM
RE: parsing local xml files to csv - by erdem_ustunmu - Feb-24-2019, 11:46 AM
RE: parsing local xml files to csv - by snippsat - Feb-25-2019, 03:24 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  uploading files from a ubuntu local directory to Minio storage container dchilambo 0 520 Dec-22-2023, 07:17 AM
Last Post: dchilambo
  How to take the tar backup files form remote server to local server sivareddy 0 1,941 Jul-14-2021, 01:32 PM
Last Post: sivareddy
  opening files and output of parsing leodavinci1990 4 2,607 Oct-12-2020, 06:52 AM
Last Post: bowlofred
  Parsing Xml files >3gb using lxml iterparse Prit_Modi 2 2,403 May-16-2020, 06:53 AM
Last Post: Prit_Modi
  Parsing Attached .MSG Files with Python3 ericl42 1 3,718 Apr-12-2019, 06:28 PM
Last Post: ericl42
  Fetching html files from local directories shiva 3 3,465 Mar-20-2018, 05:12 PM
Last Post: wavic

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020