Python Forum
How to read Wikimedia multistream XML?
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
How to read Wikimedia multistream XML?
#1
I can read XML using Dom whole in memory or Sax using events?
Wikimedia XML are large but fortunately have indices. I prefer read about 0.5 MB chunk from XML from byte n1 to byte n2 (from indices) and process this chunk whole in memory. How can do it?
Reply
#2
This may help.
Reply


Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020