Python Forum
How to read Wikimedia multistream XML?
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
How to read Wikimedia multistream XML?
#1
I can read XML using Dom whole in memory or Sax using events?
Wikimedia XML are large but fortunately have indices. I prefer read about 0.5 MB chunk from XML from byte n1 to byte n2 (from indices) and process this chunk whole in memory. How can do it?
Reply


Messages In This Thread
How to read Wikimedia multistream XML? - by AndrzejB - Mar-08-2023, 05:22 PM

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020