Python Forum
How to read Wikimedia multistream XML? - Printable Version

+- Python Forum (https://python-forum.io)
+-- Forum: Python Coding (https://python-forum.io/forum-7.html)
+--- Forum: General Coding Help (https://python-forum.io/forum-8.html)
+--- Thread: How to read Wikimedia multistream XML? (/thread-39559.html)



How to read Wikimedia multistream XML? - AndrzejB - Mar-08-2023

I can read XML using Dom whole in memory or Sax using events?
Wikimedia XML are large but fortunately have indices. I prefer read about 0.5 MB chunk from XML from byte n1 to byte n2 (from indices) and process this chunk whole in memory. How can do it?


RE: How to read Wikimedia multistream XML? - Larz60+ - Mar-09-2023

This may help.