![]() |
How to read Wikimedia multistream XML? - Printable Version +- Python Forum (https://python-forum.io) +-- Forum: Python Coding (https://python-forum.io/forum-7.html) +--- Forum: General Coding Help (https://python-forum.io/forum-8.html) +--- Thread: How to read Wikimedia multistream XML? (/thread-39559.html) |
How to read Wikimedia multistream XML? - AndrzejB - Mar-08-2023 I can read XML using Dom whole in memory or Sax using events? Wikimedia XML are large but fortunately have indices. I prefer read about 0.5 MB chunk from XML from byte n1 to byte n2 (from indices) and process this chunk whole in memory. How can do it? RE: How to read Wikimedia multistream XML? - Larz60+ - Mar-09-2023 This may help. |