Python Forum

How to read Wikimedia multistream XML?
I know I can read XML either with DOM (the whole document in memory) or with SAX (event-based).
Wikimedia XML dumps are large, but fortunately they come with indices. I would prefer to read a chunk of about 0.5 MB from the XML, from byte n1 to byte n2 (offsets taken from the index), and process that chunk whole in memory. How can I do that?
This may help.
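For what it's worth, here is a minimal sketch of the chunked approach described in the question. It assumes the dump is the bz2-compressed multistream file, and that n1 and n2 are the byte offsets of two consecutive streams taken from the index. Each stream should then decompress to a handful of `<page>` elements with no enclosing root, so the sketch wraps them in a dummy root before parsing. The names `read_stream` and `parse_pages` are made up for illustration, and the element paths (`title`, `revision/text`) assume the usual MediaWiki export layout.

```python
import bz2
import xml.etree.ElementTree as ET


def read_stream(dump_file, start, end):
    """Return the decompressed bytes of the bz2 stream stored at [start, end).

    dump_file must be opened in binary mode ("rb"); start and end are
    assumed to be offsets of consecutive streams from the index.
    """
    dump_file.seek(start)
    compressed = dump_file.read(end - start)
    return bz2.decompress(compressed)


def parse_pages(xml_fragment):
    """Parse a rootless run of <page> elements entirely in memory.

    The fragment has no single root element, so it is wrapped in a
    dummy <pages> root to make it well-formed XML.
    """
    root = ET.fromstring(b"<pages>" + xml_fragment + b"</pages>")
    return [
        (page.findtext("title"), page.findtext("revision/text"))
        for page in root.iter("page")
    ]
```

Usage would look roughly like `with open(path, "rb") as f: pages = parse_pages(read_stream(f, n1, n2))`, where `path` points at the multistream `.bz2` file. Only one chunk is held in memory at a time, so the full dump never has to be decompressed.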