Python Forum
sorting a strange file
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
sorting a strange file
#1
i have a big file i need to sort. it has 2 or more whitespace separated tokens on each line. the last token has 2 or more slash separated names. i need to sort the lines in the order of the last name of the last token as the primary key and all the tokens before the last one as the secondary key with all the whitespace between them compared as if it is a single space. it looks like they are a single space but i can't be so sure because the file has about 88 million lines in 9GB. the system has 16GB RAM and 16GB swap space. i can reboot before running this sort. the sort command does not appear to have the ability to do this so i am thinking of doing this in Python. what i envision doing first is read in all lines of the file into a giant list. a sort key function would do all that funny parsing and comparison, optimized to skip parsing for the secondary keys if the primary keys are not equal. also, i need to do the comparison in a case insensitive way, but that shoulb easy enough. finally, the sorted list would be written out. does this sound fun? do i need another bottle of whiskey?
Tradition is peer pressure from dead people

What do you call someone who speaks three languages? Trilingual. Two languages? Bilingual. One language? American.
Reply


Messages In This Thread
sorting a strange file - by Skaperen - Oct-13-2018, 02:15 AM
RE: sorting a strange file - by Gribouillis - Oct-14-2018, 01:59 PM
RE: sorting a strange file - by wavic - Oct-14-2018, 02:13 PM
RE: sorting a strange file - by Gribouillis - Oct-14-2018, 02:28 PM
RE: sorting a strange file - by wavic - Oct-14-2018, 02:54 PM
RE: sorting a strange file - by Gribouillis - Oct-14-2018, 06:04 PM
RE: sorting a strange file - by wavic - Oct-14-2018, 06:24 PM
RE: sorting a strange file - by Larz60+ - Oct-14-2018, 07:56 PM
RE: sorting a strange file - by wavic - Oct-14-2018, 11:05 PM
RE: sorting a strange file - by Skaperen - Oct-15-2018, 02:12 AM
RE: sorting a strange file - by wavic - Oct-15-2018, 05:03 AM
RE: sorting a strange file - by Skaperen - Oct-15-2018, 06:16 AM
RE: sorting a strange file - by wavic - Oct-15-2018, 06:24 AM
RE: sorting a strange file - by Gribouillis - Oct-15-2018, 09:37 AM
RE: sorting a strange file - by Skaperen - Oct-15-2018, 07:31 PM
RE: sorting a strange file - by wavic - Oct-15-2018, 07:52 PM

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020