Python Forum
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Excel Question
#6
If you are using Python, the simplest storage until you have found the duplicates is a pickle file containing a list of tuples (or namedtuples). Once the duplicates are found, you can write a new Excel file.

The reason for this is that Python is slow when it reads Excel files, while loading such a pickle file with 30000 records takes a fraction of a second.

That said, without more information about the contents of the data, it is difficult to elaborate a good strategy.
Reply


Messages In This Thread
Excel Question - by karaokelove - Jan-05-2018, 05:45 PM
RE: Excel Question - by hshivaraj - Jan-05-2018, 06:04 PM
RE: Excel Question - by karaokelove - Jan-05-2018, 06:08 PM
RE: Excel Question - by Povellesto - Jan-05-2018, 06:24 PM
RE: Excel Question - by karaokelove - Jan-05-2018, 06:31 PM
RE: Excel Question - by Gribouillis - Jan-05-2018, 07:41 PM
RE: Excel Question - by karaokelove - Jan-05-2018, 09:03 PM
RE: Excel Question - by Gribouillis - Jan-05-2018, 09:27 PM
RE: Excel Question - by karaokelove - Jan-05-2018, 09:35 PM
RE: Excel Question - by Gribouillis - Jan-05-2018, 09:38 PM
RE: Excel Question - by karaokelove - Jan-05-2018, 09:46 PM
RE: Excel Question - by snippsat - Jan-05-2018, 11:15 PM
RE: Excel Question - by karaokelove - Jan-05-2018, 11:17 PM

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020