Python Forum
Counting Duplicates in large Data Set
#2
A simple simulation would do: call random.sample n times on the desired range, convert each result to a tuple, and feed the tuples to collections.Counter. Then inspect the counts. If order doesn't matter, sort each sample before converting it to a tuple. I tried it with n=1000000 and 1, 2, 3, 4, 5, 6 never appeared in the results (I used range(1, 49)).
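Roughly, the idea above could look like the sketch below. The post doesn't say how many numbers are drawn each time, so k=6 is an assumption taken from the 1, 2, 3, 4, 5, 6 example; the function name count_draws and the seed parameter are made up for illustration.

```python
import random
from collections import Counter


def count_draws(n, population=range(1, 49), k=6, ordered=False, seed=None):
    """Draw k distinct numbers n times and count how often each draw occurs.

    k=6 is an assumption; the post only gives range(1, 49).
    """
    rng = random.Random(seed)  # seeded generator so a run can be repeated
    counts = Counter()
    for _ in range(n):
        draw = rng.sample(population, k)
        if not ordered:
            # Order doesn't matter: sort so (3, 1, 2) and (1, 2, 3) match.
            draw = sorted(draw)
        counts[tuple(draw)] += 1
    return counts


counts = count_draws(100_000, seed=0)

# Draws that came up more than once are the duplicates.
duplicates = {combo: c for combo, c in counts.items() if c > 1}

print(len(counts), "distinct draws,", len(duplicates), "seen more than once")
print("1..6 seen", counts[(1, 2, 3, 4, 5, 6)], "times")
```

Counter returns 0 for a key it has never seen, so looking up (1, 2, 3, 4, 5, 6) is safe even if that draw never came up. Raise n to 1000000 to match the run described above; it just takes longer.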


Messages In This Thread
Counting Duplicates in large Data Set - by jmair - Dec-06-2022, 01:52 PM
RE: Counting Duplicates in large Data Set - by perfringo - Dec-06-2022, 03:35 PM

