Python Forum
Clustering for imbalanced data sets
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Clustering for imbalanced data sets
#1
Hi all,
I have a set of measurements with four features.


Two features are continuous (time and distance) and two are discrete.


We also know that the population consists of two groups. One is the minority with having around 10% samples and the other 90% of the samples being the second group. At the same time we do-not know exactly to which group each sample belongs and this is the need we want clustering to show us some hints on how to differentiate the two groups.

We would like to see if a clustering algorithm can see these two populations. Agglomerative clustering with cosine affinity performed by far the best, and I think this is because cosine affinity sees more the structure instead of the numeric values.

I wanted also to try some other techniques that can work on such imbalanced dataset. Can you propose me which technique might fit here?
Regards
Alex
Reply


Possibly Related Threads…
Thread Author Replies Views Last Post
  Stock clustering and portfolio diversification. Suitable features. timurkanaz 1 67 Yesterday, 09:54 AM
Last Post: Larz60+
  K Means Clustering antouanet 0 660 Jan-30-2023, 01:18 PM
Last Post: antouanet
  replace sets of values in an array without using loops paul18fr 7 1,630 Jun-20-2022, 08:15 PM
Last Post: paul18fr
  Data sets comparison Fraetos 0 1,384 Sep-14-2021, 06:45 AM
Last Post: Fraetos
  Mann Whitney U-test on several data sets rybina 2 2,051 Jan-05-2021, 03:08 PM
Last Post: rybina
  Least-squares fit multiple data sets multiverse22 1 2,226 Jun-06-2020, 01:38 AM
Last Post: Larz60+
  text clustering evaluation ?? khalidreemy 1 2,130 May-29-2019, 03:10 AM
Last Post: heiner55
  Clustering based on a variable and on a distance matrix flucoe 2 6,176 Dec-16-2018, 09:57 PM
Last Post: flucoe
  Compare 2 Csv data sets, identify record with latest date MJUk 11 6,096 Jan-06-2018, 09:23 PM
Last Post: MJUk
  Match two data sets based on item values klllmmm 7 6,355 Mar-29-2017, 02:33 PM
Last Post: zivoni

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020