Python Forum
Exotic stats problem ; mode, fuzzy clusters, etc
Thread Rating:
  • 0 Vote(s) - 0 Average
  • 1
  • 2
  • 3
  • 4
  • 5
Exotic stats problem ; mode, fuzzy clusters, etc
#1
EXOTIC STATS PROBLEM ; MODE, FUZZY CLUSTERS, ETC

Hi I'm working on a math problem and not sure how to approach it.

Trying to enumerate, rank and extract most common numeric ranges
from a list, with a twist ;

Basic operation example is, I have ten numeric values representing
weights in variable unit measure types, i.e. grams, ounces, etc

Each value is unique and is a decimal value,
for example we have the following set of numbers
shown below. In this set, we are interested in the
most common magnitudinal range. Here the most
common range value is shown by three values ;

295.999, 312.015, 330.111

the complete set is shown ;
.........................
102.35
8000.32
330.111
295.999
77.01
16.999
1099.222
645
890.01
312.015
.........................

What I want to be able to do, is to input a list of ten values similar to the above,
and have some way, to simply, easily derive the most common value by range.

If I was using values that were more static, for example if all value in the range
in the list were all similar such as "310", then I could just use the mode function
and it would easily tell me this.

However since the values are variable decimal types, I am a bit stumped as to how
I would accomplish this. I came across Python Fuzzy Clustering and it looks like
this might work possibly in relation to mode but wondering if there is a simpler, easier, faster
way to do this?

The end goal is I want to be able to do pattern analysis on a list of numbers and return
the most common range of highest magnitude.

Outputs desired, as an example from the list above would be the thee values printed
to screen, text file, or variable

???? better way to do this than ;

1.) Fuzzy Clustering
2.) Mode (most common value) of discrete data.

What do I mean by define "most common value by range"

Any set of ten values, there will be a few values
that reoccur, but with slight variance. So for the above the most common range value would be 300 but with minor
variance for each instance. What I need to do is to have a function that can tell what number magnitude
occurs the most even though each instance is not fixed. If I had a set of ten numbers and out of ten
I had three exact values occur I could just use a mode function, but since here - the magnitudes are comprable
but not exact, mode wont work in the standard case, but there must be a way to achieve this?

Why am I using "three" common range values?

"three" is stated in the original post 'as an example'. Here, Im saying we have a fixed data set
of ten decimal numeric values. In the set, a certain quantity of the ten values will approach a similiar
threshold of magnitude, the other values with be random. The threshold 'cluster magnitude' could be any
range, in a spectral type set. So here I chose three values, only to illustrate the concept I am after.

This could change, so out of ten there could be four, five, six values with similiar magnitudes.

Another way to view this problem might be like a sensor array, where each of the ten values in the set are picking
up a pulse value represented by a decimal number and we want to interpret similiar values across the set as
something like pressure, a curve, etc.

The point is, while we can use a python mode type function to extract most common numeric values from a list
where the values are exactly alike and fixed, here we cannot use mode, because the values are not exactly
alike, only approximately alike, such as the examples I gave of ; 295.999, 312.015, 330.111
Reply


Messages In This Thread
Exotic stats problem ; mode, fuzzy clusters, etc - by amyvaulhausen - Aug-03-2018, 07:34 PM

Possibly Related Threads…
Thread Author Replies Views Last Post
  Fuzzy Wuzzy how to get my fuzzyratio displayed windingsareuseful 3 273 Apr-04-2024, 05:38 PM
Last Post: deanhystad
  things that work in terminal mode but not in sublime mode alok 4 2,897 Aug-11-2021, 07:02 PM
Last Post: snippsat
  seeking simple|clean|pythonic way to capture {1,} numeric clusters NetPCDoc 6 3,136 Jun-10-2021, 05:14 PM
Last Post: NetPCDoc
  Clusters dawid294 6 2,375 Sep-17-2020, 02:18 PM
Last Post: Larz60+
  sports Stats > table output loop problems paulfearn100 3 2,526 Jul-22-2020, 03:21 AM
Last Post: c_rutherford
  implementation fuzzy logic using sckit fuzzy nana_4895 0 2,105 Oct-21-2019, 03:28 AM
Last Post: nana_4895
  Disk usage stats. MuntyScruntfundle 2 2,546 Jan-22-2019, 10:53 PM
Last Post: MuntyScruntfundle
  Fuzzy match on text columns within dataframe Nsaibot 0 4,328 Aug-27-2018, 10:52 PM
Last Post: Nsaibot
  Installing fuzzy wuzzy terrancepython11 6 18,250 Mar-25-2017, 06:55 PM
Last Post: Larz60+

Forum Jump:

User Panel Messages

Announcements
Announcement #1 8/1/2020
Announcement #2 8/2/2020
Announcement #3 8/6/2020