I have a set of data: about 100,000 positive integers. For simplicity let's say 10,000 values from each of the last ten years.
Each year, around 1/3 of the values are the number 50.
Around 1/3 of the values are between 1 and 49, with lots of 25s and 10s.
Around 1/3 of the values are greater than 50: large numbers of 100s, quite a few 250s, and so on, all the way up to a small number of values (maybe 50 per year) in the 10,000 to 100,000 range.
Can anybody think of a good way to characterize this data, in terms of an annual value of some sort?
An average isn't very meaningful, because a few large values have such a huge impact on it.
The median value is always 50, so that's meaningless as well.
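To make the problem concrete, here is a rough sketch in Python. The numbers are made up for illustration (30 values instead of 10,000, and the names `year_a` and `year_b` are hypothetical); they are not my real data, only shaped like the description above. The two toy years differ in one value.

```python
from statistics import mean, median

# Made-up toy "year": 30 values, one third in each band (not real data).
low = [10, 10, 10, 25, 25, 25, 25, 5, 40, 1]                 # between 1 and 49
mid = [50] * 10                                              # exactly 50
high = [100, 100, 100, 100, 250, 250, 500, 1000, 100, 250]   # greater than 50

year_a = low + mid + high

# Same toy year, except the last value is replaced by one very large value
# from the 10,000 to 100,000 range.
year_b = low + mid + high[:-1] + [80000]

# The median sits in the block of 50s in both years.
print("median:", median(year_a), median(year_b))

# The mean moves a lot because of the single large value.
print("mean:  ", mean(year_a), mean(year_b))
```

With these toy numbers the median does not change between the two years, while the mean changes by more than a factor of ten from a single value, which is the behavior I see in the real data.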
-- Eric