Statistics: Distributions – Joint

Delivering Multidimensional Understanding

In this article I will demonstrate how to use a data set to communicate joint (or two) distributions of data.

First, I will convert the table into a relative cumulative table, beginning with the first age group (0-8).
In this formula, I’m instructing Excel to divide the first diagnosis number (56) to produce a percentage. Note my use of the ‘$’ character in the formula to ensure when I duplicate the formula to other cells, the same range of values are used.

Now that I have the first percentage for the age group 0-8 for the year 2000, I’ll duplicate the formula to the adjacent cells, then create a row total.

Next, I’ll duplicate the formula to other age groups’ rows, then create their totals. Finally, I’ll produce a total for each column. Notice how each column totals summed equal 100% as well as each row totals.

With both (joint) distributions (Age Groups and Years), I can now easily see the highest percentage of disease diagnoses occurred for the two oldest age groups, in the year 2000, and the least percentage of diagnoses occurred for all three years between age groups of 26-30 and 31-40, with the age group of 19-25 only in the year 2000.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out /  Change )

Google photo

You are commenting using your Google account. Log Out /  Change )

Twitter picture

You are commenting using your Twitter account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s