Estimates
We can certainly make some estimates based on this information. From our graph, we see that blue is the tallest bar, so blue is the most popular color. We can estimate that the majority of Betta owners keep blue Bettas. We see that yellow is the shortest, so we can estimate that yellow Bettas are not as popular among Betta owners.
The majority of Betta owners keep blue, red and purple Bettas. So, if I had a Betta store, I would do best by selling these three colors. Yellows and whites are very few, so I wouldn't do so well by selling those. These are the kinds of estimates I can make based on what I see from our bar graph.
Predictions
What about predictions? The thing about categorical data is that because they are groups, there is not much to say about other groups that aren't listed on the graph. If you don't have data for other groups, there is not much you can predict about them. With mathematical data, you can continue the pattern and make a prediction on what may happen with data that is outside the range of the data you have. But with categorical data, you can't do that because the groups you have data for are not connected to other groups.
Looking at our graph, can we make a prediction about colors that aren't listed? No, because we can't say for certain what color group will come next. And, there is no pattern to draw out.
Lesson Summary
What have we learned? We've learned that categorical data is data that can be grouped. A good visual way to represent categorical data is with a bar graph, a graph with bars of varying heights. We can make estimates about what is most common and what is least common, but we can't make predictions on categorical data.
Learning Outcomes
Following this lesson, you'll have the ability to:
• Define categorical data and bar graph
• Explain how to read a bar graph
• Describe what kind of estimates you can make by reading a bar graph
• Summarize why you cannot make predictions based off categorical data