How to create and interpret descriptive statistics in R

How to create and interpret descriptive statistics in R
88 / 100

Hey there! Welcome to Statssy! Today we will learn how to calculate and interpret descriptive statistics in R.

Let us consider the daily sales of mangoes from your nearby shop for 28 days. Sometimes the sales are high, and sometimes the sales are low. But to understand the shop’s mango sales performance, we need to summarize the data. descriptive statistics in R

#daily sales written in R

mangoes_sales <- c (20 , 56, 58, 9, 10, 59, 6, 15, 17, 50, 16, 18, 95, 20,15,13,17,58,10,25,26,57,1,59,28,97,34,10)

#summarize the data

summary(mangoes)
descriptive statistics in R

Explicit Interpretation

Now let us interpret each of the results one by one.

  • Min. -> It represents the minimum value in the dataset. Here it is 1.0, which means the least shopkeeper sold in a day is 1.0 mango.
  • 1st Qu. -> It represents the 25th percentile or first quartile of the dataset. Here it is 14.5, meaning 25% of the day the shopkeeper sold less than 14.5 mangoes, and 75% of days she sold more than 14.5. 
  • Median -> It represents the middle most value of the dataset. Here it is 20, meaning 50% of the days, the shopkeeper sold more than 20 mangoes, and for the rest, 50%, she sold less than 20 mangoes. 
  • Mean -> It represents the average number of mangoes sold. The average number means dividing total sales by the total number of days. 
  • 3rd Qu. -> It represents the 75th percentile or third quartile of the dataset. Here it is 56.25, which means on 25% of the days, the shopkeeper sold more than 56.25 mangoes, and on 75% of the days, the sales were less than 56.25 mangoes. 
  • Max -> It presents the maximum value in the dataset. Here it is 97, meaning the maximum shopkeeper sold in a day is 97.descriptive statistics in R

Implicit Interpretation

Now since we have interpreted what is directly available, there are some things that we can imply from these results. 

For example, consider 25th and 75th percentile. We interpret each of them now if we see the difference between the two,

75th – 25th percentile = 56.25 – 14.50 = 41.75

This 41.75 value obtained by subtracting the 75th and 25th percentile is called IQR or Inter-quartile range. Why?

Because it represents the values between two quartiles, i.e., the first and third quartiles. Now, what does this means?

It shows where the middle 50% of the data values lie. In this case, we can say that on 50% of the days, the shopkeeper sold mangoes between 14.50 and 56.25.

Further Quests: Level Up Your Data Game

Ready to take your data game to the next level? Here’s a treasure trove of resources that are as binge-worthy as the latest Netflix series.

Books & Articles : descriptive statistics in R

Statistics Fundamentals

R Programming

Python Programming

So, are you ready to embark on your next data quest?

Submit a Comment

Your email address will not be published. Required fields are marked *

academic Sidebar Image

Unlock the power of data with our user-friendly statistics calculator.

Explore now
academic Sidebar Image

Explore our data science courses to supercharge your career growth in the world of data and analytics.

REGISTER FOR COURSES
academic Sidebar Image

Test Your Skills With Our Quiz

SOLVE QUIZ

Contact me today! I have solution to all your problems.

Please enable JavaScript in your browser to complete this form.