summarize()
This lesson is called summarize(), part of the Fundamentals of R (RStudio) course. This lesson is called summarize(), part of the Fundamentals of R (RStudio) course.
Transcript
Click on the transcript to go to that point in the video. Please note that transcripts are auto generated and may contain minor inaccuracies.
Loading transcript...
View code shown in video
# Load Packages -----------------------------------------------------------
library(tidyverse)
# Import Data -------------------------------------------------------------
penguins <- read_csv("penguins.csv")
# summarize() -------------------------------------------------------------
# With summarize(), we can go from a complete dataset down to a summary.
# We use any of the summary functions with summarize().
# Here's how we calculate the mean bill length.
penguins |>
summarize(mean_bill_length = mean(bill_length_mm))
# This doesn't work! Notice what the result is.
# We need to add na.rm = TRUE to tell R to drop NA values.
penguins |>
summarize(mean_bill_length = mean(bill_length_mm, na.rm = TRUE))
# Another option is to drop NA values before calling summarize().
penguins |>
drop_na(bill_length_mm) |>
summarize(mean_bill_length = mean(bill_length_mm))
# We can have multiple arguments in each usage of summarize().
penguins |>
summarize(mean_bill_length = mean(bill_length_mm, na.rm = TRUE),
max_bill_depth = max(bill_depth_mm, na.rm = TRUE))
Your Turn
# Load Packages -----------------------------------------------------------
# Load the tidyverse package
library(tidyverse)
# Import Data -------------------------------------------------------------
# Download data from https://rfor.us/penguins
# Copy the data into the RStudio project
# Create a new R script file and add code to import your data
penguins <- read_csv("penguins.csv")
# Calculate the weight of the heaviest penguin.
# Don't forget to drop NAs!
# YOUR CODE HERE
# Calculate the minimum and maximum weight of penguins in the dataset.
# YOUR CODE HERE
Learn More
To learn more about the summarize() function, check out Chapter 3 of R for Data Science.
Have any questions? Put them below and we will help you out!
Course Content
34 Lessons
1
The Grammar of Graphics
04:39
2
Scatterplots
03:46
3
Histograms
05:47
4
Bar Charts
06:37
5
Setting color and fill Aesthetic Properties
02:39
6
Setting color and fill Scales
05:40
7
Setting x and y Scales
03:09
8
Adding Text to Plots
07:32
9
Plot Labels
03:57
10
Themes
02:19
11
Facets
03:12
12
Save Plots
02:57
13
Bring it All Together (Data Visualization)
06:42
You need to be signed-in to comment on this post. Login.