select()
This lesson is called select(), part of the Fundamentals of R course. This lesson is called select(), part of the Fundamentals of R course.
Transcript
Click on the transcript to go to that point in the video. Please note that transcripts are auto generated and may contain minor inaccuracies.
Loading transcript...
View code shown in video
# Load Packages -----------------------------------------------------------
library(tidyverse)
# Import Data -------------------------------------------------------------
penguins <-
read_csv("penguins.csv")
# select() ----------------------------------------------------------------
penguins
# With select() we can select variables from the larger data frame.
penguins |>
select(bill_length_mm)
# We can also use select() for multiple variables:
penguins |>
select(bill_length_mm, bill_depth_mm)
# select() has several helper functions for selecting variables.
# The contains() function finds any variable with certain text
# in the variable name:
penguins |>
select(contains("bill"))
# The starts_with() function allows us to select variables
# that start with certain text:
penguins |>
select(starts_with("bill"))
# The ends_with() function allows us to select variables that end with certain text:
penguins |>
select(ends_with("mm"))
# We can select a range of columns using the var1:var2 pattern
penguins |>
select(species:bill_length_mm)
# We can drop variables using the -var format:
penguins |>
select(-bill_length_mm)
# We can drop a set of variables using the -(var1:var2) format:
penguins |>
select(-(bill_length_mm:flipper_length_mm))
Your Turn
Copy the code below into your R script file and complete the exercises.
# Load Packages -----------------------------------------------------------
# Load the tidyverse package
library(tidyverse)
# Import Data -------------------------------------------------------------
# Download data from https://rfor.us/penguins
# Copy the data into your Positron project where you downloaded the penguins data
# Overwrite your R code that you created previously with the code below
penguins <- read_csv("penguins.csv")
# select() ----------------------------------------------------------------
# Use select() to keep only the sex variable
# YOUR CODE HERE
# Use select() to keep the island and sex variables
# YOUR CODE HERE
# Use one of the select() helper functions to keep all variables that have the letter s in their names
# YOUR CODE HERE
# Use one of the select() helper functions to keep all variables that start with the letter b
# YOUR CODE HERE
# Use select() to keep the variables from island to the end
# YOUR CODE HERE
# Use the dropping syntax with - to keep the same variables as above (island to the end)
# YOUR CODE HERE
# Drop all variables from bill_length_mm to body_mass_g
# YOUR CODE HERE
Learn More
To learn more about the select() function, check out Chapter 3 of R for Data Science.
Have any questions? Put them below and we will help you out!
Course Content
33 Lessons
1
The Grammar of Graphics
04:36
2
Scatterplots
03:40
3
Histograms
04:51
4
Bar Charts
04:53
5
Setting color and fill Aesthetic Properties
02:43
6
Setting color and fill Scales
05:12
7
Setting x and y Scales
02:58
8
Adding Text to Plots
05:50
9
Plot Labels
02:59
10
Themes
02:10
11
Facets
02:56
12
Save Plots
02:49
13
Bring it All Together (Data Visualization)
06:14
You need to be signed-in to comment on this post. Login.