Introduction
Tidyverse is a collection of R packages designed for data science. All packages share a common design philosophy and grammar.
Installing Tidyverse
install.packages("tidyverse")
library(tidyverse)
Core Tidyverse Packages
| Package | Purpose |
|---|---|
| ggplot2 | Data visualization |
| dplyr | Data manipulation |
| tidyr | Data tidying |
| readr | Data import |
| purrr | Functional programming |
| tibble | Tibbles (modern data frames) |
| stringr | String manipulation |
| forcats | Factor handling |
Tibbles
library(tibble)
# Create tibble
df <- tibble(
x = 1:5,
y = c("a", "b", "c", "d", "e")
)
# Print behavior
df
# Convert to tibble
as_tibble(df)
Pipes
library(magrittr)
# Using pipes
df %>%
filter(condition) %>%
select(columns) %>%
mutate(new_column = x + y)
Data Transformation
library(dplyr)
# Basic verbs
df %>%
select(col1, col2) %>%
filter(row_condition) %>%
mutate(new_col = calculation) %>%
group_by(category) %>%
summarize(mean_value = mean(value))
Summary
Tidyverse provides a consistent framework for data manipulation and visualization in R.