Home > database >  visualization for categorical variables in R
visualization for categorical variables in R

Time:10-29

By having the following dataframe:

df = data.frame(Date=1:5,
           Cat1=c(1,0,1,0,0),
           Cat2=c(1,1,1,0,0),
           Cat3=c(0,1,1,0,1),
           Cat4=c(0,0,1,1,0))

By using ggplot2, how would be possible to achieve this plot?

enter image description here

CodePudding user response:

You need to pivot the data into long format, then convert the 0 and 1 values into factors. You can then plot with geom_tile, using the values as fill colours.

library(ggplot2)

ggplot(tidyr::pivot_longer(df, -1),
         aes(x = Date, y = factor(name, levels = rev(unique(name))), 
             fill = as.factor(value)))  
  geom_tile(color = "black")  
  scale_fill_manual(values = c("white", "grey50"))  
  labs(y = "")  
  theme_void()  
  theme(legend.position = "none",
        axis.text = element_text(size = 15),
        axis.title.x = element_text(size = 15),
        plot.margin = margin(20, 20, 20, 20))

enter image description here

And of course, you have lots of options with how the final plot looks. For example:

ggplot(tidyr::pivot_longer(df, -1),
         aes(x = Date, y = factor(name, levels = rev(unique(name))), 
             fill = as.factor(value)))  
  geom_tile(color = "black", size = 1)  
  scale_fill_manual(values = c("gold", "deepskyblue4"))  
  coord_equal()  
  labs(y = "")  
  theme_void()  
  theme(legend.position = "none",
        axis.text = element_text(size = 15),
        axis.title.x = element_text(size = 15),
        plot.margin = margin(20, 20, 20, 20))

enter image description here

  • Related