Home > other >  R: Reference column name with paste
R: Reference column name with paste

Time:06-28

I am attempting to reference a column in my data frame with paste given a variable name and the column is not being recognized. Below is an example data frame and variable names that I am attempting to work with:

arm <- "ARM"

ex_df <- data.frame(col1 = c("A","B","C"), col2 = c("X","Y","Z"), ARM_1 = c(1,2,3), ARM_2 = c(4,5,6,), ARM_3 = c(7,8,9), ARM_4 = c(0,0,0))

ex_df %>%
    transmute(col1 = col1, col2 = str_to_title(col2), paste0(arm,"_",4) = paste0(arm,"_",1)   paste0(arm,"_",2)   paste0(arm,"_",3)) %>%
    arrange(col1,col2)

Error: unexpected '=' in:
    "col2 = str_to_title(col2),
    paste0(arm,"_",4) = "

The desired output should read ARM_4 = ARM_1 ARM_2 ARM_3 and output the sum of each columns values.

Thank you for your help.

CodePudding user response:

We can use := with !! - along with rowSums on the columns that starts_with the value stored in arm

library(dplyr)
library(stringr)
ex_df %>%
    transmute(col1 = col1, col2 = str_to_title(col2), 
    !!paste0(arm,"_",4) := rowSums(across(starts_with(arm)), na.rm = TRUE))

-output

 col1 col2 ARM_4
1    A    X    12
2    B    Y    15
3    C    Z    18

NOTE: rowSums would be better compared to - 1) take care of NA elements with na.rm = TRUE, 2) compact option when there are more than 10 'ARM' columns


In case we want to get the value of the column with paste

ex_df %>%
    transmute(col1 = col1, col2 = str_to_title(col2), 
   !!paste0(arm,"_",4) := .data[[paste0(arm,"_",1)]]   
                         .data[[paste0(arm,"_",2)]]   
                        .data[[paste0(arm,"_",3)]]) %>%
    arrange(col1,col2)

-output

 col1 col2 ARM_4
1    A    X    12
2    B    Y    15
3    C    Z    18
  • Related