Home > database >  String replacing in a data frame based on another data frame
String replacing in a data frame based on another data frame

Time:09-30

The problem

I am a scholar who needs to anonymise a large data frame of tweets to keep on my research. In this data frame, there are over 279280 rows each containing the original tweet text columns with a variety of metadata.

Here is a sample of my data:

structure(list(text = c("@Rod comentem aqui utilizando  #BrequeDosApps #AmanhaTemBrequedosApps  \nNão pode ser só as tags sozinhas pq vira spam!!", 
"@Roderick #BrequeDosApps ✊           
  • Related