The problem
I am a researcher who needs to anonymise a large data frame of tweets so that I can continue my research. The data frame has more than 279,280 rows, each containing the original tweet text along with several metadata columns.
Here is a sample of my data:
structure(list(text = c("@Rod comentem aqui utilizando #BrequeDosApps #AmanhaTemBrequedosApps \nNão pode ser só as tags sozinhas pq vira spam!!",
"@Roderick #BrequeDosApps ✊