Home > Software design >  How to remove special characters,unicode emojis in pyspark?
How to remove special characters,unicode emojis in pyspark?

Time:11-07

Good afternoon everyone, I have a problem to clear special characters in a string column of the dataframe, I just want to remove special characters like html components, emojis and unicode errors, for example \u2013.

does anyone have an regular expression to help me? Or any suggestions on how to treat this problem?

input:

i want to remove            
  • Related