From my corpus I get a frequency for each word, and I would like to convert that frequency to a number between 0 and 10.
If a word appears 50 times its score should be closer to 10, and if a word appears 5 times its score should be closer to 1.
The output should be a score for each word, for example:
word   number_of_occurrence  score
and    200                   10
png    2                     1
where  50                    6
news   120                   7
CodePudding user response:
If you want to rate term frequencies in a corpus, I suggest reading this Wikipedia article:
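Separately from that article, here is a minimal sketch of one way to squeeze raw counts into a fixed range. It assumes log-scaled min-max normalisation onto 1 to 10, which is my own choice and not something the question specifies; the function name `scale_counts` and its `low`/`high` parameters are made up for illustration. The log keeps very frequent words such as "and" from flattening everything else to the bottom of the scale. The resulting scores will not necessarily match the hand-picked numbers in the question's table.

```python
import math


def scale_counts(counts, low=1.0, high=10.0):
    """Map raw counts onto [low, high] using log-scaled min-max normalisation.

    Hypothetical helper: assumes every count is >= 1 (math.log fails on 0).
    """
    if not counts:
        return {}
    # Take logs so that a few very frequent words do not dominate the scale.
    logs = {word: math.log(n) for word, n in counts.items()}
    lo, hi = min(logs.values()), max(logs.values())
    if hi == lo:
        # All words occur equally often; there is no spread to normalise.
        return {word: high for word in counts}
    span = hi - lo
    return {
        word: low + (high - low) * (value - lo) / span
        for word, value in logs.items()
    }


counts = {"and": 200, "png": 2, "where": 50, "news": 120}
scores = scale_counts(counts)
for word, n in counts.items():
    print(word, n, round(scores[word]))
```

The least frequent word always gets `low` and the most frequent gets `high`. If you want a linear scale instead, drop the `math.log` call and normalise the raw counts directly.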