Home > other >  RDD simple statistics about two groups of data
RDD simple statistics about two groups of data

Time:10-11

Data: data structures: a timestamp, provinces, cities, users, advertising, the middle field using a whitespace-delimited all codes when the last field is the AD id
The data content:
1516609143867 June 7 64 16
1516609143869 September 4, 75 18
1516609143869 1 July 87 12
1516609143869 8, 92 9

Using scala code implementation USES sparkRDD calculate the total number of each AD was clicked in each province
  • Related