I want to do a topic, topic is,
Based on S3a Hadoop data analysis software, based on the Hadoop framework development S3a plug-in, provide a read cache capacity, a sharp rise in the large data analysis bandwidth (& gt; 300 m/s), what is want me to do what, I just set the hadoop completely distributed environment, other what also don't understand, elder sister want to please give me some ideas, so how to start to do something,