I don't understand what Spark is. I've searched and read a lot of results, but I still don't get it.
What is it for, in what situations is it used, and what effect does it have?
PS: I hear Juno will also support Spark features, and with both cloud computing and Spark getting attention, I'm keeping an eye on it.
Confused.
CodePudding user response:
Spark is a new-generation distributed big data processing framework that came after Hadoop. It is an extensible platform for data analysis that integrates primitives for in-memory computing, so it has a performance advantage over Hadoop's cluster storage approach. Spark is implemented in Scala and uses the language to provide a unique environment for data processing. Spark is a new member of the growing family of big data analysis solutions and has attracted a lot of attention. It not only provides an effective framework for the distributed processing of data sets, but also processes distributed data sets in an efficient way (through simple Scala scripts). Spark and Scala are both still under active development. However, because they have been adopted at key Internet properties, both seem to have moved from being interesting open source software to becoming foundational Web technologies.
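To make the "short script over a data set held in memory" idea a bit more concrete, here is a minimal sketch of a word count written as a chain of map and reduce-by-key steps. It is plain Python and does not use Spark at all; `flat_map`, `reduce_by_key`, and the sample `lines` are hypothetical names made up for this illustration, and everything runs on one machine rather than a cluster.

```python
from collections import defaultdict
from functools import reduce


def flat_map(func, records):
    """Apply func to each record and flatten the results into one list."""
    return [item for record in records for item in func(record)]


def reduce_by_key(func, pairs):
    """Combine all values that share a key using func."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return {key: reduce(func, values) for key, values in grouped.items()}


# Hypothetical sample data standing in for a distributed data set.
lines = ["spark scala spark", "hadoop scala", "spark"]

words = flat_map(str.split, lines)                  # split every line into words
pairs = [(word, 1) for word in words]               # map each word to (word, 1)
counts = reduce_by_key(lambda a, b: a + b, pairs)   # sum the counts per word

print(counts["spark"], counts["scala"], counts["hadoop"])  # prints: 3 2 1
```

The intermediate results (`words`, `pairs`) stay in memory between steps instead of being written out and read back, which is the general idea behind the performance claim above. A real Spark job would express similar steps through Spark's own API and spread the work across many machines.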
CodePudding user response:
A few of the replies above look like they were copied from Baidu. In practice, you can think of Spark as a replacement for the database you used before. You used to store your data in Oracle or MySQL, and now you want to store it in Spark. So where does Spark keep it? In memory. It is a move from a relational database to a non-relational one.
I have only just started learning, so this is just my own understanding. It is probably roughly like this, though.
CodePudding user response:
A data processing framework.
CodePudding user response:
See the introduction on the website: http://spark.apache.org/
CodePudding user response:
It is a new data computing framework that brings together Hadoop MR, Spark SQL with Hive integration, graph computation (GraphX), machine learning (ML), and stream computation (Spark Streaming).