Based on the Hadoop platform architecture components and multi-dimensional data acquisition, data consistency check, an invalid value and the default value of distributed computing, the processing of distributed storage system, data warehouse
Library such as the comprehensive application ability, the use of Java, Python development language, such as data cleaning, data storage, data conversion, data analysis, data to predict and a series of data operation and data push
Several table data and outlier handling
Through the common data analysis algorithm, the data is standardized, discretization and normalize analysis
Master data warehouse import, export and use of the data warehouse related commands or code to achieve data multidimensional, multi-level analysis
For data query, sorting and calculating, compiled, packaging, distribution, executable program, complete the data processing, cleaning,
Realize the file transfer and conversion between different database data forecast analysis