Home > database >  Unstructured data query and real-time computation, eventually to structured data?
Unstructured data query and real-time computation, eventually to structured data?

Time:09-24

1. According to a pile of unstructured data, such as in the word the data stored in the form of table, want to query and the document's data form the common statistical analysis, if they were going to the unstructured data into structured data?

2. Suppose a scene
No.12, every year will receive a lot of colleges and universities reported data in word, the word is a part of student achievement data tables, like:
Under the background of the data I want to do a query function, users can set the query conditions, and setting calculation results (such as total number, average scores)

My question:
1. In order to achieve this function is still want to put the word document data in a structured storage?
2. Use big data architecture how to do?

CodePudding user response:

1, don't call the data in the strict sense of Word, can only be called text, usually practice when the Word first converted into Excel (CSV) format, for example, and then other processing
Report 2, depends on the Word file is the unified format, if so, can use the Word and Excel in small programs do batch processing,

As for you want to Excel structured data processing, it depends on the data itself and what kind of system solutions, if is T or P levels of data, basically do not need to be structured (but you need to format), now the big data platform is easy,
  • Related