Not a very good idea, bosses help solve-CodePudding

Given two tables, each table of each item of A data contain the same length, as shown in table (1), each data contains four items, the contents of the data item Numbers for T1 for A, B, C, D, and the content of each item of data according to the dictionary sequence sorting, if A data in the table (1) and A data in table (2) k - 1 item of the same, before the last item, no data in the table (assuming the two have exactly the same item), then the two data can be connected to A new data with k + 1 item, such as the T1 and table in table (1) (2) of the T1, the first 3 items are the same, the last item is different, can get A new data connection. A, B, C, D, E, connect the two tables are full, can get A list of each item is 5, for example, the connection table (1) and (2) be able to get table (3),
Table (1) the sample data
Item ID
T1 A, B, C, D
T2. A, B, E, F
T3, A, C, E, F
T4 B, C, D, E,
...

Sample data tables (2)
Item ID
T1 A, B, C, E
T2. A, B, D, F
T3, A, C, E, K
T4 A, C, D, E,
...

Table (3) after the connection table
Item ID
T1 A, B, C, D, E,
T2. A, C, E, F, K
...

In the traditional table join operations, requires the time complexity is O (n2),
Requirements: design algorithm, and reduce the complexity of the join operation, improve efficiency,

CodePudding user response:

What is item? Four fields separated by commas or a field content?

CodePudding user response:

Is a field of content

CodePudding user response:

Since is the database, or should be how to make effective index, effective as long as the index to reduce the complexity of O (n),

Can consider to intercept respectively before 3 items and the last item, establish joint index, and then use the explain see scan lines

CodePudding user response:

If you don't consider the index, but the problem can be converted to merge two order table and remove duplicate elements, have what good method?

CodePudding user response:

The same, also direct merger have to find out the first three of the same data, hide not open connections,

As long as doing connection driver table O (n) is necessary, optimization is to find a way to avoid traverse table driven into O (n2), so the core still make effective index

Connection and then delete the duplicates will only more slowly

CodePudding user response:

Before my understanding of this subject is based on ID to connect the two tables, after the connection to merge, I this should understanding is wrong, please bosses to talk about your overall train of thought?

CodePudding user response:

The complexity of the convenient to explain the n square is how come?

CodePudding user response:

Imagine table 1 in table 2 are two text files, if let you manually for this match, what do you do,

Removed first table 1 first, before going to the second list to find three same, if don't have to search, then read the data in table 2 from the beginning again, this complexity is n, the data in table 1 is n times, each one all check is n, n x

And then you can find using the search, is to take the first of A, B, C to directly search in table 2, 4 and see after finding matches, this operation according to the big O notation is O (1), ignored, table 1 data after each take check complexity is n,

Database to remove all kinds of optimization, the basic principle of connection is the same with the train of thought,

CodePudding user response:

Well ok, understand, bosses, is there no additional space or take up less space extra way?

CodePudding user response:

No additional footprint is the reconstruction of table 2, interception of first three Settings made a field and a primary key, and item 4 as the second field

CodePudding user response:

references to the tenth floor will respond to the heavy rain:

no additional footprint is the reconstruction of table 2, the interception first three Settings made a field and a primary key, and as the second field 4

Table 1 need for processing, please?

CodePudding user response:

Table 1 didn't have to deal with

CodePudding user response:

The

reference 14 floor will respond to the heavy rain:

table 1 didn't have to deal with

Bosses, you can tell me something about this approach in more detail?