Hive parameters

Time:10-05

Part 1: Hive parameters
hive.exec.max.created.files
• Description: maximum number of files that all map and reduce tasks of a Hive job may create
• Default value: 100000
hive.exec.dynamic.partition
• Description: whether dynamic partitioning is enabled
• Default value: false
hive.mapred.reduce.tasks.speculative.execution
• Description: whether speculative execution is enabled for reduce tasks
• Default value: true
hive.input.format
• Description: Hive's default input format
• Default value: org.apache.hadoop.hive.ql.io.CombineHiveInputFormat
• If this causes problems, use org.apache.hadoop.hive.ql.io.HiveInputFormat instead
hive.exec.counters.pull.interval
• Description: interval at which Hive pulls counter information from the JobTracker
• Default value: 1000 ms
hive.script.recordreader
• Description: default record reader class used when running scripts
• Default value: org.apache.hadoop.hive.ql.exec.TextRecordReader
hive.script.recordwriter
• Description: default record writer class used when running scripts
• Default value: org.apache.hadoop.hive.ql.exec.TextRecordWriter
hive.mapjoin.check.memory.rows
• Description: number of rows processed between memory-usage checks during a map join
• Default value: 100000
hive.mapjoin.smalltable.filesize
• Description: file size threshold (in bytes) for the small table; if the small table is below this value, Hive tries to convert the common join into a map join
• Default value: 25000000
hive.auto.convert.join
• Description: whether to convert a common join into a map join based on the input file size
• Default value: false
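The two settings above work together. As a rough illustration, here is a minimal Python sketch of the size check only. The function name and parameters are hypothetical, and the real Hive planner takes more into account than this single comparison.

```python
def is_map_join_candidate(small_table_bytes: int,
                          auto_convert_join: bool = False,
                          smalltable_filesize: int = 25_000_000) -> bool:
    """Simplified model of the size check (hypothetical helper, not a Hive API).

    auto_convert_join   mirrors hive.auto.convert.join (default false above)
    smalltable_filesize mirrors hive.mapjoin.smalltable.filesize, in bytes
    """
    # Conversion is only considered when the feature is switched on
    # and the small table is below the size threshold.
    return auto_convert_join and small_table_bytes < smalltable_filesize


if __name__ == "__main__":
    # A 10 MB table with auto conversion enabled is a candidate.
    print(is_map_join_candidate(10_000_000, auto_convert_join=True))
    # The same table with the default (false) is not.
    print(is_map_join_candidate(10_000_000))
```

With the defaults listed above, no join is converted until hive.auto.convert.join is set to true.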
hive.mapjoin.followby.gby.localtask.max.memory.usage
• Description: when a map join is followed by a group by, the fraction of memory the local task may use to hold data; if the data is too large, it is not kept in memory
• Default value: 0.55
hive.mapjoin.localtask.max.memory.usage
• Description: fraction of memory the local task may use
• Default value: 0.90
hive.heartbeat.interval
• Description: interval at which a heartbeat is sent during map join and filter operations
• Default value: 1000
hive.merge.size.per.task
• Description: size of the merged files (in bytes)
• Default value: 256000000
hive.mergejob.maponly
• Description: whether to use a map-only job to merge the output
• Default value: true
hive.merge.mapredfiles
• Description: whether to merge small files at the end of a map-reduce job
• Default value: false
hive.merge.mapfiles
• Description: whether to merge small files at the end of a map-only job
• Default value: true
hive.hwi.listen.host
• Description: default host the Hive Web Interface listens on
• Default value: 0.0.0.0
hive.hwi.listen.port
• Description: port the Hive Web Interface listens on
• Default value: 9999
hive.exec.parallel.thread.number
• Description: maximum number of jobs Hive can execute in parallel
• Default value: 8
hive.exec.parallel
• Description: whether jobs are submitted in parallel
• Default value: false
hive.exec.compress.output
• Description: whether the output is compressed
• Default value: false
hive.mapred.mode
• Description: restriction mode for MapReduce operations; in nonstrict mode, operations are not restricted
• Default value: nonstrict
hive.join.cache.size
• Description: number of rows a join operation may cache in memory
• Default value: 25000
hive.mapjoin.cache.numrows
• Description: number of rows a map join may cache in memory
• Default value: 25000
hive.join.emit.interval
• Description: number of rows Hive buffers before emitting the join result
• Default value: 1000
hive.optimize.groupby
• Description: whether to take advantage of bucketed tables when doing group-by aggregation
• Default value: true
hive.fileformat.check
• Description: whether to check the input file format
• Default value: true
hive.metastore.client.connect.retry.delay
• Description: interval between retries after a client connection fails
• Default value: 1 second
hive.metastore.client.socket.timeout
• Description: client socket timeout
• Default value: 20 seconds
mapred.reduce.tasks
• Default value: 1
• Description: default number of reduce tasks per job; -1 means the number of reducers is set automatically based on the job
hive.exec.reducers.bytes.per.reducer
• Default value: 1000000000 (1 GB)
• Description: amount of data each reducer handles; for example, 10 GB of data going into the reduce stage produces 10 reduce tasks
hive.exec.reducers.max
• Default value: 999
• Description: maximum number of reducers
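The three reducer settings above interact when the reducer count is left to Hive. The following Python sketch models that estimate under stated assumptions: the function and parameter names are made up for illustration, the formula is the commonly described "input size divided by bytes per reducer, capped at the maximum", and the actual behavior may differ between Hive versions.

```python
def estimate_reducers(input_bytes: int,
                      bytes_per_reducer: int = 1_000_000_000,
                      max_reducers: int = 999,
                      reduce_tasks: int = -1) -> int:
    """Estimate the number of reduce tasks (illustrative helper, not a Hive API).

    bytes_per_reducer mirrors hive.exec.reducers.bytes.per.reducer
    max_reducers      mirrors hive.exec.reducers.max
    reduce_tasks      mirrors mapred.reduce.tasks (-1 means "decide automatically")
    """
    # An explicit positive setting wins over the estimate.
    if reduce_tasks > 0:
        return reduce_tasks
    # Integer ceiling division avoids floating-point rounding on large sizes.
    estimated = (input_bytes + bytes_per_reducer - 1) // bytes_per_reducer
    # Use at least one reducer and never more than the configured maximum.
    return max(1, min(max_reducers, estimated))


if __name__ == "__main__":
    print(estimate_reducers(10_000_000_000))     # 10 GB of input
    print(estimate_reducers(5_000_000_000_000))  # 5 TB of input, hits the cap
```

With the defaults listed above, 10 GB of input gives 10 reducers, matching the example in the description, and very large inputs are capped at 999.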
hive.metastore.warehouse.dir
• Default value: /user/hive/warehouse
• Description: default location of the warehouse database
hive.default.fileformat
• Default value: TextFile
• Description: default file format
hive.map.aggr
• Default value: true
• Description: map-side aggregation, equivalent to a combiner
hive.exec.max.dynamic.partitions.pernode
• Default value: 100
• Description: maximum number of dynamic partitions each task node may create
hive.exec.max.dynamic.partitions
• Default value: 1000
• Description: maximum total number of dynamic partitions that may be created
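The two dynamic partition limits above apply at different levels, one per node and one in total. A small Python sketch of checking a planned load against both limits follows. The helper and its inputs are hypothetical and only model the limits as described here.

```python
from typing import List


def check_dynamic_partition_limits(partitions_per_node: List[int],
                                   max_per_node: int = 100,
                                   max_total: int = 1000) -> List[str]:
    """Return a list of limit violations (illustrative helper, not a Hive API).

    partitions_per_node holds the partition count each task node would create.
    max_per_node mirrors hive.exec.max.dynamic.partitions.pernode
    max_total    mirrors hive.exec.max.dynamic.partitions
    """
    problems = []
    # Per-node limit: each node is checked on its own.
    for index, count in enumerate(partitions_per_node):
        if count > max_per_node:
            problems.append(f"node {index}: {count} > per-node limit {max_per_node}")
    # Total limit: this sketch assumes the per-node sets do not overlap,
    # so the sum is an upper bound on the number of distinct partitions.
    total = sum(partitions_per_node)
    if total > max_total:
        problems.append(f"total: {total} > overall limit {max_total}")
    return problems


if __name__ == "__main__":
    for line in check_dynamic_partition_limits([50, 120, 30]):
        print(line)
```

An empty result means the planned load fits within both defaults. Otherwise, either raise the corresponding setting or reduce the number of partitions written per job.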
hive.metastore.server.max.threads
• Default value: 100000
• Description: maximum number of worker threads in the metastore
hive.metastore.server.min.threads
• Default value: 200
• Description: minimum number of worker threads in the metastore

CodePudding user response:

Nice, very detailed write-up.

CodePudding user response:

Just stopping by to collect points, and bumping the thread while I'm here.