CDH cluster installation: agent reports "ERROR Failed to connect to previous supervisor" at startup

26/Jun/2017 22:42:57 +0000 3245 MainThread agent ERROR Failed to connect to previous supervisor.
Traceback (most recent call last):
  File "/opt/cm-5.7.1/lib64/cmf/agent/build/env/lib/python2.6/site-packages/cmf-5.7.1-py2.6.egg/cmf/agent.py", line 2037, in find_or_start_supervisor
    self.get_supervisor_process_info()
  File "/opt/cm-5.7.1/lib64/cmf/agent/build/env/lib/python2.6/site-packages/cmf-5.7.1-py2.6.egg/cmf/agent.py", line 2183, in get_supervisor_process_info
    self.identifier = self.supervisor_client.supervisor.getIdentification()
  File "/usr/lib64/python2.6/xmlrpclib.py", line 1199, in __call__
    return self.__send(self.__name, args)
  File "/usr/lib64/python2.6/xmlrpclib.py", line 1489, in __request
    verbose=self.__verbose
  File "/opt/cm-5.7.1/lib64/cmf/agent/build/env/lib/python2.6/site-packages/supervisor-3.0-py2.6.egg/supervisor/xmlrpc.py", line 460, in request
    self.connection.request('POST', handler, request_body, self.headers)
  File "/usr/lib64/python2.6/httplib.py", line 936, in request
    self._send_request(method, url, body, headers)
  File "/usr/lib64/python2.6/httplib.py", line 973, in _send_request
    self.endheaders()
  File "/usr/lib64/python2.6/httplib.py", line 930, in endheaders
    self._send_output()
  File "/usr/lib64/python2.6/httplib.py", line 802, in _send_output
    self.send(msg)
  File "/usr/lib64/python2.6/httplib.py", line 761, in send
    self.connect()
  File "/usr/lib64/python2.6/httplib.py", line 742, in connect
    self.timeout)
  File "/usr/lib64/python2.6/socket.py", line 567, in create_connection
    raise error, msg
error: [Errno 111] Connection refused
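
The last frames show the agent's XML-RPC request to supervisord being refused at the TCP level, which means nothing was accepting connections on the supervisor's address at that moment. A minimal sketch for checking this by hand follows. It assumes bash (for its /dev/tcp redirection), and the address 127.0.0.1:19001 in the usage comment is an assumption based on the usual Cloudera Manager agent setup, not something stated in this thread. `port_open` is a hypothetical helper name.

```shell
#!/usr/bin/env bash
# port_open HOST PORT: succeed if a TCP connection to HOST:PORT can be opened.
# Relies on bash's built-in /dev/tcp redirection, so no extra tools are needed.
port_open() {
    local host=$1 port=$2
    # The subshell opens the socket on fd 3; the fd is closed when it exits.
    (exec 3<>"/dev/tcp/${host}/${port}") 2>/dev/null
}

# Usage (address and port are assumptions, adjust to your supervisord.conf):
#   if port_open 127.0.0.1 19001; then
#       echo "supervisord is accepting connections"
#   else
#       echo "connection refused or unreachable"
#   fi
```

If the check fails while a supervisord process still shows up in `ps`, the process is probably stale, which matches the fix reported later in the thread.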

CodePudding user response:

Some people say this happens when the hostname and IP address do not match. Here is my configuration:
[root@master ~]# hostname
master
[root@master ~]# more /etc/hosts
127.0.0.1 localhost localhost.localdomain localhost4 localhost4.localdomain4
::1 localhost localhost.localdomain localhost6 localhost6.localdomain6
192.168.40.4 master
192.168.40.5 slave01
192.168.40.6 slave02
[root@master ~]# more /etc/networks
default 0.0.0.0
loopback 127.0.0.0
link-local 169.254.0.0
[root@master ~]#
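
The hostname-to-IP check described above can be sketched as a small script: look up the host's own name in a hosts file and confirm it maps to a routable address rather than a loopback one. This is only an illustration. `hosts_ip_for` and `is_loopback` are hypothetical helper names, and the match is case-sensitive for simplicity.

```shell
#!/usr/bin/env bash
# hosts_ip_for FILE NAME: print the first address that FILE maps NAME to.
hosts_ip_for() {
    local file=$1 name=$2
    awk -v name="$name" '
        { sub(/#.*/, "") }               # strip comments before matching
        { for (i = 2; i <= NF; i++)      # fields after the address are names
              if ($i == name) { print $1; exit } }
    ' "$file"
}

# is_loopback IP: succeed if IP is an IPv4 or IPv6 loopback address.
is_loopback() {
    case $1 in
        127.*|::1) return 0 ;;
        *)         return 1 ;;
    esac
}

# Usage:
#   ip=$(hosts_ip_for /etc/hosts "$(hostname)")
#   if [ -z "$ip" ]; then echo "hostname not found in /etc/hosts"
#   elif is_loopback "$ip"; then echo "hostname maps to loopback: $ip"
#   else echo "hostname maps to $ip"; fi
```

With the /etc/hosts shown in this reply, the lookup for `master` yields 192.168.40.4, so this particular configuration passes the check.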

CodePudding user response:

[root@master ~]# more /etc/sysconfig/network
NETWORKING=yes
HOSTNAME=master

CodePudding user response:

I have spent half a day on this and still can't find the problem. Could an expert please help?

CodePudding user response:

Did you restart?

CodePudding user response:

Did the original poster ever find the cause?
I'm getting the same error here and really don't know what to do.

CodePudding user response:

After the agent server was restarted, this error started appearing, and it only goes away if I delete the host and add it again. Has anyone found a solution?

CodePudding user response:

I haven't solved it either. Restarting and reinstalling didn't help, and it's giving me a headache.

CodePudding user response:

The supervisor leaves residual configuration behind when it is restarted. You need to clear the corresponding files and then restart.

CodePudding user response:

[root@slave1 cloudera-scm-agent]# ps aux | grep super
root 48 0.0 0.0 0 0 ? S May08 0:02 [sync_supers]
root 13430 0.0 0.3 202496 12472 ? Ss May09 /usr/lib64/cmf/agent/build/env/bin/python /usr/lib64/cmf/agent/build/env/bin/supervisord
root 80096 0.0 0.0 103256 812 pts/0 R+ 17:31 0:00 grep super
[root@slave1 cloudera-scm-agent]# kill -9 13430
[root@slave1 cloudera-scm-agent]# /opt/cm-5.14.2/etc/init.d/cloudera-scm-agent start
Starting cloudera-scm-agent: [ OK ]
[root@slave1 cloudera-scm-agent]# /opt/cm-5.14.2/etc/init.d/cloudera-scm-agent status
cloudera-scm-agent (pid 80147) is running...
[root@slave1 cloudera-scm-agent]# /opt/cm-5.14.2/etc/init.d/cloudera-scm-agent status
cloudera-scm-agent (pid 80147) is running...
[root@slave1 cloudera-scm-agent]# /opt/cm-5.14.2/etc/init.d/cloudera-scm-agent status
cloudera-scm-agent (pid 80147) is running...
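
The session above finds the leftover supervisord by eye, kills it, and starts the agent again. A minimal sketch of the lookup step follows, assuming the supervisord binary lives under a `cmf/agent/` directory as in the `ps` output above. `supervisord_pids` is a hypothetical helper name, and the function only prints PIDs; it does not kill anything.

```shell
#!/usr/bin/env bash
# supervisord_pids: read "PID COMMAND..." lines on stdin and print the PID of
# every process whose command line runs the agent's supervisord.
supervisord_pids() {
    awk '$0 ~ /cmf\/agent\/.*\/supervisord/ { print $1 }'
}

# Usage (try a plain kill first; the thread used kill -9):
#   for pid in $(ps -eo pid=,args= | supervisord_pids); do
#       kill "$pid"
#   done
#   /opt/cm-5.14.2/etc/init.d/cloudera-scm-agent start
```

Reading from stdin keeps the filter separate from `ps`, so the pattern can be checked against saved output before any process is signalled.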

CodePudding user response:

Clearing the information stored in ZooKeeper might help.

CodePudding user response:

Problem: the system went down abnormally, and after it came back up a temporary mount point could not be accessed.

 for mnt in $(mount | cut -d ' ' -f 3); do stat $mnt 1>/dev/null 2>&1; rc=$?; if [ $rc -ne 0 ]; then echo "error accessing $mnt"; fi; done
error accessing /run/user/42/gvfs

Solution:

 umount /run/user/42/gvfs
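
The one-line loop above can also be written as a function that is easier to read and reuse. This is only a sketch of the same idea: `check_paths` is a hypothetical helper name, and like the original it assumes mount points contain no spaces when fed from `mount | cut`.

```shell
#!/usr/bin/env bash
# check_paths: read one mount point per line on stdin and report every
# path that stat cannot access.
check_paths() {
    local mnt
    while IFS= read -r mnt; do
        [ -n "$mnt" ] || continue            # skip empty lines
        if ! stat -- "$mnt" >/dev/null 2>&1; then
            echo "error accessing $mnt"
        fi
    done
}

# Usage:
#   mount | cut -d ' ' -f 3 | check_paths
```

Any path it reports is a candidate for `umount`, as was done for the mount point above.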

CodePudding user response:

Quoting reply #11 from github_38580358:

Problem: the system went down abnormally, and after it came back up a temporary mount point could not be accessed.
 for mnt in $(mount | cut -d ' ' -f 3); do stat $mnt 1>/dev/null 2>&1; rc=$?; if [ $rc -ne 0 ]; then echo "error accessing $mnt"; fi; done
error accessing /run/user/42/gvfs
Solution:
 umount /run/user/42/gvfs

You experts are amazing! This solved my problem.