Home > other >  HPC multi-node parallel computing error
HPC multi-node parallel computing error

Time:11-23

In parallel computing is installed, MPICH3.3.2 testing example, using mpirun multi-node computing parallel machine test is successful, but parallel computing WRF. Exe always run for a while is interrupted, report a the following error:
D01 _00:2020-08-14 00:00 grid spacing, dt, time_step_sound=9000.000 27.00000 4
D01 _00:2020-08-14 00:00 call rk_step_prep
D01 _00:2020-08-14 00:00 calling inc/HALO_EM_A_inline inc
Fatal error in PMPI_Wait: Unknown error class, the error stack:
PMPI_Wait (203)... : MPI_Wait (request=0 x4de87f4, status=0 x7fff5ad13c20) failed
MPIR_Wait_impl (100)... :
MPIDU_Complete_posted_with_error (1137) : the Process failed

Is there a great god can see how to solve? Tried various methods, and change the version, modify ulmit parameter configuration is not
  • Related