mpirun has exited due to process rank 28 with PID 108752 on

General scientific issues regarding ROMS

Moderators: arango, robertson

Post Reply
Message
Author
JueNing
Posts: 3
Joined: Wed Aug 05, 2020 9:35 pm
Location: WHOI

mpirun has exited due to process rank 28 with PID 108752 on

#1 Unread post by JueNing »

Dear all,

When I run an ideal case of ROMS after got coawstG, I got the following error and don't know how to solve it.
Do you know how to solve this? Thanks in advance.

forrtl: error (78): process killed (SIGTERM)
Image PC Routine Line Source
coawstG 0000000000995E19 decode_line_ 815 inp_par.f90
coawstG 0000000000A3CC46 read_phypar_ 85 read_phypar.f90
coawstG 0000000000979A2D inp_par_ 91 inp_par.f90
coawstG 000000000041F10A ocean_control_mod 85 ocean_control.f90
coawstG 000000000041EDF3 MAIN__ 95 master.f90
coawstG 000000000041EB6C Unknown Unknown Unknown
libc.so.6 000000358481ED1D Unknown Unknown Unknown
coawstG 000000000041EA69 Unknown Unknown Unknown

forrtl: severe (174): SIGSEGV, segmentation fault occurred
Image PC Routine Line Source
coawstG 00000000009A1EFC load_s2d_ 1722 inp_par.f90
coawstG 0000000000A83CE5 read_phypar_ 1456 read_phypar.f90
coawstG 0000000000979A2D inp_par_ 91 inp_par.f90
coawstG 000000000041F10A ocean_control_mod 85 ocean_control.f90
coawstG 000000000041EDF3 MAIN__ 95 master.f90
coawstG 000000000041EB6C Unknown Unknown Unknown
libc.so.6 000000358481ED1D Unknown Unknown Unknown
coawstG 000000000041EA69 Unknown Unknown Unknown
--------------------------------------------------------------------------
mpirun has exited due to process rank 28 with PID 108752 on
node GUO exiting improperly. There are two reasons this could occur:

1. this process did not call "init" before exiting, but others in
the job did. This can cause a job to hang indefinitely while it waits
for all processes to call "init". By rule, if one process calls "init",
then ALL processes must call "init" prior to termination.

2. this process called "init", but exited without calling "finalize".
By rule, all processes that call "init" MUST call "finalize" prior to
exiting or it will be considered an "abnormal termination"

This may have caused other processes in the application to be
terminated by signals sent by mpirun (as reported here).
--------------------------------------------------------------------------

User avatar
kate
Posts: 4088
Joined: Wed Jul 02, 2003 5:29 pm
Location: CFOS/UAF, USA

Re: mpirun has exited due to process rank 28 with PID 108752 on

#2 Unread post by kate »

Because each of us runs with different cpp options, we get differing line numbers. You should look at your lines here:

Code: Select all

coawstG 00000000009A1EFC load_s2d_ 1722 inp_par.f90
coawstG 0000000000A83CE5 read_phypar_ 1456 read_phypar.f90
However, I suspect that you might be using a roms.in file which is inconsistent with your ROMS version. You have a ROMS version which needs NCLMFILES, NCLMFILES, and NSSFFILES. Do you have them in your roms.in?

JueNing
Posts: 3
Joined: Wed Aug 05, 2020 9:35 pm
Location: WHOI

Re: mpirun has exited due to process rank 28 with PID 108752 on

#3 Unread post by JueNing »

Hi Kate,

Thanks for your reply!!! I've solved this problem.

Best regard,
Jue

Post Reply