Question: causes to get 'core' files

General scientific issues regarding ROMS

Moderators: arango, robertson

Post Reply
Message
Author
modzy

Question: causes to get 'core' files

#1 Unread post by modzy »

I did a successful mpi parallel, barotropic model run with a setup of a contant, say 500 m, bathymetry over the entire grid. I then doubled the number of vertical layers, say from the original 20 to 40 layers, and ran the model with reduced time step. This time after about two days of simulation, the run blew up which created a lot of cores files. However, it didn't save any 'nan' data into the perfect restart file as ROMS udually does in case of a blowing up due to nan. What might be the cause of the blowing up? Thanks!

User avatar
kate
Posts: 4091
Joined: Wed Jul 02, 2003 5:29 pm
Location: CFOS/UAF, USA

Re: Question: causes to get 'core' files

#2 Unread post by kate »

ROMS is checking for unphysically large values for rho and velocity, as well as checking for things like NaN. It will report, then kill the job if it finds a problem.

Like ROMS, the Unix/Linux kernel is checking for certain conditions. If it finds an ill-behaved program it will kill it and generate a core file (or many). Many people find these core files unhelpful and limit their size through the coredumpsize limit. Instead, you can recompile your code with the USE_DEBUG switch and potentially find out where in ROMS your job died by using a debugger on the core file. There's more than one way your program can by ill-behaved from the point of view of the kernel and I don't know which off hand.

Post Reply