
Out of memory issue while reading a large number of source files #1752

Open

padesh opened this issue Oct 17, 2024 · 6 comments

padesh commented Oct 17, 2024

Hi Team,

I am running noise cross-correlation simulations for coupled acoustic-elastic media. When I have a large number of external source files (60k files, each with 60k time steps), the simulation breaks with an out-of-memory error at the step where the solver reads the source files. My question is: are these source files read on a single node, or are they read in parallel? If they are read by a single node, increasing the number of nodes would not help in this case.

Since almost half of the values in the later part of each STF file are zeros, could the program inject zeros for the extra steps when NSTEP is greater than the number of lines in the STF file? That would help trim down the total number of lines in each STF file and its size.

Or do you have any other suggestions?

The STF files are in binary format.

Thanks.

danielpeter (Member) commented

For external source time functions, all MPI processes allocate the same array containing all sources and all time steps.

In your case, the size of this array becomes ~13 GB:

60000 * 60000 * 4 / 1024. / 1024. / 1024. = 13.411 GB
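
The same back-of-the-envelope estimate as a tiny script, for plugging in other numbers (the helper name is just for illustration; it assumes single-precision values and one full copy of the array per MPI rank):

```python
# Size of the external-STF array that each MPI process allocates,
# assuming 4-byte (single-precision) values and a full NSOURCES x NSTEP copy.
def stf_array_gib(n_sources: int, n_steps: int, bytes_per_value: int = 4) -> float:
    """Return the array size in GiB for one MPI process."""
    return n_sources * n_steps * bytes_per_value / 1024.0**3

print(f"{stf_array_gib(60_000, 60_000):.3f} GiB per process")  # -> 13.411 GiB per process
```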

The number of time steps in the external STF file must be at least the number of time steps of the simulation. There is no fallback to zeros if the file is shorter; instead, the simulation would break.

The only help provided is that the solver can read binary files. You could store those files in binary format (***.bin file) to speed up the reading.

padesh (Author) commented Oct 28, 2024

Thanks Peter.
I am already using the .bin format for the source files.

So there is a limit on the number of time steps and sources that can be used, irrespective of the number of nodes.

danielpeter (Member) commented

Since you already run multiple MPI processes on a single node, what if you spread the processes out onto more compute nodes and use fewer MPI processes per node? 13 GB doesn't sound like an awful lot for a single compute node's memory.

padesh (Author) commented Oct 28, 2024

You mean I should allocate more nodes (I have 36 cores per node), set tasks_per_node < 36 (say 30), and then call srun xspecfem3D for the solver?

danielpeter (Member) commented

Yes. Find out how much memory you have per node and then estimate how many MPI processes you can run on a single node, taking into account that each process will also require additional memory for other arrays (mesh, seismograms, etc.).
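
For illustration, a minimal sketch of that estimate (the 192 GiB of node memory and the 2 GiB allowance for the other arrays are placeholder assumptions, not measured values):

```python
# Rough estimate of how many MPI ranks fit into one node's memory when each
# rank holds the full external-STF array plus its other arrays (mesh,
# seismograms, etc.). All numbers below are placeholders.
def max_ranks_per_node(node_mem_gib: float, stf_gib: float, other_gib: float) -> int:
    """Return how many MPI processes fit into one node's memory."""
    return int(node_mem_gib // (stf_gib + other_gib))

n = max_ranks_per_node(node_mem_gib=192.0, stf_gib=13.411, other_gib=2.0)
print(f"at most ~{n} ranks per node")  # -> at most ~12 ranks per node, well below 36 cores
```

With SLURM, that number (or fewer) would then go into the tasks-per-node setting for the srun xspecfem3D step, keeping in mind that the total number of MPI processes across all nodes still has to match the number of mesh slices the run was set up for.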

padesh (Author) commented Oct 29, 2024

Thanks, let me try this and come back with an update.
