Percent time for MPI functions #497
-
Hello, I am using Caliper from GEOS and I receive this output from "-t mpi-report".
Does it mean that MPI_Barrier, MPI_Comm_dup, etc. took 43.63%, 43.47%, etc. of the total wall-clock time, or of the MPI time? Thanks!
-
Hi @drmichaeltcvx, the percent total column is relative to the total program time, including non-MPI time, across all processes. However, it looks like non-MPI time (that's the row without a label) was only 0.63% of the total time, which is rather low. Judging from the high average and max times and the low minimum time in MPI_Barrier() and MPI_Comm_dup(), it looks like most processes were stuck in those collective calls waiting for just one or a few processes to arrive, so there seems to be some serious load imbalance in the program.
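To make that reading concrete, here is a minimal Python sketch of how such a table could be derived from per-rank times. It is not Caliper code: the `summarize` helper, the four-rank layout, and all numbers are hypothetical, and it assumes "percent total" is the time summed over all ranks for one row divided by the time summed over all ranks and all rows.

```python
# Hypothetical per-rank times in seconds (4 ranks); NOT taken from the report in this thread.
# The empty label stands for the unlabeled row: time spent outside any MPI call.
times = {
    "MPI_Barrier": [9.0, 9.0, 9.0, 0.0],   # ranks 0-2 wait for rank 3
    "MPI_Comm_dup": [0.5, 0.5, 0.5, 0.5],
    "": [0.5, 0.5, 0.5, 9.5],              # rank 3 does almost all of the compute work
}

def summarize(times):
    """Return min/avg/max per row plus its share of the total time over all ranks."""
    total = sum(sum(per_rank) for per_rank in times.values())
    rows = {}
    for name, per_rank in times.items():
        rows[name] = {
            "min": min(per_rank),
            "avg": sum(per_rank) / len(per_rank),
            "max": max(per_rank),
            # Assumption: percent is relative to total program time, not to MPI time only.
            "pct_total": 100.0 * sum(per_rank) / total,
        }
    return rows

rows = summarize(times)
for name, r in rows.items():
    label = name or "(non-MPI)"
    print(f"{label:14s} min={r['min']:5.2f} avg={r['avg']:5.2f} "
          f"max={r['max']:5.2f} pct={r['pct_total']:6.2f}")
```

In this made-up case the barrier row shows a near-zero minimum next to a high average and maximum: the one rank with the near-zero barrier time is the slow one, and the others are waiting for it. That is the same signature as described above for an imbalanced run.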
-
Quick question @daboehme: should the mpi-report line without an MPI function name in the first column be interpreted as the time the code spent outside any MPI call? Thanks.