grid_stat output sometimes gets delayed? #1100
-
In a sequential job which ran grid_stat in a loop of forecast hours, I found sometimes/occasionally grid_stat output got delayed. Attached is a sample log file. On L4563, 'ls' command lists all stat files which are generated by grid_stat. On this line it shows only 9 stat files are present/listed. That means though stat files are submitted to grid_stat, they are possibly delayed. Then it causes consequential errors. To confirm my guess, I reran the case and everything was all right. Thank you for your help! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
@YaliMao-NOAA I see you have a question about the output being generated by the script you're running to call Grid-Stat. Thanks for sending the log output. Yes, I do see the behavior you describe... the 'ls' command on line 4563 only shows the existence of 9 grid_stat .stat output files. But searching lines 1 - 4563, there are supposed to be 24 of them. I did find something a little concerning in this log file that might be related. I only see one instance of a 'cd' (change directory) command in this log file:
So I ASSUME that all commands are being run from this directory, but that may or may not be true. When I grep this log file for "Output file", I see that grid_stat is generating 288 output files. However, only 72 of those are unique file names!
If you configure/run Grid-Stat to write to the same output file name, the contents of the new file will clobber the old file. I do see that you're already using the "output_prefix" option to customize the output file names to avoid them being clobbered, but perhaps your output_prefix setting is not sufficient to do so? Now I haven't traced through the log message to back out exactly what you script is doing, but I'd recommend that you take a close look at the uniqueness of the output file names before debugging any further. Perhaps you are changing directories to some other location, but your script is not reporting that? Or perhaps grid_stat really is clobbering it's previously generated output files and that messing up your 'ls' counting logic? To make the behavior more clear, I'd recommend setting "-out_dir" to an absolute path instead of a relative one, like the current directory ("."). As we talked about in the METplus NOAA telecon this morning, another option is setting this up as a METplus use case. Please let me know what you find. |
Beta Was this translation helpful? Give feedback.
@YaliMao-NOAA I see you have a question about the output being generated by the script you're running to call Grid-Stat. Thanks for sending the log output.
Yes, I do see the behavior you describe... the 'ls' command on line 4563 only shows the existence of 9 grid_stat .stat output files. But searching lines 1 - 4563, there are supposed to be 24 of them.
I did find something a little concerning in this log file that might be related. I only see one instance of a 'cd' (change directory) command in this log file:
So I ASSUME that all commands are being run from this directory, but that may or may not be true. When I gr…