-
Notifications
You must be signed in to change notification settings - Fork 9
/
Copy pathshdom.txt
1632 lines (1486 loc) · 92 KB
/
shdom.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
778
779
780
781
782
783
784
785
786
787
788
789
790
791
792
793
794
795
796
797
798
799
800
801
802
803
804
805
806
807
808
809
810
811
812
813
814
815
816
817
818
819
820
821
822
823
824
825
826
827
828
829
830
831
832
833
834
835
836
837
838
839
840
841
842
843
844
845
846
847
848
849
850
851
852
853
854
855
856
857
858
859
860
861
862
863
864
865
866
867
868
869
870
871
872
873
874
875
876
877
878
879
880
881
882
883
884
885
886
887
888
889
890
891
892
893
894
895
896
897
898
899
900
901
902
903
904
905
906
907
908
909
910
911
912
913
914
915
916
917
918
919
920
921
922
923
924
925
926
927
928
929
930
931
932
933
934
935
936
937
938
939
940
941
942
943
944
945
946
947
948
949
950
951
952
953
954
955
956
957
958
959
960
961
962
963
964
965
966
967
968
969
970
971
972
973
974
975
976
977
978
979
980
981
982
983
984
985
986
987
988
989
990
991
992
993
994
995
996
997
998
999
1000
SHDOM - Spherical Harmonic Discrete Ordinate Method
for 3D Atmospheric Radiative Transfer
OVERVIEW
This implementation of SHDOM computes polarized monochromatic or
spectral band radiative transfer in a one, two, or three-dimensional
medium for either collimated solar and/or thermal emission sources of
radiation. The properties of the medium can be specified completely
generally, i.e. the extinction, single scattering albedo, coefficients
of the scattering phase matrix, and temperature for the particular
wavelength or spectral band may be specified at each input grid point.
SHDOM is superior to Monte Carlo radiative transfer methods when many
radiative quantities are desired, e.g. the radiance field across the
domain top. Radiances in any direction, hemispheric fluxes, net fluxes,
mean radiances, and net flux convergence (related to heating rates) may
be output anywhere in the domain. For highly peaked phase functions the
delta-M method may be chosen, in which case the radiance is computed
with an untruncated phase function single scattering correction. A
correlated k-distribution approach is used for the integration over a
spectral band. There may be uniform or spatially variable Lambertian
reflection and emission from the ground surface. Several types of
bidirectional reflection distribution functions (BRDF) for the surface
are implemented, and more may be added easily. SHDOM may be run on a
single processor or on multiple processors (e.g. an SMP machine or a
cluster) using the Message Passing Interface (MPI). SHDOM may be run
efficiently for unpolarized (scalar) radiative transfer by choosing
NSTOKES=1, or for polarized (vector) radiative transfer problems with
NSTOKES=3 or 4 Stokes parameters. SHDOM is implemented in Fortran, and
a Fortran 90 compiler is now required.
METHOD IN BRIEF
SHDOM uses an iterative process to compute the source function
(including the scattering integral) on a grid of points in space. The
angular part of the source function is represented with a spherical
harmonic expansion. Solving for the source function instead of the
radiance field saves memory, because there are often parts of a medium
where the source function is zero or angularly very smooth (hence few
spherical harmonic terms). The other reason for using spherical
harmonics is that the scattering integral is more efficiently computed
than in discrete ordinates. A discrete ordinate representation is used
in the solution process because the streaming of radiation is more
physically (and correctly) computed in this way. An adaptive grid that
chooses where to put grid points is useful in atmospheric radiative
transfer because the source function is usually rapidly varying in
some regions and slowly varying in others.
The iterative method is equivalent to a successive order approach.
For each iteration 1) the source function is transformed to discrete
ordinates at every grid point, 2) the integral form of the radiative
transfer equation is used to compute the discrete ordinate radiance at
every grid point, 3) the radiance is transformed back to spherical
harmonics, and 4) the new source function is computed from the radiance
in spherical harmonics. As with all order of scattering methods the
number of iterations increases with the single scattering albedo and
optical depth. A sequence acceleration method is used to speed up
convergence, which is typically achieved in under 50 iterations. During
the solution process, the grid cells with the integral of the source
function difference above a certain limit are split in half, generating
new grid points. The number of spherical harmonic terms kept at each
grid point also changes as the iterations proceed (i.e. adaptive
spherical harmonic truncation).
TERMINOLOGY
The medium properties (extinction, single scattering albedo, expansion
coefficients of the phase function/phase matrix elements, temperature)
are input at grid points from a "property file". These fields are
defined spatially (X, Y, and Z spacing) by the "property grid", which is
defined by the input file and is not the actual spatial grid used for
computatations. The internal grid used for computation has two
components: the "base grid" is regularly spaced (even in X and Y, even
or not in Z) and is defined by input parameters NX, NY, NZ, and
GRIDTYPE. The "adaptive grid" is all the grid cells including those
defined by subdividing the base grid cells. New cells are made by "cell
splitting", which is the process of figuring out which cells need to be
subdivided and dividing them in half to create new grid points. The
"sequence acceleration" is a method to reduce the number of iterations
by extrapolating the source function to where it is hoped the ultimate
solution is. The iterations proceed until the "solution criterion" is
reached, when the source function is changing by sufficiently small
amounts that convergence is declared.
The "spherical harmonic terms" are the series expansion of the angular
dependence of the source function. The angular dependence is also
represented by "discrete ordinates", which are discrete directions
(mu_i, phi_i) at which the source function or radiance is specified.
Angles are specified by mu, which is the cosine of the zenith angle,
and phi, which is the azimuth angle, counter-clockwise from the X axis.
"Delta-M scaling" is a standard method of reducing the forward
scattering peak in the phase function, by scaling the phase function,
single scattering albedo, and extinction so the solution to the
radiative transfer equation is the same. The "independent pixel"
approximation is a standard method of solving for horizontal domain
averaged radiative properties of an inhomogeneous medium by averaging
the results of 1D radiative transfer on separate columns.
POLARIZATION
Polarization in SHDOM has been implemented with the standard Stokes
parameters: I, Q, U, and V. I is the regular radiance or intensity, Q
and U measured the amount and orientation of linear polarization, and V
measures the amount and sign of circular polarization. The particular
definitions of the Stokes parameters are as in Mishchenko and Travis
1997. The Q and U parameters depend on their reference plane, which
rotates from the incoming meridional plane, to the scattering plane, and
then to the outgoing meridional plane. This coupling between Q and U
causes most of the complexity in implementing polarization in spherical
harmonics. The spherical harmonics basis implemented in SHDOM is the
"real generalized spherical harmonics" of Doicu et al. 2013. The key
subroutines implementing this spherical harmonics basis were adapted
from a research code kindly provided by Adrian Doicu. Even for
unpolarized calculations the spherical harmonics terms are different in
the new polarized SHDOM code from the old SHDOM code (due to a different
azimuth angle basis).
For polarized radiative transfer with randomly oriented particles
having a plane of symmetry, there are six unique elements of the 4x4
scattering phase matrix. One aspect of using the real generalized
spherical harmonics basis is that the phase matrix elements as a
function of scattering angle are expressed in Wigner d-function series.
For the I-I (P11) and V-V (P44) elements of the phase matrix these
series are the same as Legendre series, while for the other four
elements (P22, P33, P12, P34) the Wigner d-function series are not
equivalent to Legendre series. There is a new polarized property file
input format for SHDOM, which is based on the tabulated phase function
format, but including the coefficients for the six Wigner d-function
series expansions of the phase matrix elements. The old SHDOM property
file formats may still be used for unpolarized (NSTOKE=1) calculations.
The SHDOM output types that output polarized results are radiance (R),
visualization images (V), and source function (J). The hemispheric flux
(F), heating rate (H), spherical harmonics (S), and medium properties
(M) output unpolarized results as before.
An unpolarized radiative transfer calculation is chosen with
NSTOKES=1, in which case special subroutines are used so there is little
loss of efficiency compared to the old (unpolarized) SHDOM for 3D
problems. Polarized radiative transfer is chosen with NSTOKES=3 for I,
Q, U or NSTOKES=4 for I, Q, U, V Stokes parameters. For most
atmospheric radiative transfer problems the NSTOKES=3 approximation is
very good because V is very small.
INPUT PARAMETERS
The parameters are input in the routine USER_INPUT or
NAMELIST_INPUT. USER_INPUT uses READ statements from stdin in response
to written messages. Because not all parameters are relevant at once,
the complete list is not input. Run shdom interactively for the actual
sequence. The USER_INPUT routine echos the input values so that a
useful "log" file output is produced when run non-interactively. To use
the namelist style input, comment in the call to NAMELIST_INPUT in the
main routine. On some computers the NAMELIST input requires a space
before the parameters. All file name strings are a maximum of 80
characters.
Parameter Description
RUNNAME label for the run (also for multiple processor log file names)
PROPFILE name of the input medium property file
SFCFILE name of the input surface property file (or NONE)
CKDFILE name of the input correlated k-distribution file (or NONE)
INSAVEFILE name of the input binary save file (or NONE)
OUTSAVEFILE name of the output binary save file (or NONE)
NSTOKES number of Stokes parameters (1 for I, i.e. unpolarized;
3 for I, Q, U; or 4 for I, Q, U, V)
NX, NY, NZ base grid size in X, Y and Z
(NZ is one more than the number grid cells vertically;
NX and NY are the number of grid points horizontally;
for periodic boundaries there is actually an extra plane
of grid points on the horizontal boundaries)
NMU, NPHI number of discrete ordinates covering -1<mu<1 and 0<phi<2pi
BCFLAG Bit flags to specify the horizontal boundary conditions:
0 for periodic in X & Y, 1 for open in X, 2 for open in Y,
3 for open in X & Y.
IPFLAG Bit flags for independent pixel mode: 0 for 3D, 1 for
independent (2D) scans in X, 2 for 2D scans in Y (X-Z planes),
3 for indepedent pixels (i.e. bit 0 for X and bit 1 for Y).
Bit 2 of IPFLAG means do the direct beam in 3D, e.g.
IPFLAG=7 means 3D direct beam but IP diffuse radiative transfer.
DELTAM T (true) for delta-M scaling of medium and Nakajima and Tanaka
method of computing radiances
GRIDTYPE E for even Z base grid between bottom and top,
P for Z base grid levels from property file,
F for Z base grid levels from file: zgrid.inp.
SRCTYPE 'S' for solar source, 'T' for thermal source, 'B' for both
Following for Solar or Both source
SOLARFLUX top of medium solar flux on a horizontal surface (any units)
For k-distribution this is a multiplier for the solar flux
in the CKD file (i.e. normally should be 1).
SOLARMU cosine of the solar zenith angle (this represents the
direction of travel of the solar beam, so is forced
to be negative although it can be specified positive).
SOLARAZ solar beam azimuthal angle; specified in degrees but
immediately converted to radians for use in code.
0 is beam going in positive X direction, 90 is positive Y.
SKYRAD isotropic diffuse radiance incident from above
GNDALBEDO bottom surface Lambertian albedo
Following for Thermal source
GNDTEMP ground temperature in Kelvin
GNDALBEDO the input emissivity for Lambertian surface is converted
to albedo by GNDALBEDO=1-emissivity
SKYRAD blackbody temperature (Kelvin) of radiation incident from above
UNITS 'R' for radiance units (W/m^2 ster),
'T' for brightness temperature (Rayleigh-Jeans assumed)
WAVELEN wavelength in microns for 'R' units;
WAVELEN not needed for solar sources (except for Ocean surface)
(GNDTEMP and WAVELEN used for Both source)
WAVENO(2) wavenumber range (cm^-1) for correlated k-distribution.
This particular range must be in the CKD file.
If KDIST then WAVELEN set to 10000/(average wavenumber),
and UNITS='B' for band.
ACCELFLAG T (true) to do the sequence acceleration. An acceleration
extrapolation of the source function is done every other
iteration.
SOLACC solution accuracy - tolerance for solution criterion
MAXITER maximum number of iterations allowed
SPLITACC cell splitting accuracy; grid cells that have the adaptive
splitting criterion above this value are split.
This is an absolute measure, but cannot be easily associated
with the resulting radiometric accuracy.
Set to zero or negative for no adaptive cell splitting.
SHACC adaptive spherical harmonic truncation accuracy; the
spherical harmonic source function series is truncated
after the terms are below this level. Truncation can still
happens if SHACC=0 (for 0 source terms). This is also
an absolute measure, and is approximately the level of accuracy.
NUMOUT number of output files (different types usually)
For each output file there are three variables:
OUTTYPES(*) 'R' - for radiance,
'V' - for visualization image output,
'F' - for hemispheric flux,
'H' - for heating rate (net flux convergence),
'S' - for spherical harmonic (mean radiance, net flux),
'J' - for source function for one direction,
'M' - for medium properties.
OUTPARMS(*) Type Values
R Z level, X spacing, Y spacing, X offset, Y offset,
num angles, mu1, phi1, mu2, phi2, . . .
Z level is the height of the output locations.
X/Y spacing is distance between output X/Y locations.
X/Y offset is extra distance beyond domain (for open BCs).
Each outgoing angle is specified with -1 <= mu <= 1
(mu>0 is upwelling and mu must not be zero), and
phi in degrees (0 is exitting in +X direction)
V camera mode: 1, nbytes, scale, X,Y,Z, theta, phi, rotang,
nlines, nsamps, delline, delsamp
nbytes is number of bytes per pixel in PGM output images
(nbytes=2 outputs most significant byte first, -2 outputs LSB first)
scale is the scaling from radiance to pixel value in image
X, Y, Z is the camera location
theta is the zenith angle [deg] of the pointing direction
phi is the azimuth angle [deg] of the pointing direction
rotang is the angle [deg] camera is rotated around pointing
nlines is the number of pixels in vertical
nsamps is the number of pixels in horizontal
delline is the angle [deg] between pixels in vertical
delsamp is the angle [deg] between pixels in horizontal
V cross track mode: 2, nbytes, scale, X1,Y1,Z1, X2,Y2,Z2,
spacing, scan1, scan2, delscan
nbytes is number of bytes per pixel in PGM output images
scale is the scaling from radiance to pixel value in image
X1, Y1, Z1 is the starting camera location
X2, Y2, Z2 is the ending camera location
spacing is the along track distance between scans
scan1 is the min cross track scan angle [deg] (left side)
scan2 is the max cross track scan angle [deg] (right side)
delscan is the angle [deg] between pixels in the scan
F Format [, Z level, X spacing, Y spacing]
Format:
1 - flux at top (up) and bottom (down) of medium
at horizontal base grid locations
2 - up and down flux at Z level height at locations
with X and Y horizontal spacing,
3 - domain averaged vertical profile,
4 - output at every base grid point.
5 - output at every grid point.
Only format 2 inputs the parameters after the format.
H Format
1 - domain averaged vertical profile.
2 - output every base grid point: -Div(Fnet).
3 - output every grid point
S Format
1 - output every base grid point
2 - output every grid point
Output is mean radiance, X, Y, and Z net fluxes.
J Format, mu, phi
1 - output every base grid point
2 - output every grid point
Source function in direction mu, phi(degrees), mu may be 0.
M Format
1 - output every base grid point
2 - output every grid point
Output is medium properties: extinction, single scatter
albedo, asymmetry parameter, temperature.
OUTFILES(*) output file name (or NONE for no ascii file)
OutFileNC netcdf output file name (or 'NONE' or '' for no netcdf file).
The netcdf file has all NUMOUT outputs for the
OUTTYPES(*) specified above.
Memory parameters:
MAX_TOTAL_MB approximate maximum memory to use (MB for 4 byte reals)
ADAPT_GRID_FACTOR ratio of total grid points to base grid points
NUM_SH_TERM_FACTOR ratio of average number of spherical harmonic terms
to total possible (NLM)
CELL_TO_POINT_RATIO ratio of number of grid cells to grid points
CHOOSING INPUT PARAMETERS
See Unix scripts included with the distribution for examples of running
SHDOM. Also read propgen.txt for how to prepare optical property files.
The main difficulty in choosing parameters is determining the desired
tradeoff between accuracy and computation cost. The angular resolution
parameters (NMU, NPHI, SHACC) and the spatial resolution parameters (NX,
NY, NZ, SPLITACC) are the primary ones that determine both the accuracy
and the running time. See ACCURACY ISSUES for a discussion of this.
NMU must be even (rounded up internally). For NPHI greater than about
14, SHDOM will run faster if NPHI has small prime factors, due to the
FFT part of the spherical harmonic transform. The base grid size is
often the same size as the property grid, but can have more grid points
for higher spatial resolution or even can have less grid points than the
property grid.
PROPFILE always needs to be specified, but INSAVEFILE and OUTSAVEFILE
seldom do, and CKDFILE is only for doing a k-distribution. SFCFILE is
given for a Lambertian surface with variable albedo or temperature or a
different BRDF, but otherwise is NONE or NO.
Set BCFLAG=0 for periodic boundary conditions, and BCFLAG=3 for open
boundary conditions. See boundary condition section for explanation.
Set IPFLAG=0 for normal radiative transfer. IPFLAG=3 is for the
independent pixel approximation. The independent scanline approximation
is 2D radiative transfer in the solar plane for a 3D medium; it can be
chosen by setting the solar azimuth to 0 and IPFLAG=2 (or SOLARAZ=90
degrees and IPFLAG=1). Bit 2 of IPFLAG can be set to try an
approximation where the direct beam is computed in 3D, but the diffuse
transfer is done on indendent pixels or 2D slices. Note, IPFLAG=4 is
the same as IPFLAG=0.
Generally for solar (collimated) problems with highly peaked phase
functions DELTAM should be set to True; this is because the phase
function must be truncated because a limited number of discrete
ordinates can be used.
If you want the internal base grid to have the same height levels as the
input property grid then GRIDTYPE='P'. This option allows the vertical
grid spacing to be nonuniform, e.g. more in a cloud layer and less in
clear sky. Another way to have the vertical base grid not be uniform is
the GRIDTYPE='F' (file) option which will read the Z levels free format
from a file with the name "zgrid.inp". The other alternative is to
have the base grid levels evenly spaced between the bottom and top of
the domain (first and last level in property grid). For an even grid
GRIDTYPE='E', but keep in mind that there are NZ-1 grid cells, so that
for doubling the vertical resolution use NZ=2*(NZP-1)+1 where NZP is
the number of grid points vertically in the property file.
The SKYRAD parameter is usually 0, but is nonzero when the problem
includes diffuse radiation incident on the top boundary. This could be
the cosmic background radiation for microwave wavelengths, or diffuse
skylight for solar problems where the domain top is actually in the
lower atmosphere.
The UNITS parameter is only relevant for thermal emission without a
k-distribution.
The solution acceleration is usually turned on (ACCELFLAG=.TRUE.).
If the number of iterations is small because the problem is optically
thin or highly absorbing, then the sequence acceleration can be turned
off, and the memory for the DELSOURCE array freed up (by giving the array
a length of 1).
The solution accuracy (SOLACC) is the rms change in the source function
during an iteration normalized by the rms of the source function. There
is no need to make this much, much smaller than the general level of
accuracy (related to the spatial and angular resolution). Because it is
the rms change in an iteration, it should be smaller than the desired
accuracy. Try starting with SOLACC=1E-4.
See section on ARRAY SIZES AND MEMORY MANAGEMENT for how to choose the
four memory parameters (MAX_TOTAL_MB, ADAPT_GRID_FACTOR,
NUM_SH_TERM_FACTOR, CELL_TO_POINT_RATIO) used in shdom.f90 or to
adjust the array sizes in shdom.f to accomodate your problem size and
computer memory. The four memory parameters do not affect the SHDOM
solution (unless SHDOM runs out of memory); they only control the array
sizes.
See section on PROPERTY FILE FORMAT for how the medium properties are
input. The medium properties are specified at points on a grid and are
assumed to vary linearly in between. This is different from how Monte
Carlo radiative transfer codes usually define the medium, which is by
grid cells of uniform properties. The medium properties in the model
vary trilinearly across the grid cells. The internal representation of
the medium (on the base grid) is trilinearly interpolated from the input
properties of the medium. So if a sharp boundary is desired in the
medium (e.g. at cloud top) it is a good idea to have the two different
optical properties (e.g. cloud and clear) specified at two very close
grid levels. Otherwise the linear interpolation of the medium will
smear the transition over the resolution of the input property file
(nature is seldom perfectly sharp).
SURFACE REFLECTION MODELS
There are currently five types of surface reflection: Lambertian,
RPV-original, Ocean, Wave-Fresnel, and Diner. The RPV and Ocean
reflection models are unpolarized, so NSTOKES must be 1. Wave-Fresnel
is used for polarized ocean modeling, and Diner for polarized land
surface modeling. It is simple to add other surface types (see SOURCE
CODE OPTIONS).
A Lambertian surface reflects radiance isotropically, and is specified
with the surface albedo.
The RPV parameterization is used to model the general bidirectional
reflectance function (BRF) characteristics of land surfaces (soils and
vegetation) with three parameters (rho0, k, Theta) (Rahman, Pinty,
Verstraete, 1993: J. Geophys. Res., 98, 20791).
The Ocean BRDF parameterization models the glint specular reflection
averaged over wave facets and the internal reflection from sea water,
and is specified by the wind speed and chlorophyl-alpha pigment
concentration. The Ocean reflectance model is obtained from the 6S
model and modified by Norman Loeb at NASA Langley. High SHDOM angular
resolution (Nmu, Nphi) is needed at low wind speeds to resolve the
broadened specular reflection peak, so it is important to do a test
with increasing angular resolution until the reflection is stable.
The Wave-Fresnel model calculates the polarized Fresnel reflection
from a dielectric interface with a Gaussian distribution of slopes,
including shadowing. The complex index of refraction of the dielectric
(negative imaginary part for an absorbing medium) and the wind speed are
input. The mean square surface slope (s^2 or SIGMA2 in Mishchenko and
Travis) is obtained from the Cox and Munk formula in the paper.
(Mishchenko, M. I., and L. D. Travis, 1997: Satellite retrieval of
aerosol properties over the ocean using polarization as well as
intensity of reflected sunlight. J. Geophys. Res. 102, 16989-17013.)
As with the Ocean model it is important to do a test with increasing
angular resolution until the reflection is stable for low wind speeds,
due to the nearly specular sunglint peak.
The Diner et al. model is used to calculate polarized reflection from
land surfaces. It has two terms: 1) completely depolarizing volumetric
scattering with the modified RPV model, and 2) Fresnel reflection from
randomly oriented microfacets with either a uniform or Gaussian
distribution of slopes. The three parameters of the modified-RPV model
are A, K, and B. The weight given to the Fresnel-microfacet term is
ZETA (if ZETA=0 then only the REFLECT(1,1) element is non-zero with the
modified-RPV model). SIGMA is the standard deviation of microfacet
slopes for the Gaussian orientation distribution, but if SIGMA<=0 the
uniform orientation distribution is used. The complex index of
refraction of the Fresnel microfacets is fixed in the subroutine at
m=1.5+0i. The Diner et al. (2012) model leaves out the hotspot factor
of the modified RPV model (and the original RPV), but the hotspot factor
is included here. The model is described in Diner, D. J., F. Xu, J. V.
Martonchik, B. E. Rheingans, S. Geier, V. M. Jovanovic, A. Davis, R. A.
Chipman, S. C. McClain, 2012: Exploration of a polarized surface
bidirectional reflectance model using the ground-based multiangle
spectropolarimetric imager. Atmosphere 2012, 3, 591-619;
doi:10.3390/atmos3040591. The modified RPV model is from Martonchik et
al., 1998 (IEEE-TGARS, 36, 1266).
See the SURFACE FILE FORMAT section for how to specify the parameters.
The temperature and reflection properties of these models may vary
arbitrarily across the surface.
If the surface reflection parameters in the surface file are constant,
and thus the surface has horizontally uniform reflection, then the
surface reflection matrix is precomputed for the discrete ordinate
incident and outgoing directions, which can save considerable
computation. The temperature does not need to be uniform for this.
Lambertian surfaces are always computationally efficient, and so this
precomputation is not performed in that case.
BOUNDARY CONDITIONS
The upper boundary condition is open, i.e. there is no reflection from
the top of the domain. Isotropic incident radiance at the top boundary
may be specified. The surface reflection models are described above.
There are two options for the horizontal boundary conditions. The
first is periodic and the second is open. These are specified for X and
Y directions independently using the BCFLAG: bit 0 for X and bit 1 for
Y. Thus BCFLAG=0 is periodic in X and Y, BCFLAG=1 is open in X and
periodic in Y, and BCFLAG=3 is open in X and Y. The independent pixel
flag (IPFLAG) overrides the boundary condition flag, since an
independent pixel does not have a horizontal boundary condition.
Periodic boundaries mean that when rays exit one side they wrap
around to enter the opposite side. The periodic domain size is NX*DELX
by NY*DELY; for example if NX=64 and DELX=0.050 km, then X=3.2 km is
equivalent to X=0 km (left hand side).
The open horizontal boundary condition means that radiation leaving
the sides does not wrap around and is not reflected back in. However,
there may be radiation entering the sides. For open BCs the property
file grid points on the edges are used in independent pixel mode to
compute the boundary radiation. For example, in 2D (X and Z) the first
and last X property grid columns are computed with plane-parallel (IPA)
radiative transfer. These grid columns then provide the boundary
radiances for the interior of the domain. The interior domain is
(NX-1)*DELX wide. In 3D the four corner columns are computed with 1D
(plane-parallel) radiative transfer, while the four side slices are done
with 2D radiative transfer. If you want the boundary columns to produce
plane-parallel results, simply make sure the input optical properties
for the columns on the boundaries are the same. The radiance output
calculation can be done for locations beyond the horizontal domain; this is
the purpose of the "X and Y offsets" in the radiance output
specification. For the open boundary conditions the rays are traced
back through the regions beyond the domain using plane-parallel
radiative transfer, and then may enter the domain. This allows the
lower part of the domain to be seen through the domain "sides" for
slanted paths. Open boundary conditions are not allowed with MPI.
GRID CELL STRUCTURE
To implement the adaptive grid there is a data structure that
specifies the relation between grid cells and grid points. Grid cells
are rectangular volumes (e.g. cubes in 3D, rectangles in 2D). Grid points
are the points at which the medium properties and the radiances are
defined. Each grid cell has 8 grid points at its corners (in 1D and 2D
there are still 8 pointers to grid points though they are redundant).
Grid cells have pointers to neighboring cells, so that rays may be
traced from cell to cell. The adaptive grid is implemented by splitting
cells in half (in any direction) and using pointers to parent and
child cells. See VARIABLES section for details.
For periodic boundary conditions there are NX*NY*(NZ-1) grid cells
and (NX+1)*(NY+1)*NZ grid points. The IX=NX+1 column is the same as
the IX=1 column, because of the periodicity, but the grid points are
repeated. The last grid cell on the "right" has the last column of
grid points on the "left" and the repeated "virtual" set of grid points
on the "right". For open boundary conditions in X and Y there are
(NX+1)*(NY+1)*(NZ-1) grid cells and NX*NY*NZ grid points. Open boundary
conditions are done with the lowest "left most" and "right most" X and Y
columns being independent "pixels" with their own grid cells (zero
width), therefore one extra grid cell is required. Independent pixel
mode has NX*NY*(NZ-1) grid cells and NX*NY*NZ grid points
OUTPUT FORMATS
The output goes to a number of plain text files (except for the
visualization and netcdf output), each containing a header listing the
input parameters. The header also lists the number of iterations taken,
which for a k-distribution is the total number over the distribution
sum. There are seven output types:
R - radiance,
V - visualization,
F - hemispheric flux,
H - net flux convergence (heating rate),
S - spherical harmonic output,
J - source function, and
M - medium properties.
Any number of ascii files maybe output during a single run.
A single netcdf output file may be specified in addition to, or instead
of, the ascii output files. The netcdf file contains all the output
types of the specified ascii files. If the ascii files are not desired,
then use NONE for each of the ascii file names. The output in the
netcdf file is arranged somewhat differently from the ascii files. For
all of the output on a grid, X is the fastest moving index, and Z is the
slowest. The total downward flux and downward diffuse flux are output,
rather than the downward diffuse and direct flux separately, as in the
ascii files. Only those output formats allowed with multiple processors
are supported in the netcdf output. Specifically, the V, J, and M
formats are not supported, nor is the F 2 format, nor any output on the
adaptive grid (i.e. F, H, and S outputs are only on the base grid). All
of the information in the header of the ascii files is included as
global attributes in the netcdf file. The original unpolarized
shdom_netcdf module was written by Robert Pincus (December 2008).
The radiance output computes the radiance at a single vertical level for
a regular grid of locations for a series of directions. There are two
visualization output modes, both of which make binary (P5) PGM format
images. The first one simulates a fixed position camera, while the
second simulates a cross track scanning instrument flying on an
airplane. For the camera mode the inputs are the position and pointing
direction of the camera, the number of pixels vertically and
horizontally, and the pixel spacing. For the cross track scanning mode
the inputs are the starting and ending aircraft positions, the distance
along track beween scans, and the range of scan angles (<0 for left
side, > 0 for right side). The PGM image format has a human readable
ascii header with the SHDOM input parameters listed, followed by the
pixel values in straight binary format (from left to right and then top
to the bottom of the image). The pixel format may be byte (values 0 to
255) or two byte integers. PGM images may be viewed with many image
viewers, such as "display" from ImageMagick. For polarized output with
NSTOKES>1, the NSTOKES output images are stacked in the line (Y)
direction: I (radiance) on top, then degree of linear polarization (0 to
1), angle of linear polarization (-180 to 180 degrees), and (for
NSTOKES=4) degree of circular polarization (-1 to +1). The scaling to
the pixel values is done automatically (see the PGM header for the
scaling for each polarization component).
There are several ways to output hemispheric flux, including at the top
and bottom of the domain, on a regular horizontal grid at any level, or
for every grid point. The net flux convergence may be output for each
grid point or for a domain averaged vertical profile. The spherical
harmonic output is the mean radiance and net flux vector (proportional
to the first 4 terms in the spherical harmonic expansion) and for solar
problems includes the direct beam contribution.
The source function output lists the extinction and source function
computed for a specified direction at every grid point. This can be
useful for use by another program such as to compute limb radiances in a
spherical geometry. The medium properties output lists the (potentially
delta-M scaled) extinction, single scattering albedo, asymmetry
parameter, and temperature at the grid points, and is useful for making
sure the medium was input correctly.
There is a debugging output file format containing GLE graphics
commands to draw the grid cells and arrows showing the neighboring cell
relationships for an XZ plane. If this is desired, comment in the call
to VISUALIZE_CELLS after the solution iteration loop.
BINARY SAVE FILES
The source function solution and associated state may be saved in
binary files. The adaptive grid cell structure, the source function,
upwelling and downwelling fluxes, and lowest order terms of the
spherical harmonic radiance field are stored. These files can be
used to continue a run to higher accuracy or to save the solution
so more output files can be generated (e.g. radiances at more angles).
For a k-distribution, the input save file is used for the first k and
the output save file is from last k.
PROPERTY FILE FORMAT
The properties of the medium may be input in four different formats;
the first three are for unpolarized radiative transfer and the fourth is
for polarized calculations. 1) the "standard" format specifies the
extinction, scattering albedo, temperature, and Legendre phase function
expansion at every grid point; 2) the extinction only format only
specifies extinction at every grid point, while the other quantities are
given in the header part; 3) the tabulated phase function format
specifies all properties except the phase function for each grid point,
while many phase functions are tabulated in the header; and 4) the
polarized tabulated phase function format tabulates the six Wigner
d-function expansion coefficients for each phase matrix in the header.
All file types are plain ascii and read by Fortran free format (so most
"lines" may be broken across lines). With a small code change
(commenting in calls to read_property_size_netcdf and
read_properties_netcdf in shdom.f90) the two tabulated phase function
format property files may be read in netcdf format.
Note: Chi1 is 3 times the asymmetry parameter.
The standard file format has two header lines: the first line has
the size of the grid; the second line contains the X and Y grid
spacing and the Z levels. There is then one line per grid point
giving the properties. The format is as follows:
Nx Ny Nz
delX delY Z1 ... Zn
IX IY IZ Temp Extinct Albedo NumL Chi1 . . . ChiL
. . .
where IX, IY, and IZ are the spatial indices (1,...,Nx, etc.), Temp is
the grid point temperature in Kelvin, Extinct is the extinction in
units inverse of the grid spacing units, Albedo is the single
scattering albedo, NumL is the Legendre series degree, and Chi(n) are
the coefficients of the Legendre expansion of the phase function.
Chi1 is 3*g, where g is the asymmetry parameter.
The extinction only format has five header lines: the first line
must begin with 'E'; the second has the grid size; the third has the
grid spacing and Z levels; the fourth has the temperatures in Kelvin
of the Z levels; the fifth has the single scattering albedo, the degree
of the Legendre expansion of the phase function, and then the Legendre
coefficients. There is one line for every grid point containing the
grid point indices and the extinction; if the NY is 1 then only the
X and Z indices are given. The format is as follows:
E
Nx Ny Nz
delX delY Z1 ... Zn
T1 ... Tn
Albedo NumL Chi1 ... ChiL
IX IZ Extinct or IX IY IZ Extinct
. . .
The tabulated phase function format has 4+NUMPHASE header lines,
where NUMPHASE is the number of phase functions: the first line must
begin with 'T'; the second has the grid size; the third has the grid
spacing and Z levels; the fourth has the number of phase functions;
and the following "lines" have the phase functions containing the
degree of the Legendre expansion and then the Legendre coefficients.
There is one line for every grid point containing the grid point
indices, the temperature (Kelvin), the extinction, scattering albedo,
and phase function index (1 to NUMPHASE) to the table in the header.
The format is as follows:
T
Nx Ny Nz
delX delY Z1 ... Zn
numphase
NumL Chi1 ... ChiL (for each phase function)
. . .
IX IY IZ Temp Extinct Albedo Iphase
The polarized tabulated phase function format is the same as the
tabulated phase function format, but has has 4+6*NUMPHASE header lines:
the first line must begin with 'P'; the second has the grid size; the
third has the grid spacing and Z levels; the fourth has the number of
phase functions; and the rest of the header has six "lines" for each
phase matrix. These six lines start with number 1 to 6, indicating
which Wigner series, then the degree of the Wigner expansion, followed
by the Wigner d-function coefficients, starting with the zeroth term
(i.e. Chi0, not Chi1 as for the other formats).
The first Z level is the bottom surface and the last is the top
surface. The Legendre coefficients are normalized so that Chi0 (which
is not listed) is always 1. The phase function coefficients do not
actually need to be all on one line; in fact, propgen uses multiple
lines for phase functions with more than 200 terms.
SURFACE FILE FORMAT
There are currently five types of surface property files:
Lambertian, RPV-original, Ocean, Wave-Fresnel, and Diner. The file
format allows the temperature and reflection properties to vary
arbitrarily across the surface. The surface properties are specified on
a regular, evenly spaced grid. There should be Nx*Ny points in the
surface file. The properties are bilinearly interpolated to the domain
bottom grid point locations. The surface domain is also periodic in X
and Y, so that the domain is delX*Nx by delY*Ny in size. Thus there are
Nx cells across, and the routine that reads in the surface properties
assigns the properties of the left side of the domain (IX=1) to the
right side of the last cell (IX=NX+1). A uniform surface can be
specified by having Nx=1 and Ny=1 and making delX and delY large enough
to cover the whole domain.
The Lambertian surface file format is as follows:
L
Nx Ny delX delY
IX IY SfcTemp SfcAlbedo
. . .
The RPV surface is specified with three parameters (rho0, k, Theta):
R
Nx Ny delX delY
IX IY SfcTemp rho0 k Theta
. . .
The Ocean surface is specified with the near surface wind speed (m/s)
and the chlorophyl-alpha pigment concentration (mg/m^3):
O
Nx Ny delX delY
IX IY SfcTemp WindSpeed PigmentConc
. . .
The Wave-Fresnel surface is specified with the complex index of
refraction of the dielectric surface (Mre, Mim) and the near surface
wind speed (m/s):
W
Nx Ny delX delY
IX IY SfcTemp Mre Mim WindSpeed
. . .
The Diner surface is specified with five parameters (a, k, b, zeta, sigma)
R
Nx Ny delX delY
IX IY SfcTemp a k b zeta sigma
. . .
The IX and IY are the surface grid numbers (1 to Nx, 1 to Ny). The surface
temperature is specified in Kelvin.
CORRELATED K-DISTRIBUTION FILE FORMAT
The CKD file has the correlated k-distribution information for a
series of spectral bands for a particular atmospheric profile. This file
is made by another program from a profile. Each band, i.e. a range of
wavenumbers, has a k-distribution associated with it. The k-distibution
is performed as a discrete sum over probability intervals (the delta g
in the notation of Fu and Liou, 1993). A radiative transfer calculation
is done for each "k" in the sum, which represents the gaseous
absorption. The distribution of gases in the atmosphere is assumed to
horizontally uniform, and so the absorption coefficients depend only on
height. Thus the CKD file specifies the absorption coefficient (in
km^-1) for each height, "k" in the sum, and each band. The delta g's
are the weights with which the separate radiative transfer results are
added together.
There are three sections in the CKD file.
1. The band information.
2. The atmosphere information.
3. The absorption coefficients for each band and g range.
The format is:
Comment line
Number of bands
Comment line
For each band:
band num, waveno1, waveno2, solar flux, number of k's, all the delta g's
Number of z levels
Two comment lines
For each atmospheric level from the top down:
height, (other profile information is not used)
Comment line
For each band and level:
band number, level number, all the k_i's
Notes: waveno1 and waveno2 are the starting and ending wave numbers in
cm^-1 of the bands; solar flux is the top of the atmosphere solar
irradiance in W/m^2 for overhead sun. The level numbers start with 1
at the top of the atmosphere. The levels in the CKD file are completely
independent of the levels in the SHDOM run, though they should cover the
SHDOM levels. The k-distributions should start with the lowest k's
(they are used in the reverse order in the program).
DETAILED METHOD OF OPERATION/REFERENCES
See the journal papers:
Evans, K. F., 1998: The spherical harmonic discrete ordinate method
for three-dimensional atmospheric radiative transfer. J. Atmos. Sci.,
55, 429-446.
Pincus, R., and K. F. Evans, 2009: Computational cost and accuracy in
calculating three-dimensional radiative transfer: Results for new
implementations of Monte Carlo and SHDOM. J. Atmos. Sci., 66,
3131-3146.
Doicu, A., D. Efremenko, T. Trautmann, 2013: A multi-dimensional
vector spherical harmonics discrete ordinate method for atmospheric
radiative transfer. J. Quant. Spectrosc. Radiat. Transfer, 118, 121-131.
Mishchenko, M. I., and L. D. Travis, 1997: Satellite retrieval of
aerosol properties over the ocean using polarization as well as
intensity of reflected sunlight. J. Geophys. Res. 102, 16989-17013.
OPERATING STRATEGY
The first step is to compile the code. A makefile is provided, with
options to compile with or without MPI (for multiple processors) and
with or without the netcdf library. SHDOM was originally written in
Fortran 77 with common extensions. Later, parts of SHDOM were written
in Fortran 90, and now all of the SHDOM-specific code (shdom.f90 and
shdomsub?.f) requires a Fortran 90 compiler. Fortran 90 uses
allocatable arrays and frees the user from having to set most of the
parameters controlling the array sizes.
While a few algorithm and coding bugs may be left in the code, the
most common source of errors is incorrect input. Be sure to check the
input parameters against the documentation and the USER_INPUT routine.
The correctness of the input property file format can be found by using
the 'M' - medium property output option. There are routines that
check the input parameters and property file for validity and issue
warnings for common mistakes.
Conservation of energy can be used as a check for conservatively
scattering medium, but remember that the multiple reflections between
the surface and medium will introduce "extra" flux. The collimated
(solar) flux input parameter is the flux on a horizontal surface
(rather than the flux perpendicular to the beam), so for conservative
scattering and no surface reflection the reflected plus transmitted
flux over the domain should equal this parameter.
The delta-M scaling procedure is very useful for reducing the
angular resolution needed. It is generally used for collimated (solar)
source problems. One should remember that the delta-M scaling procedure
changes the extinction, single scattering albedo, and Legendre phase
function fields, as can be seen with the medium output option. This
means that the direct solar flux is increased with delta-M scaling and
must be added to the diffuse transmitted flux for meaningful results.
For putting in the rest of the atmosphere: a coarse grid outside the
cloud region works because the rest of the atmosphere does not interact
much with the cloud layer (little reflection back to the cloud layer).
The fluxes, which depend on the cell-to-cell discrete ordinate
integration, will not be as reliable above the cloud level. The
radiances, however, which are computed by integrating the source
function back all the way to the boundary will be fine at any level.
Output during Iterations
During the iterations, some useful information is output to stdout.
Six terms are output on a line each iteration:
Iterations, Log10(solution criterion), Cell splitting criterion,
Number of grid points, Average number of SH terms,
Ratio of number of SH terms to total possible
Using these parameters you can track the convergence towards the
solution and the behavior of the adaptive grid process. As the current
cell splitting accuracy parameter is reduced the number of grid points
increases and the cell splitting criterion decreases. The cell splitting
criterion is the maximum over the whole medium. The adaptive spherical
harmonic truncation may also be monitored with the average number of SH
terms.
The solution criterion often does not converge monotonically. When
the cell splitting accuracy parameter is lowered and many adaptive cells
are created the solution criterion typically increases. The sequence
acceleration method may cause the solution criterion to increase at a
particular step, though the overall convergence is usually speeded. The
adaptive spherical harmonic truncation (parameter SHACC) may also
interfer with a smooth convergence. If you have trouble with
convergence try adusting one of these parameters (SPLITACC, SHACC,
ACCEL).
ACCURACY ISSUES
Like all numerical solutions to the radiative transfer equation, the
operation of SHDOM entails a tradeoff between computational speed and
accuracy. One difference between 1D and 3D radiative transfer is that
it is very expensive to get very high accuracy (better than 1%) in 3D.
This is a result of the five dimensional nature of the problem. This is
also the case for Monte Carlo methods, but it is worse for explicit
radiative transfer methods. The strategy issues for running SHDOM
involve how to achieve a required accuracy in a minimum CPU time.
The grid spacing needed is determined by some assumptions of SHDOM:
1) the variation in the extinction and the product of the source
function and extinction across a grid cell is linear, 2) the source
function across a grid face is accurately interpolated with the bilinear
method, and 3) the radiance across a grid face is accurately
interpolated with the bilinear method. The first and second
assumptions should be satisfied by the source function based adaptive
cell splitting algorithm, and also would be by having the optical depth
across the grid cells small compared to one. The third assumption is
satisfied by not using too coarse a base grid, and probably only matters
as it affects the computation of the source function. It can make sense
to have a coarse base grid where there is little scattering (clear sky),
but then the output fluxes in those regions will be of low accuracy,
though the output radiances (computed by tracing back) will be good.
Because the accuracy can be compromised if the optical depth across a
grid cell is too large, a warning is given if the property grid has
cells with optical depth larger than 2. It is the internal grid that
matters, but most people use the same grid so the check is done there.
There are pathological cases in which a completely erroneous solar
radiative transfer solution can occur if the optical depth across the
cloud top grid cell is very large.
The accuracy is also determined by the angular resolution, specified
by the number of discrete ordinates (NMU and NPHI). To a lesser degree
the adaptive spherical harmonic truncation parameter SHACC also
determines the angular resolution. Most of the benefit from the
adaptive truncation occurs for low values of SHACC, so it is recommended
to keep this parameter small. A value of SHACC=0 still gives the
benefit of not allocating source function spherical harmonic terms where
there is no scattering. Then NMU and NPHI will determine the angular
resolution. The angular resolution needed depends on the desired
output: for solar beam problems, net flux convergence (heating rates)
requires the least, followed by hemispheric fluxes, and then radiances.
For heating rates Nmu=8, Nphi=16 is enough, for hemispheric fluxes
Nmu=8, Nphi=16 is often adequate, and radiances may need Nmu=16,
Nphi=32 or more depending on the phase function.
Choosing the cell splitting accuracy: Remember that the adaptive
parameters are in absolute units, not relative! The actual accuracy is
not linearly related to the cell splitting accuracy. The cell splitting
accuracy usually should be set substantially higher than the desired
absolute flux accuracy. Look at the examples in the 1998 journal paper
for guidance.
The only sure way to know how to set the spatial and angular
resolution is to test how the desired solution behaves with increasing
resolution, or make selected comparisons with an independent radiative
transfer method. It is not useful to increase the angular and spatial
accuracy much beyond one another. Tests have shown that the accuracy is
not controlled by the cell splitting accuracy alone; both the base grid
(NX,NY,NZ) and the splitting accuracy (SPLITACC) have to be considered.
Thus, setting the splitting accuracy so small that the number of
adaptive grid points is many times the number of base grid points, will
probably not achieve any higher accuracy.
Unlike for plane-parallel situations, the delta-M method does not
assure high accuracy for downwelling solar flux in 2D and 3D as it does
for 1D, thus the distribution of downwelling solar flux may not be
accurate for highly peaked phase functions unless high angular
resolution is used. This is because in 3D, sharp variations in the
extinction field mean that the downwelling solar flux is not averaged
over all angles in the same way as for a uniform medium. This limitation
appears to not affect upwelling flux, upwelling radiance, or heating
rates.
Because of the thresholding nature of the cell splitting and the
spherical harmonic truncation, very small changes in the inputs can
amplify into larger changes in the output results. For example, running
the same case on different machines can lead to noticeably different
results. These variations, however, should be within the overall
accuracy of the results governed by the angular and spatial resolution.
Upwelling hemispheric fluxes are slightly more accurately computed
by using the double Gaussian quadrature discrete ordinate set. This is
choosen by setting ORDINATESET=3 in SOLVE_RTE. With this option,
radiances, heating rates, and net fluxes may be slightly less accurate.
RUNNING SHDOM WITH MULTIPLE PROCESSORS
SHDOM may be run on multiple processors, either in a shared memory
system (e.g. SMP) or in a distributed memory system (e.g. a cluster),
using the Message Passing Interface (MPI) version 1. All of the SHDOM
code that calls MPI routines is in the shdom_mpi.f90 file. A set of
equivalent (mostly) dummy routines is provided in shdom_nompi.f, which
allows SHDOM to be compiled without an MPI libarary. Using the parallel
processing aspects of SHDOM requires that the MPI system be installed on
the target computer. Then the usual way to compile and run a program
with MPI is:
mpif90 -O2 -o shdom shdom.f90 shdom_mpi.f90 shdom_nonetcdf.f90 \
shdomsub1.f shdomsub2.f shdomsub3.f ocean_brdf.f fftpack.f
mpirun -np 4 shdom90 < shdom.inp
where shdom.inp has the user input parameters and "-np 4" specifies
that four processors will be used. The MPI shared memory device (e.g.
ch_shmem from the MPICH implementation) will be somewhat more efficient
on an SMP machine than a device (e.g. ch_p4) that uses a network
communication protocol (such as ssh).
The SHDOM computation is divided up on the multiple processors with
domain decomposition, meaning that each processor works on a portion of
the horizontal domain. MPI routines determine the 2D Cartesian topology
that maps the full domain onto the processors, but usually four
processors will be mapped to two in X by two in Y. The boundaries
between processors are at property grid lines, and the internal base
grid points are repeated on both sides of the boundaries. Thus a