From 243c896b56d4e37ed6d6ca13b057f3070158c6fb Mon Sep 17 00:00:00 2001
From: Jordan Musser <jordan.musser@netl.doe.gov>
Date: Fri, 26 Jan 2024 16:36:34 -0500
Subject: [PATCH 1/2] Add changes to MR

---
 .../user_guide/inputs/advanced.rst            |  83 +++++++
 docs/source_docs/user_guide/inputs/domain.rst |   2 +
 .../inputs/images/geometry/mesh_lev0_ex.png   | Bin 0 -> 5097 bytes
 .../user_guide/inputs/mesh_and_gridding.rst   | 219 +++++++++---------
 4 files changed, 199 insertions(+), 105 deletions(-)
 create mode 100644 docs/source_docs/user_guide/inputs/images/geometry/mesh_lev0_ex.png

diff --git a/docs/source_docs/user_guide/inputs/advanced.rst b/docs/source_docs/user_guide/inputs/advanced.rst
index 7ce68e8..ea67991 100644
--- a/docs/source_docs/user_guide/inputs/advanced.rst
+++ b/docs/source_docs/user_guide/inputs/advanced.rst
@@ -46,3 +46,86 @@ The following inputs must be preceded by "amrex."
 +------------------------+-----------------------------------------------------------------------+-------------+--------------+
 | the_area_init_size     |  Abort if and overflow is detected.                                   |  Int        | 0            |
 +------------------------+-----------------------------------------------------------------------+-------------+--------------+
+
+
+Load balancing
+--------------
+
+Note that when running a granular simulation, i.e., no fluid phase, :cpp:`mfix.dual_grid` must be 0. Hence,
+the :cpp:`particles.max_grid_size` (in each direction) have no meaning. Therefore the fluid grid and tile
+sizes should be set for particle load balancing. It may also be necessary to set the blocking factors to 1.
+
+
+The following inputs must be preceded by "mfix." and determine how we load balance:
+
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+|                      | Description                                                           |   Type      | Default      |
++======================+=======================================================================+=============+==============+
+| dual_grid            | If true then use the "dual_grid" approach for load balancing          |  Bool       | False        |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| regrid_int           | How often to regrid (in number of steps at level 0)                   |   Int       |    -1        |
+|                      | if regrid_int = -1 then no regridding will occur                      |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| load_balance_fluid   | Only relevant if (dual_grid); if so do we also regrid mesh data       |  Int        | 0            |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| load_balance_type    | What strategy to use for load balancing                               |  String     | KnapSack     |
+|                      | Options are "KnapSack", "SFC", or "Greedy"                            |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| knapsack_weight_type | What weighting function to use if using Knapsack load balancing       |  String     | RunTimeCosts |
+|                      | Options are "RunTimeCosts" or "NumParticles""                         |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| knapsack_nmax        | Maximum number of grids per MPI process if using knapsack algorithm   |  Int        | 128          |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| greedy_dir           | The direction in which the greedy algorithm cuts overloaded boxes     |  Int        | 0            |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| greedy_min_grid_size | The minimum particle grid size in the greedy load balance algorithm   |  Int        | 2            |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| greedy_3d            | Partition particle grids in 3D with the greedy algorithhm             |  Bool       | False        |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| overload_tolerance*  | The ratio between the maximum workload and the average workload       |  Real       | 1.2          |
+|                      | in the greedy algorithm                                               |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| underload_tolerance* | The ratio between the minimum workload and the average workload       |  Real       | 0.8          |
+|                      | in the greedy algorithm                                               |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| grid_pruning         | Remove all covered grids from the base mesh; this may result in       |  Bool       | False        |
+|                      | disjoined grids                                                       |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+
+\* The greedy partitioning algorithm uses the tolerances to set the expected
+workload range for each rank but doesn't strictly enforce them. The algorithm
+creates a partition as balance as possible even if the partition doesn't
+meet the tolerances.
+
+To allow a user to verify the breakdown of fluid grids created before running a full simulation, an input option,
+:cpp:`mfix.only_print_grid_report` is supported. By default, it is :cpp:`False`. When set to :cpp:`True`, the run uses
+minimal memory to print the grid coverage report and exits immediately after that.
+
+
+The following inputs are defined using the ``particles`` prefix.
+
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+|                      | Description                                                           |   Type      | Default      |
++======================+=======================================================================+=============+==============+
+| max_grid_size_x      | Maximum number of cells at level 0 in each grid in x-direction        |    Int      | 32           |
+|                      | for grids in the ParticleBoxArray if dual_grid is true                |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| max_grid_size_y      | Maximum number of cells at level 0 in each grid in y-direction        |    Int      | 32           |
+|                      | for grids in the ParticleBoxArray if dual_grid is true                |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+| max_grid_size_z      | Maximum number of cells at level 0 in each grid in z-direction        |    Int      | 32           |
+|                      | for grids in the ParticleBoxArray if dual_grid is true.               |             |              |
++----------------------+-----------------------------------------------------------------------+-------------+--------------+
+
+
+.. todo:: Is this needed anymore?
+
+   The following inputs must be preceded by "particles."
+
+   +----------------------+-----------------------------------------------------------------------+-------------+--------------+
+   |                      | Description                                                           |   Type      | Default      |
+   +======================+=======================================================================+=============+==============+
+   | reduceGhostParticles | Remove unused ghost particles from communication,                     |    Bool     | True         |
+   |                      | only supported by GPU build                                           |             |              |
+   +----------------------+-----------------------------------------------------------------------+-------------+--------------+
+
diff --git a/docs/source_docs/user_guide/inputs/domain.rst b/docs/source_docs/user_guide/inputs/domain.rst
index f1a1933..fe18fc1 100644
--- a/docs/source_docs/user_guide/inputs/domain.rst
+++ b/docs/source_docs/user_guide/inputs/domain.rst
@@ -8,6 +8,8 @@ low and high corners of the cuboid are defined by the ``prob_lo`` and ``prob_hi`
 
 The following inputs are defined using the ``geometry`` prefix.
 
+.. _InputsTable_domain:
+
 +-----------------+-----------------------------------------------------------------------+-------------+-----------+
 |                 | Description                                                           |   Type      | Default   |
 +=================+=======================================================================+=============+===========+
diff --git a/docs/source_docs/user_guide/inputs/images/geometry/mesh_lev0_ex.png b/docs/source_docs/user_guide/inputs/images/geometry/mesh_lev0_ex.png
new file mode 100644
index 0000000000000000000000000000000000000000..de1f7c1c1b9721cb93246729404b4271c294bc30
GIT binary patch
literal 5097
zcmds5XH*mG){X@|fQU#31r<^09fX5)kdA<n76?KJO+-Qyq^UGTdWm$U3V|Tf&PY2J
zYN!#2geH&>ij+f_kZ=dR>s$A(`+e))ANSWcYt5SMXTN(t&)&1&nU#5V%k(A-6CV=<
z0%0-G*Rg;=jyFIcM+}Z11Ci_i{04YA?y6~`34v56GVkAI0PQ2b7B{sZ*uHZMpmD)j
z-@*g}3BCe>L_|U$dmt1s4}thgLm-Qe5Qxf42!#7V&U<rJ&>?-x__prB!9i?n?8L+b
z6bkL>=_xNSmy?qNoosAufq{W5D=P<batE222R%Ip5Xb=%i9@qtLH7|SqnkRAScdcZ
zpuzY+-^LdLIsNJIJksI)1_6PbEj7^5y#0{4oZBBbGzWC#5B4M_y!h+o(1Q@Gg8M88
zg_q2d717K$asm`01sUysXnlwrzQu<FFlpyz>M@V^AAB-=9Q|#s9_>GV-C-(Qg?<e6
zheOASrWW$`o%C@>ck!Uh@d7srr>V*h=fv2W!;q7#RSxd*Ol+W@|Ma+E)H<z7KCRDv
zRLhdbI$VG+8C%3KtCUkZn2nE$7~trQrrnlKG_=A$nP#L`%e#XMKh{u9s6JD6c>h|h
zY*9?d0zckA;<vObry41IAcmMyP?*iCEy6*}U$&HQD-((^ZYDnXft$aFQdpYC!;MUu
zJn+y+$G~F0i4@68T=~65Ru`h^jf<00`rK^M_^k=vT%Q|RXspJIP^Aujyh5at3j47N
zR%Q&IGxoKKi8YAKoMgvyenw2c_G3U*<j2p4tP_gGV!N1CQRXMjfaeMXt(IA{_8)`P
z;ieP}dY*s55I<A#Xqr@2jAJBCj$xzjCBS^Ada=Ej51!QZ0zZ+3R|iAQe(!v5VkcJE
z9~7U;i68sC?Si3nscxylc46<+(mFF;2>4R}h2SRxrQ`X+NDFL>ciB$YC9m-J_YBY}
zXIYJRcng~k{ns*)F3uwPx<IgiML)aI@_9ics}!y;C6D${>t3M$;e@>X#)lBfO3Txr
zTEoP=HxvN6U}#Zr-6gBNeWvTVpQh%EtdgbiJ0e1*#;W2yOEsPztAFelynqFLk?&CC
z+DT~hip?x2|Kfi~EmerXM<a9-FWuO4vU#79_RwT2)05+?U9#l02x>jdTOHm*>QuPt
ziSdlvtEiKfvNsf+pDpjdqLiV2J{5Z{!rcxVFIQSD4z?o{SG%^;!f6riaNIB;zY;$A
zea&6>{iX0)k<weUK@&FT0IoT;(!7pse>P9J0U_lCHAiji`l+R%WDLSIYi_r_E7pmN
zOr-aO?OV_`>Gtn4<X<Y5_AnB<V!fR4oIf_Qf_mlQV=vDy@;2uf@oq=?G4zI<-Q-T+
zCPF(L8+(=PW9}B~&3kzsS&FM0Fp1PNVG%3I4D3IxY4M%W8ZO#ESP9+OGeU+A`SWn)
z>HWEPSL_EN-;$nPGM9J$Nzrw#w_&e(1BTxglZc+}Kt%COtL^OWjE)1;Ez9GoTKm9m
zT%XUuaMmEQu@I@<1(ltJ5qQ^{zhCNDnnF&#B?m(Qo;6;8N$==#f*GHoWT<EYEWH;)
zN60jXEj#vR7&XG2_ZXzai8f1*fC`G(7A~g0c4~9rq6KPjt@sT;-JP<7THqIBUOlJw
zVMp;ApfPi9L2$w9dGnVGm0D34QCkmDc%l1K)3QE=C#>Ti9dVqz%Tp)$ojvkGtBr-9
zuDjIfRZbx1W1fvL4`B>iSu)X{3Kxtlt{Vm^yi<L%pX2r-`oyy7ZA;h|$#|ZbMO0Vd
zA?$OC7`%AaS?4P5SQ=(fg8U_@E+D$IY|tTaRSUnWJx8`>>P$%!$A_87yRqKk<TE!k
z!PHTonX-qzB9#vSnTlD7!sxxIA54Q6iAZvJR6E4IRhZ(jZ&#Ek0(5T<j?P!!XCXOh
z#GC(6|N4l(QZwG7QYVYJ^VNAo_oifJfnaaLV2C)7rcBLg_QT$(I9=A(R34d-5)fZ2
zV<%aO5)a1Mr3L|9$KGA`ctM2M_ZFOhu2?=>|AY|4q~9N%Fo;5#GIHlzqb4r$iHuQZ
znCoC$$5gZ_Cv&paSH~LNyP+F;zZaHCWuh_c%G!J8!^u!#^azhRD%6qnHm6GXh_aD`
zIIHENkd-!zsY|zwAr?Qulsniyq*0u{@N~>0o|tOY-vCVo6D-};$S5+FtIG|5bCwe$
z_}<paG*0mmhmY7*H@2-abVHXPO9)hxL@|u;rz>SL-J)fm;^c~n0#JCRHLrPHlKEZ(
zru>_e^5D+J4U_}}!a&qRvelBsdeLlJ+kZU2ip!Ijh}(&`8lGAZYRg9tV+YkDUfetf
zFu8c?3>Qc4y<A^h+xnh_Doer$Gz=x(k;pLG!KBGWr-lm3N)bLqh#M3Z1w7V!=2wRU
zN;vp0(-sFNtipA#4&6P#=}l|bX}|Zo7O>$%!gqgy`b+;+eS^IRF02xudA|EJM>q<~
z;tp7`-dYi=Yg9{H>x%8c4L{3iaT`=DMC2;w`05)fsD$2Zbs8{;(_6h|?k{;>&Phqo
zvthSgnA&s4LEUSVQai7izPaNOh2vT-_^2~nmTcNq)rIcCxqzjPR3(a@qZhj_+f#~m
z^P#dtPqhr#qa1ivqxD8bXq$);E&aYs6Dv5eA{PQ>s!1sjlnZNTIv3^$5ce1s!SX*p
z-K#m08hJp|=%>fMn&;4D_&V<hpg4Cdz<Fiqw2#`!u01bKP&RuF-nA&lhG~aI7=R3;
zh;X1>B3=hd>^PNn7f%qCNj7(n+Su;c_^hF6d7M%{1db_titK*Q<|8J$!JCBryWOE&
zc9N^P(yA=O$zS@U0K&<cX3JRDhTXG2D1X9vW?B=a2wweXNh!V4B)$!C|F^K;Zx|&R
zDaNE?rfn7-fhKM7pT6>k87kJ2-mgJdU?%LPufN5LC~(@$L2Ur;Uv~`9@%;_h6k}f%
zm{)?0`IW=eWt-qBk74hWdpftCPcPRbq+@tHg|j~`b2V#7B03oWgd9l=iA!S6l+cz$
z)Sm*9MrHpkfI{OKbe;)wD?RHPwhchEjFW||WWCR#n*Ou!e^2l)D}FMT9ZV&VC$DaJ
zPOVrwtg_L(E6Fn@z?Fj4nO`!Rf=G+@uetPJz|8D8$tS^fWs`}M?*xTOwk13olB3^+
ztZ-pzNlONpg8c~CJ6<7fS-blgOW9Rvu;vX>+Es?+_pm9|nh`RdBEkgrSE*}wV^a{0
zy_J8X6rNoQ!DoQ)=Rh+w73u0qnQM7<4a_-ynedS!-|()*6=qiqu2`yq4`kI;ji4o-
z9|Z-Npek)P=ViPC^{`N;3tr$`%RR<7j8xMva`q>If|2}Ar4X8qNI$A?hpp4Kwf^B6
zdmIdO0D^}R^sW>+%HqN~0BMk4LtcDjJP67j6+`pAQ=_7~7TQLmVmLIqQ43~eu5B4c
z%L0W*07Q)ywnaE}d8uQlrs%zIhc?p%7a;8~=Kecwp;veq!CT*q+7jQou?*(R&0ggJ
z_U_b6jX8}TBfziytQl9g{ear-)Sr^2v1$!gBmf9Vg<#a}{rVEc#Vw9sK;OHc<trg&
zC*LryO7jP0+h8ZjWL?{Acy?|s?PhJ9(BTpl&cC*rXm7$x^Hh+ef~teDu#l24`a6Ad
zumI&<!=b)pF-5QeRNR(&ToN;|{9Vi{ktSo*(F4pJ18b;)^)av|&WS)eqsF6QTweZa
z=TH|r(4Kdp=VkgV&LlA_x4^c4to1)pzt@!n(@5K!Tm7y4^^!Ndzt{UBw^EowTMDM`
zduNF9H&w!cT6U?1tUOA1<I2SjoITy2j+nemddQ2qSmV+e)st}%oPuar=e_?10N}_+
z3qGgBWq&4lNc=7fUs+EU%T!`icKB}*{ENu{M~uhFqPH%bMJ-%4-QWHEjGe~808T57
zq+$(a{ldwlrG+gBeZ1H>XZLqJ>yG`KOem8UvcWC6*yIHF&kY}j;_T(~8gqJaCfj$p
zQ8W2)zXbMe5m|^g^d9SX6~oIcE`#&Aoh}wgOEq|UG&yA{?DJ!06r<}wVcVcOJ^Be-
zicy@(gXu?c_}%U;jo$b5DRKN$aOz%E?ESl>vwd_S0*x{5Nir{h8Z_0uzN{fR_+1RG
zrUET-M3A!749&r3?Wh%Yzb^E=5+y4r_0q3#E+o&~q3R|lhGWwR+V>;14$*v=9AU`R
zEpD_X5>_>}AC`UaYT)oGb(P>|!g7Ya)Po};0;Zi>66Euz#RB5#&zMBmPPmKuiXh9m
z#>=E&I3eMDPB2B|AWic%$Qx7x+tZD!*INn3I<MNR>P-K_bf4Nr#4#k#3QE*NP#3i&
z!LI((%%fq%4=187xd5B2-Z2<Yc5q)dlDf0m6_~WUQDoy9wsNe=p{`_Hj`X$TA~>k*
zC)=vECJ30v9<^YJ9N0PPk84ib&hZ?Vuj1!8K}%Gch&xPjb<-nOI<zVa-;^&`B7r7K
zMei$qU(1D!*8K*SPjT~UU2HR|&GaQ_mV3%<6}It02__!#*yrj?w$Wf0ABoQ7EVlK=
zAc-uOc8Hs1x~DMjwL)$b-nLZ7x-3YwMwrNFQSW8HKEav@yHu-oQ3k<`^b&_>go$a&
zWuNXfO%Bxfswin`T^ME8x0Z!b@s%IQG^QI_=0k_f@?fEgL|%Wd0=zuxZtGWf<4N~L
z*~)53^k!H!SxK(;UGN_iky>(I?TrEY$j}5cWq(iF^-YZP2Yw&!Vjfu<t8?>pv~n;K
zX|tMB(Ety0Ik03?-YT489FJ|h*4C5eP9O0A(rH!S9D19tEaj$dZ7=c5ykcDj_TRGA
zvIUwypZUX`KIzd6t=1EK#41uVschrjN>_ECr3==iog(^PbrV`C=xhNrz9xp>XK7ir
zPu}Y53Z`yR$~XpM_9{P&JqCs7_L(mtl!uVEb?&+&zh7NiCghh|418!o9{2uiKXk{x
z?(fX~sR6sDoAH0dU7|XD_^lykE#+6TA;q(9>c7O5d0id|`<26TO#c<FBG7bXtg5s=
zS%dnkLNhO+$J@VT<`KcaC9ciB?<T`;)Wshs_6JiC&9q<-e#YAT|2=UF?aDwH|80x0
zH&2Nt|M6qMH%EQ$0f7v^BVGk<qn(cgaeXd?)u1t{&-4|2o=vorL&bUv#9{rDj3Qo7
zd;EFj`2_xF^1j|hU%yq>d>q}wjuo-W)>p9zC|i(2j&#I0-5c@~#C;YyXIzpir1R+d
zQcpaA$Byuwe@LZaW%~6)613n*JNGO2SHW4R(nN=>Pe9vlYcbW4)h>_fFB})Pw9R^b
zDr$?-`DtZPWr;vor*(K)fa0EB?@qPx%S_GLWgWh5yS@nnp6M23)XS<fgNb1JgKK?f
zhC&2!q_WHS=d{(5TXPk1?QaFyM)uUhfcCG!ShiBZWfSXzUKJ&|dxF#{meo%G=QTbt
z7vrk`nmBd>K1=OjJthB!%Lj8vljWrR?2cclX<W@fzK`O4%j?~v@58qDCF<J~n@}_4
zz(gLlhIJs(<s*lOLG6sOwX?B^(4RpS<Og|F?zaiw0gp`+w=bdMsgk0}R`<kgMSVxQ
zf-7`uT`6>($=FK$xysY85IR$lieU8%RxpzS&N?cc1Xso(@_fmX%>wJ;Z#Wa3%YBwv
z`mD}4uaJBNaghedK3&R5*6IuZ*Mur+Z666)i66^vYDis8t^Y!My4rmAZMs=C=}=Vx
zsr~9CkpB6e-T{MarcU5N&%xljObf5l1TFc4U5I46V)m+Y=CwUBu7##5zM`Ww*Zy%D
zAdq=Oy5MeMq)nB>0QkuYBJ8JY?T5JQ=c3|_ase+8S!r2$327M#S;gBjvMLHnDl*q4
nq!m=8r60J-+Wb?1mp8&48T5Y#98q{qKIAsgHPyjtIX?Ltd1vuu

literal 0
HcmV?d00001

diff --git a/docs/source_docs/user_guide/inputs/mesh_and_gridding.rst b/docs/source_docs/user_guide/inputs/mesh_and_gridding.rst
index 96f6d3f..62805ad 100644
--- a/docs/source_docs/user_guide/inputs/mesh_and_gridding.rst
+++ b/docs/source_docs/user_guide/inputs/mesh_and_gridding.rst
@@ -1,84 +1,153 @@
 .. sec:InputsMeshAndGridding:
 
-Mesh and gridding
-=================
-
-Mesh, grids and tiles
----------------------
+Mesh, grids, and tiles
+======================
 
 Mesh
-^^^^
+----
+
+.. rubric:: Level-0 mesh
+
+The following inputs are defined using the ``arm`` prefix.
 
-There are some settings we can specify for the mesh and the automatic mesh
-refinement algorithm. These settings must be preceded by "amr." in the input
-file.
+.. _InputsTable_mesh:
 
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
 |                      | Description                                                           |   Type      | Default   |
+|                      |                                                                       |             |           |
 +======================+=======================================================================+=============+===========+
-| max_level            | Maximum level of refinement allowed (0 when single-level)             |    Int      | 0         |
+| max_level            | Maximum level of refinement. The default value, 0, means that there   |    int      | 0         |
+|                      | is one cell size covering the entire domain (i.e., single-level).     |             |           |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
+| n_cell               | Number of cells at level 0 in each coordinate direction               |    ints     | 0 0 0     |
++----------------------+-----------------------------------------------------------------------+-------------+-----------+
+
+The level 0 mesh spacing is computed for each direction by dividing the :ref:`domain length<InputsTable_domain>` by the
+number of cells. The mesh spacing is **required** to be the same in all directions:
+
+.. math::
+
+   \frac{\text{prob_hi[0] - prob_lo[0]}}{\text{n_cell[0]}}
+   = \frac{\text{prob_hi[1] - prob_lo[1]}}{\text{n_cell[1]}}
+   = \frac{\text{prob_hi[2] - prob_lo[2]}}{\text{n_cell[2]}}
+
+
+The inputs for defining the mesh for a single-level simulation are demonstrated in the
+:ref:`following example<inputs_mesh_ex>` and  illustrated in :numref:`fig_basic_mesh_ex`.
+In this example, the domain is a :math:`4 \times 1 \times 1` cuboid, and there are
+:math:`32 \times 8 \times 8` cells in the *X*, *Y*, and *Z* directions, respectively.
+The result is a uniform mesh spacing of :math:`0.125` *m* in all three directions.
+
+.. _inputs_mesh_ex:
+
+.. code-block:: bash
+   :caption: Snippet of intpus for mesh example. This is not a complete input file.
+
+   # Define periodicity and domain extents
+   # -------------------------------------------------------------
+   geometry.coord_sys   =  0           # Cartesian coordinates
+   geometry.is_periodic =  0   0   0   # periodicity for each direction
+   geometry.prob_lo     =  0.  0.  0   # lo corner of physical domain.
+   geometry.prob_hi     =  4.  1.  1.  # hi corner of physical domain
+
+   # Define the maximum level of refinement and number of cells
+   # -------------------------------------------------------------
+   arm.max_level = 0
+   arm.n_cell = 32  8  8
+
+
+.. _fig_basic_mesh_ex:
+
+.. figure:: ./images/geometry/mesh_lev0_ex.png
+   :width: 100%
+   :align: center
+   :alt: domain used with box embedded boundary
+
+   Example of a single-level mesh.
+
+.. warning::
+
+   MFIX-Exa simulations with a non-uniform mesh will not run.
+
+
+-----
+
+
+.. rubric:: Mesh refinement
+
+
+The following inputs are defined using the ``arm`` prefix. These inputs control the automatic
+mesh refinement algorithm and are only applicable when ``amr.max_level > 0``.
+
++----------------------+-----------------------------------------------------------------------+-------------+-----------+
+|                      | Description                                                           |   Type      | Default   |
++======================+=======================================================================+=============+===========+
 | grid_eff             | Threshold value to ensure grids do not contain too large a fraction   |    Real     |  0.7      |
 |                      | of un-tagged cells.                                                   |             |           |
-|                      | Applicable only when mesh refinement is enabled ("max_level" > 0).    |             |           |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
-| n_error_buf          | Controls the number of tagged cells before grids are defined. Used    |    Int      |  1        |
+| n_error_buf          | Controls the number of tagged cells before grids are defined. Used    |    int      |  1        |
 |                      | to ensure coarse/fine boundaries are not too close to tagged cells.   |             |           |
-|                      | Applicable only when mesh refinement is enabled ("max_level" > 0).    |             |           |
-+----------------------+-----------------------------------------------------------------------+-------------+-----------+
-| n_cell               | Number of cells at level 0 in each coordinate direction               |    Ints     | 0 0 0     |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
 
+.. caution::
+
+   **Mesh refinement limitations:**
+
+   * MFIX-Exa simulations with particles currently do not support mesh refinement.
+
+See the `AMReX documentation <https://amrex-codes.github.io/amrex/docs_html/index.html>`_
+for details on the adaptive mesh refinement algorithms.
+
+-----
+
 Grids
-^^^^^
+-----
 
-The following inputs must be preceded by "amr." and determine how we create the grids.
+The following inputs are defined using the ``arm`` prefix.
 
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
 |                      | Description                                                           |   Type      | Default   |
 +======================+=======================================================================+=============+===========+
-| max_grid_size_x      | Maximum number of cells at level 0 in each grid in x-direction        |    Int      | 32        |
+| max_grid_size_x      | Maximum number of cells at level 0 in each grid in *X*                |    int      | 32        |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
-| max_grid_size_y      | Maximum number of cells at level 0 in each grid in y-direction        |    Int      | 32        |
+| max_grid_size_y      | Maximum number of cells at level 0 in each grid in *Y*                |    int      | 32        |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
-| max_grid_size_z      | Maximum number of cells at level 0 in each grid in z-direction        |    Int      | 32        |
+| max_grid_size_z      | Maximum number of cells at level 0 in each grid in *Z*                |    int      | 32        |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
-| blocking_factor_x    | Each grid must be divisible by blocking_factor_x in x-direction       |    Int      |  8        |
+| blocking_factor_x    | Each grid in *X* must be divisible by ``blocking_factor_x``           |    int      |  8        |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
-| blocking_factor_y    | Each grid must be divisible by blocking_factor_y in y-direction       |    Int      |  8        |
+| blocking_factor_y    | Each grid in *Y* must be divisible by ``blocking_factor_y``           |    int      |  8        |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
-| blocking_factor_z    | Each grid must be divisible by blocking_factor_z in z-direction       |    Int      |  8        |
+| blocking_factor_z    | Each grid in *Z* must be divisible by ``blocking_factor_z``           |    int      |  8        |
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
 
+The domain is decomposed into *grids* by dividing the number of cells by the max grid size
+for each direction (e.g., ``n_cells[0]/max_grid_size_x``). The blocking factor ensures that
+the grids will be sufficiently coarsenable for good multigrid performance; therefore, the
+``max_grid_size`` must be divisible by the corresponding ``blocking_factor``.
 
-The following inputs must be preceded by "particles."
+.. note::
+
+   The `AMReX documentation <https://amrex-codes.github.io/amrex/docs_html/index.html>`_ contains a significant
+   amount of information on grid creation and load balancing. Users are **strongly** encouraged to
+   read the relevant sections.
+
+
+-----
 
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-|                      | Description                                                           |   Type      | Default      |
-+======================+=======================================================================+=============+==============+
-| max_grid_size_x      | Maximum number of cells at level 0 in each grid in x-direction        |    Int      | 32           |
-|                      | for grids in the ParticleBoxArray if dual_grid is true                |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| max_grid_size_y      | Maximum number of cells at level 0 in each grid in y-direction        |    Int      | 32           |
-|                      | for grids in the ParticleBoxArray if dual_grid is true                |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| max_grid_size_z      | Maximum number of cells at level 0 in each grid in z-direction        |    Int      | 32           |
-|                      | for grids in the ParticleBoxArray if dual_grid is true.               |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
 
 Tiles
-^^^^^
+-----
 
-The following inputs must be preceded by "fabarray_mfiter." and determine how we create the logical tiles:
+The following inputs are defined using the ``fabarray`` prefix.
 
 +----------------------+-----------------------------------------------------------------------+----------+-------------+
 |                      | Description                                                           | Type     | Default     |
 +======================+=======================================================================+==========+=============+
-| tile_size            | Maximum number of cells in each direction for (logical) tiles         | IntVect  | 1024000     |
-|                      |        (3D CPU-only)                                                  |          | 1024000,8,8 |
+| mfiter_tile_size     | Maximum number of cells in each direction for (logical) tiles         | IntVect  | 1024000,8,8 |
 +----------------------+-----------------------------------------------------------------------+----------+-------------+
 
-The following inputs must be preceded by "particles."
+The following inputs are defined using the ``particles`` prefix.
 
 +----------------------+-----------------------------------------------------------------------+-------------+--------------+
 |                      | Description                                                           |   Type      | Default      |
@@ -87,69 +156,9 @@ The following inputs must be preceded by "particles."
 |                      | in the ParticleBoxArray if dual_grid is true.                         |             |              |
 +----------------------+-----------------------------------------------------------------------+-------------+--------------+
 
+When running on shared memory machines using an OpenMP enabled executable, *grids* are subdivided into
+*tiles* and iterated over to improve data locality by cache blocking.
 
-Load balancing
---------------
-
-Note that when running a granular simulation, i.e., no fluid phase, :cpp:`mfix.dual_grid` must be 0. Hence,
-the :cpp:`particles.max_grid_size` (in each direction) have no meaning. Therefore the fluid grid and tile
-sizes should be set for particle load balancing. It may also be necessary to set the blocking factors to 1.
-
-
-The following inputs must be preceded by "mfix." and determine how we load balance:
-
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-|                      | Description                                                           |   Type      | Default      |
-+======================+=======================================================================+=============+==============+
-| dual_grid            | If true then use the "dual_grid" approach for load balancing          |  Bool       | False        |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| regrid_int           | How often to regrid (in number of steps at level 0)                   |   Int       |    -1        |
-|                      | if regrid_int = -1 then no regridding will occur                      |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| load_balance_fluid   | Only relevant if (dual_grid); if so do we also regrid mesh data       |  Int        | 0            |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| load_balance_type    | What strategy to use for load balancing                               |  String     | KnapSack     |
-|                      | Options are "KnapSack", "SFC", or "Greedy"                            |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| knapsack_weight_type | What weighting function to use if using Knapsack load balancing       |  String     | RunTimeCosts |
-|                      | Options are "RunTimeCosts" or "NumParticles""                         |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| knapsack_nmax        | Maximum number of grids per MPI process if using knapsack algorithm   |  Int        | 128          |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| greedy_dir           | The direction in which the greedy algorithm cuts overloaded boxes     |  Int        | 0            |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| greedy_min_grid_size | The minimum particle grid size in the greedy load balance algorithm   |  Int        | 2            |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| greedy_3d            | Partition particle grids in 3D with the greedy algorithhm             |  Bool       | False        |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| overload_tolerance*  | The ratio between the maximum workload and the average workload       |  Real       | 1.2          |
-|                      | in the greedy algorithm                                               |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| underload_tolerance* | The ratio between the minimum workload and the average workload       |  Real       | 0.8          |
-|                      | in the greedy algorithm                                               |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-| grid_pruning         | Remove all covered grids from the base mesh; this may result in       |  Bool       | False        |
-|                      | disjoined grids                                                       |             |              |
-+----------------------+-----------------------------------------------------------------------+-------------+--------------+
-
-\* The greedy partitioning algorithm uses the tolerances to set the expected
-workload range for each rank but doesn't strictly enforce them. The algorithm
-creates a partition as balance as possible even if the partition doesn't
-meet the tolerances.
-
-To allow a user to verify the breakdown of fluid grids created before running a full simulation, an input option,
-:cpp:`mfix.only_print_grid_report` is supported. By default, it is :cpp:`False`. When set to :cpp:`True`, the run uses
-minimal memory to print the grid coverage report and exits immediately after that.
-
-
-.. todo:: Is this needed anymore?
-
-   The following inputs must be preceded by "particles."
-
-   +----------------------+-----------------------------------------------------------------------+-------------+--------------+
-   |                      | Description                                                           |   Type      | Default      |
-   +======================+=======================================================================+=============+==============+
-   | reduceGhostParticles | Remove unused ghost particles from communication,                     |    Bool     | True         |
-   |                      | only supported by GPU build                                           |             |              |
-   +----------------------+-----------------------------------------------------------------------+-------------+--------------+
+.. note::
 
+   MFIX-Exa disables tiling when using GPU accelerators.
-- 
GitLab


From 3a7b4bfc6e2d94a50e927ad65d9513dfff8ddd04 Mon Sep 17 00:00:00 2001
From: Jordan Musser <jordan.musser@netl.doe.gov>
Date: Mon, 29 Jan 2024 10:10:16 -0500
Subject: [PATCH 2/2] Add wdf corrections

---
 .../source_docs/user_guide/inputs/advanced.rst | 18 +++++++++---------
 .../user_guide/inputs/mesh_and_gridding.rst    |  6 +++---
 2 files changed, 12 insertions(+), 12 deletions(-)

diff --git a/docs/source_docs/user_guide/inputs/advanced.rst b/docs/source_docs/user_guide/inputs/advanced.rst
index ea67991..b650fdc 100644
--- a/docs/source_docs/user_guide/inputs/advanced.rst
+++ b/docs/source_docs/user_guide/inputs/advanced.rst
@@ -37,15 +37,15 @@ GPU memory
 
 The following inputs must be preceded by "amrex."
 
-+------------------------+-----------------------------------------------------------------------+-------------+--------------+
-|                        | Description                                                           |   Type      | Default      |
-+========================+=======================================================================+=============+==============+
-| the _arena_is_managed  |  Abort if an invalid floating point exception is encountered.         |  Int        | 0            |
-+------------------------+-----------------------------------------------------------------------+-------------+--------------+
-| abort_on_out_of_gpu_memory | rt if a division by zero is computed.                             |  Int        | 0            |
-+------------------------+-----------------------------------------------------------------------+-------------+--------------+
-| the_area_init_size     |  Abort if and overflow is detected.                                   |  Int        | 0            |
-+------------------------+-----------------------------------------------------------------------+-------------+--------------+
++----------------------------+-----------------------------------------------------------------------+-------------+--------------+
+|                            | Description                                                           |   Type      | Default      |
++============================+=======================================================================+=============+==============+
+| the_arena_is_managed       | Abort if an invalid floating point exception is encountered.          |  Int        | 0            |
++----------------------------+-----------------------------------------------------------------------+-------------+--------------+
+| the_arena_init_size        | Abort if and overflow is detected.                                    |  Int        | 0            |
++----------------------------+-----------------------------------------------------------------------+-------------+--------------+
+| abort_on_out_of_gpu_memory | Abort if a division by zero is computed.                              |  Int        | 0            |
++----------------------------+-----------------------------------------------------------------------+-------------+--------------+
 
 
 Load balancing
diff --git a/docs/source_docs/user_guide/inputs/mesh_and_gridding.rst b/docs/source_docs/user_guide/inputs/mesh_and_gridding.rst
index 62805ad..82fbc73 100644
--- a/docs/source_docs/user_guide/inputs/mesh_and_gridding.rst
+++ b/docs/source_docs/user_guide/inputs/mesh_and_gridding.rst
@@ -8,7 +8,7 @@ Mesh
 
 .. rubric:: Level-0 mesh
 
-The following inputs are defined using the ``arm`` prefix.
+The following inputs are defined using the ``amr`` prefix.
 
 .. _InputsTable_mesh:
 
@@ -76,7 +76,7 @@ The result is a uniform mesh spacing of :math:`0.125` *m* in all three direction
 .. rubric:: Mesh refinement
 
 
-The following inputs are defined using the ``arm`` prefix. These inputs control the automatic
+The following inputs are defined using the ``amr`` prefix. These inputs control the automatic
 mesh refinement algorithm and are only applicable when ``amr.max_level > 0``.
 
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
@@ -103,7 +103,7 @@ for details on the adaptive mesh refinement algorithms.
 Grids
 -----
 
-The following inputs are defined using the ``arm`` prefix.
+The following inputs are defined using the ``amr`` prefix.
 
 +----------------------+-----------------------------------------------------------------------+-------------+-----------+
 |                      | Description                                                           |   Type      | Default   |
-- 
GitLab