Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Aurora config #2593

Open
wants to merge 638 commits into
base: master
Choose a base branch
from
Open

Aurora config #2593

wants to merge 638 commits into from

Conversation

oksanaguba
Copy link
Contributor

@oksanaguba oksanaguba commented Nov 6, 2023

this branch will be eventually reworked into a branch with clean history

@oksanaguba oksanaguba changed the title WIP WIP spot config Nov 6, 2023
@E3SM-Bot
Copy link
Collaborator

E3SM-Bot commented Nov 7, 2023

Status Flag 'Pre-Test Inspection' - Auto Inspected - Inspection is Not Necessary for this Pull Request.

@E3SM-Bot
Copy link
Collaborator

E3SM-Bot commented Nov 7, 2023

Status Flag 'Pull Request AutoTester' - Failure: Timed out waiting for job SCREAM_PullRequest_Autotester_Mappy to start: Total Wait = 1803

  • Other jobs have been previously started - We must stop them...

@oksanaguba oksanaguba changed the title WIP spot config [WIP] spot config Jun 17, 2024
@oksanaguba
Copy link
Contributor Author

oksanaguba commented Jul 16, 2024

ERS test passing

~/ess/ERS_Ln9.ne4pg2_ne4pg2.F2010-SCREAMv1.sunspot-pvc_oneapi-ifxgpu.scream-sunspot_run.C.20240715_234719_xed0zr 
onguba@uan-0001:~/runhomme/for-cprnc/utils/cime/CIME/non_py/cprnc> pwd
/home/onguba/runhomme/for-cprnc/utils/cime/CIME/non_py/cprnc
onguba@uan-0001:~/runhomme/for-cprnc/utils/cime/CIME/non_py/cprnc> ./cprnc /home/onguba/ess/ERS_Ln9.ne4pg2_ne4pg2.F2010-SCREAMv1.sunspot-pvc_oneapi-ifxgpu.scream-sunspot_run.C.20240715_234719_xed0zr/run/ERS_Ln9.ne4pg2_ne4pg2.F2010-SCREAMv1.sunspot-pvc_oneapi-ifxgpu.scream-sunspot_run.C.20240715_234719_xed0zr.cpl.hi.0001-01-01-32400.nc.base /home/onguba/ess/ERS_Ln9.ne4pg2_ne4pg2.F2010-SCREAMv1.sunspot-pvc_oneapi-ifxgpu.scream-sunspot_run.C.20240715_234719_xed0zr/run/ERS_Ln9.ne4pg2_ne4pg2.F2010-SCREAMv1.sunspot-pvc_oneapi-ifxgpu.scream-sunspot_run.C.20240715_234719_xed0zr.cpl.hi.0001-01-01-32400.nc.rest
 source ~/ess/ERS_Ln9.ne4pg2_ne4pg2.F2010-SCREAMv1.sunspot-pvc_oneapi-ifxgpu.scream-sunspot_run.C.20240715_234719_xed0zr/.env_mach_specific.sh 

@oksanaguba oksanaguba changed the title [WIP] spot config Aurora config Sep 6, 2024
Copy link
Member

@rljacob rljacob left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

You should use/modify the aurora/sunspot configs added in E3SM-Project/E3SM#6553. This creates too many machine definitions and doesn't follow the conventions we've established for machines with cpu/gpu nodes.

@oksanaguba
Copy link
Contributor Author

@rljacob yes, though it may be a subsequent PR after this one goes in (but i will rename machines using scream suffix).

@rljacob
Copy link
Member

rljacob commented Sep 17, 2024

This PR can not go in as is. And adding "scream" is also not a solution.

@oksanaguba
Copy link
Contributor Author

i cannot yet clean this branch till homme sycl changes are in scream repo.

@bartgol bartgol removed the AT: WIP label Nov 1, 2024
@bartgol bartgol force-pushed the master branch 2 times, most recently from 7114e81 to 579850c Compare November 8, 2024 23:44
@bartgol bartgol force-pushed the master branch 3 times, most recently from 6318f41 to db5b35d Compare November 13, 2024 21:48
grnydawn and others added 13 commits November 25, 2024 14:06
* Changed from Core/24.07 to Core/24.00
* Changed from cmake/3.27.9 to cmake/3.21.3
* Restored '--allow-shlib-undefined --allow-multiple-definition' in CMAKE_EXE_LINKER_FLAGS
* Removed 'craype-accel-amd-gfx90a' module load for amdclanggpu_frontier
* Added two mosart source files into NOOPT_FILES to workaround to prevent optcg crayclanggpu build error

[BFB] No baseline for Frontier yet.
Remove a removed kokkos command line flag not applicable anymore
Addresses the print statements from each rank

[bfb]
	modified:   tech-guide/orodrag.md
	modified:   user-guide/namelist_parameters.md
[BFB]
mahf708 and others added 30 commits January 11, 2025 15:45
The issue was simply that we were grabbing a ref to an element in a vector that was later resized, invalidating the ref.
The class is in charge of reading in time-dependent datasets from input file(s), interpolate to current model time, and possibly do horizontal/vertical interpolation
Adds a vertical contraction utility equivalent to einsum('k,...k->...', weight, field) or einsum('ik,i...k->i...', weight, field)
The relaxed slab ocean (RSO) is implemented as a simpler alternative to the traditional slab ocean
mode that requires externally generated heat flux data that represents the effects of ocean transport
on ocean surface temperature. In RSO mode we want to relax to a target SST field using only a
specified mixed layer depth to allow things like cold wakes following a tropical cyclone.

Also add documentation for this mode of the data ocean model.

[BFB]
This PR brings in a new feature that (1) increases accuracy of semi-Lagrangian
tracer transport's trajectory calculations and (2) permits flexible trade-off
between the trajectory accuracy and speed. This PR has the following parts:

- F90 dycore support, with unit tests (principally in sl_advection.F90);
- C++ dycore support, with unit tests (principally in
  ComposeTransportImplEnhancedTrajectory.cpp);
- unit test driver updates: compose_ut.cpp;
- two new standalone-Homme tests, one each for the F90 and C++ dycores;
- new standalone-Homme transport test module for convergence testing: fully 3D,
  space-and-time-dependent surface pressure (dcmip2012_test1_conv_mod.F90);
- CIME-based ERS tests, one each, for EAM and EAMxx;
- cleanup of a timer issue orthogonal to this PR: see commit 'Hommexx: Rework
  skipping timers in first step.';
- updates to Homme machine files for Perlmutter;
- fix C++ dycore's handling of prescribed winds: had to move down in the call
  stack to match the F90 dycore.

e3sm_developer and e3sm_atm_integration pass on Chrysalis. EAMxx test suites
pass on Perlmutter and Frontier. There are no performance effects when the
stealth feature is off based on tests on Chrysalis (v3.LR 11-year control run),
Frontier, and Perlmutter.

[non-BFB] due to two new CIME tests, two new standalone-Homme tests, and two
modified standalone-Homme tests; otherwise BFB.
This brings in changes to mam4xx's wet_dep.hpp that correct the size of a work array
to fix out-of-bounds errors that have led to recent fails for the mam4_wetscav_standalone test.
Fixes a compilation error that is most likely just a compiler parsing issue. Still, the change is innocuous.
Set testing max ranks to 2 on ghci-snl-cuda
Do constant extrap rather than linear ramp down to 0 at p=0
This PR changes the test suite names and unifies
the testmods under eamxx.

[BFB]
Do constant extrapolation rather than linear ramp down to 0 (for p=0).

[non-BFB] for EAMxx
This PR changes the test suite names and unifies the testmods under
eamxx.

We'll have to remember to update E3SM_test_scripts once this is
in. The dashboard will also need adjustments.

This PR also adds a couple eamxx v1 tests to e3sm_developer including a prod test.

[BFB]

* jgfouca/begin_to_descream:
  Fix cime-nml test that was using old testmod dir
  eammx-prod only works with pg2 grids
  Begin to de-scream things (use eamxx instead)
Add MALI ocean extrapolation and fix to RK scheme

This PR updates the version of MALI from the MALI-Dev fork and primarily
brings in two important features needed for Greenland coupling work:
* ability to extrapolate ocean thermal forcing from a masked region of
  valid ocean data to the current MALI ice extent
* a critical bug fix for the RK timestepping scheme required for it to
  work correctly
* changes to handling of masks in the RK and FCT schemes to keep them
  consistent
Also included are some changes to the subglacial hydrology model, which
is not currently used in E3SM simulations.

[NML] for configurations with MALI
[non-BFB] for configurations with MALI
 Conflicts:
	cime_config/machines/cmake_macros/oneapi-ifxgpu_aurora.cmake
	components/eam/src/physics/rrtmgp/external
	components/eamxx/src/physics/rrtmgp/scream_rrtmgp_interface.hpp
	components/homme/cmake/HommeMacros.cmake
	components/homme/src/preqx_kokkos/cxx/cxx_f90_interface_preqx.cpp
	components/homme/src/share/cxx/ExecSpaceDefs.cpp
	components/homme/src/share/cxx/SimulationParams.hpp
	components/homme/src/theta-l_kokkos/cxx/cxx_f90_interface_theta.cpp
	components/homme/src/theta-l_kokkos/prim_driver_mod.F90
	externals/ekat
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.