BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Europe/Stockholm
X-LIC-LOCATION:Europe/Stockholm
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:19700329T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=-1SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:19701025T030000
RRULE:FREQ=YEARLY;BYMONTH=10;BYDAY=-1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20210916T132456Z
LOCATION:
DTSTART;TZID=Europe/Stockholm:20210706T173000
DTEND;TZID=Europe/Stockholm:20210706T190000
UID:submissions.pasc-conference.org_PASC21_sess182_post151@linklings.com
SUMMARY:P25 - Implementation of the Performance-Portable ICON Model
DESCRIPTION:Poster\n\nP25 - Implementation of the Performance-Portable ICO
 N Model\n\nSawyer, Lapillonne, Alexeev, Dietlicher, Kornblueh...\n\nThe IC
 ON modeling framework is a unified numerical weather and climate model use
 d for operational numerical weather prediction as well as low- and high-re
 solution climate projection. It utilizes the Message-Passing Interface (MP
 I) for domain decomposition and has been extensively optimized for OpenMP 
 multi-threading. The model was recently adapted for GPU systems in order t
 o run high-resolution multi-year climate simulations at 2.8 km on the CSCS
  Piz Daint heterogeneous supercomputer. Because of the low compute intensi
 ty of atmospheric models and the cost of data transfer between CPU and
  GPU,
  it is necessary that all components within the time integration be ported
  to the accelerator. For the dynamics, most physical parameterizations and
  infrastructure code, OpenACC compiler directives are used. For the soil
  pa
 rameterization, a Fortran-based domain-specific language "CLAW-DSL" is uti
 lized. We discuss the challenges associated with porting this large commun
 ity code, as well as the run-time and I/O challenges of simulations on lar
 ge-scale systems at 2.8 km horizontal resolution. We also present many of 
 the optimizations implemented for GPUs and high-resolution simulations, su
 ch as asynchronous I/O, GPU-to-GPU communication, asynchronous execution o
 f kernels, data compression on GPUs, as well as performance comparisons of
  the full model on CPU and GPU.
END:VEVENT
END:VCALENDAR
