BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Europe/Stockholm
X-LIC-LOCATION:Europe/Stockholm
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:19700329T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=-1SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:19701025T030000
RRULE:FREQ=YEARLY;BYMONTH=10;BYDAY=-1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20210916T132447Z
LOCATION:Henry Dunant
DTSTART;TZID=Europe/Stockholm:20210709T110000
DTEND;TZID=Europe/Stockholm:20210709T113000
UID:submissions.pasc-conference.org_PASC21_sess188_pap129@linklings.com
SUMMARY:Progress Towards Accelerating the Unified Model on Hybrid Multi-Co
 re Systems
DESCRIPTION:Paper\n\nProgress Towards Accelerating the Unified Model on Hy
 brid Multi-Core Systems\n\nZhang, Xu, Evans, Norman, Morales-Hernandez...\
 n\nThe cloud microphysics scheme, CASIM, and the radiation scheme, SO
 CRATES, are the two computationally intensive parts within the Met Of
 fice’s Unified Model (UM). This study enables CASIM and SOCRATE
 S to use accelerated multi-core systems for optimal computational per
 formance of the UM. Using profiling to guide our efforts, we refactor
 ed the code for optimal threading and kernel arrangement and implemen
 ted OpenACC directives manually or through the CLAW source-to-source 
 translator. Initial porting results achieved 10.02x and 9.25x speedup
  in CASIM and SOCRATES respectively on 1 GPU compared with 1 CPU core
 . A granular performance analysis of the strategy and bottlenecks is
  discussed. These improvements will enable UM to run on heterogeneous
  computers and a path forward for further improvements is provided.\n
 \nDomain: CS and Math, Climate and Weather
END:VEVENT
END:VCALENDAR
