BEGIN:VCALENDAR VERSION:2.0 PRODID:Linklings LLC BEGIN:VTIMEZONE TZID:Europe/Stockholm X-LIC-LOCATION:Europe/Stockholm BEGIN:DAYLIGHT TZOFFSETFROM:+0100 TZOFFSETTO:+0200 TZNAME:CEST DTSTART:19700308T020000 RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=-1SU END:DAYLIGHT BEGIN:STANDARD TZOFFSETFROM:+0200 TZOFFSETTO:+0100 TZNAME:CET DTSTART:19701101T020000 RRULE:FREQ=YEARLY;BYMONTH=10;BYDAY=-1SU END:STANDARD END:VTIMEZONE BEGIN:VEVENT DTSTAMP:20210916T132456Z LOCATION: DTSTART;TZID=Europe/Stockholm:20210706T173000 DTEND;TZID=Europe/Stockholm:20210706T190000 UID:submissions.pasc-conference.org_PASC21_sess182_post131@linklings.com SUMMARY:P16 - MARS: Mesh Adaptive Refinement for Supercomputing DESCRIPTION:Poster\n\nP16 - MARS: Mesh Adaptive Refinement for Supercomput ing\n\nGanellari, Zulian, Fink, Fadel, Cumming...\n\nMARS is an open-sourc e mesh management library designed to handle N-dimensional elements (N < ;= 4). MARS is developed in C++ and makes use of template meta-progra mming to have compile time dimensions of elements and vectors, thus allowi ng for both compile time performance optimizations and concise and reusabl e code.

The main features of MARS consist of:

1.mesh generation
2.adaptive mesh refinement using bisection algorithms
3.conforming mesh data-structure
4.mesh quality estimators to study the output of different mesh-refinement strategies
5.performance por table algorithms and data-structures targetting different accelerators.

MARS targets multi-core CPUs and GPUs using the C++ Kokkos progra mming model. The mesh is entirely constructed and stored on the devic e (GPUs). This enables libraries using MARS to perform further o perations directly on the device, avoiding going through the host. Current ly, MARS supports as its performance portable, parallel, adaptive refineme nt based algorithm the LEPP (Longest edge propagation path) from Rivara. M esh generation is fully supported in parallel. As an example, 143 million Hex8 elements can be generated on a single node in just 0.86 sec.

A distributed memory, parallel implementation based on MPI is ongoing w ork, and forest of octrees and space filling curves algorithms for efficie nt mesh partitioning are being planned. END:VEVENT END:VCALENDAR