BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Europe/Stockholm
X-LIC-LOCATION:Europe/Stockholm
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=-1SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=10;BYDAY=-1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20220812T074334Z
LOCATION:Foyer 2nd Floor
DTSTART;TZID=Europe/Stockholm:20220628T090000
DTEND;TZID=Europe/Stockholm:20220628T110000
UID:submissions.pasc-conference.org_PASC22_sess181_pos140@linklings.com
SUMMARY:P51 - Efficient Discrete Cosine and Polynomial Transforms on GPUs 
 using VkFFT
DESCRIPTION:Poster\n\nP51 - Efficient Discrete Cosine and Polynomial Trans
 forms on GPUs using VkFFT\n\nTolmachev, Jackson, Marti, Castiglioni, Ganel
 lari\n\nThis poster will focus on the latest advancements in the field of 
 fast GPU algorithms for various types of discrete transforms. We present a
 n extension to VkFFT - GPU Fast Fourier Transform library for Vulkan, CUDA
 , HIP and OpenCL, that allows calculating Discrete Cosine Transforms of ty
 pes I-IV. They are often used in image processing, data compression and nu
 merous scientific tasks, like calculating various discrete transformations
  on Chebyshev grids. So far, this is the first publicly available optimize
 d GPU implementation of DCTs. We also present our advances in the GPU impl
 ementation of efficient spherical harmonic transforms and radial transform
 s in a spherical geometry. We will present Jones-Worland and Associated Le
 gendre Polynomial Transforms for modern GPU architectures, implemented bas
 ed on the VkFFT runtime kernel optimization model. These new implementatio
 ns will be used to create a GPU-enabled version of the fully spectral CFD 
 framework QuICC in spherical geometry.
END:VEVENT
END:VCALENDAR
