BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Europe/Stockholm
X-LIC-LOCATION:Europe/Stockholm
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:19700308T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=-1SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:19701101T020000
RRULE:FREQ=YEARLY;BYMONTH=10;BYDAY=-1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20220812T074334Z
LOCATION:Foyer 2nd Floor
DTSTART;TZID=Europe/Stockholm:20220628T090000
DTEND;TZID=Europe/Stockholm:20220628T110000
UID:submissions.pasc-conference.org_PASC22_sess181_pos137@linklings.com
SUMMARY:P24 - A FAIR Digital Object-Based Data Lake Architecture to Suppor
 t Various User Groups and Scientific Domains
DESCRIPTION:Poster\n\nP24 - A FAIR Digital Object-Based Data Lake Architec
 ture to Support Various User Groups and Scientific Domains\n\nNolte, Kaspr
 zak, Kunkel, Wieder\n\nAcross various domains, data lakes are successfully
  utilized to centrally store all data of an organization in their raw form
 at. Doing this with overarching governance for all the collected data and 
 the developed processes prevents the creation of isolated Data Silos, whic
 h can quickly arise if small research teams operate independently of each 
 other. Having a central Data Lake, however, promises high reusability of t
 he stored data since a schema is implied on reading, which prevents an inf
 ormation loss due to ETL processes. Despite this schema-on-read approach, 
 some modeling is mandatory to ensure proper data integration, comprehensib
 ility, and quality. These data models are maintained within a central data
  catalog which can be queried. To further organize the data in the data la
 ke, different architectures have been proposed, like the most widely known
  zone architecture. Here, data is assigned to different zones according to
  the processing they were subjected to. In this work, we present a novel d
 ata lake architecture based on FAIR Digital Objects (FDO) with (high-perfo
 rmance) processing capabilities. The FAIR Digital Objects are connected by
  a provenance-centered graph. Users can define generic workflows, which ar
 e reproducible by design, making this data lake implementation ideally sui
 ted for science.
END:VEVENT
END:VCALENDAR
