BEGIN:VCALENDAR
VERSION:2.0
PRODID:Linklings LLC
BEGIN:VTIMEZONE
TZID:Europe/Stockholm
X-LIC-LOCATION:Europe/Stockholm
BEGIN:DAYLIGHT
TZOFFSETFROM:+0100
TZOFFSETTO:+0200
TZNAME:CEST
DTSTART:19700329T020000
RRULE:FREQ=YEARLY;BYMONTH=3;BYDAY=-1SU
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:+0200
TZOFFSETTO:+0100
TZNAME:CET
DTSTART:19701025T030000
RRULE:FREQ=YEARLY;BYMONTH=10;BYDAY=-1SU
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTAMP:20220812T074334Z
LOCATION:Osaka Room
DTSTART;TZID=Europe/Stockholm:20220627T160000
DTEND;TZID=Europe/Stockholm:20220627T163000
UID:submissions.pasc-conference.org_PASC22_sess137_msa123@linklings.com
SUMMARY:A FAIR Digital Object-Based Data Lake Architecture to Support Vari
 ous User Groups and Scientific Domains
DESCRIPTION:Minisymposium\n\nA FAIR Digital Object-Based Data Lake Archite
 cture to Support Various User Groups and Scientific Domains\n\nNolte\n\nAc
 ross various domains\, data lakes are successfully used to centrally st
 ore all of an organization's data in its raw format. This promises hi
 gh reusability of the stored data\, since a schema is applied on read\, w
 hich prevents information loss due to ETL (Extract\, Transform\, Load) pr
 ocesses. Despite this schema-on-read approach\, some modeling is mandat
 ory to ensure proper data integration\, comprehensibility\, and qual
 ity. These data models are maintained in a central\, queryable data cat
 alog. To further organize the data in the data lake\, different archite
 ctures have been proposed\, the most widely known being the zone archi
 tecture\, in which data is assigned to different zones according to it
 s degree of processing. This talk presents a novel data lake architec
 ture based on FAIR (Findable\, Accessible\, Interoperable\, Reusable) Di
 gital Objects (FDOs) with (high-performance) processing capabilities. T
 hese FDOs abstract away the handling of the underlying mass storage an
 d databases\, thereby enforcing a homogeneous state while offering fla
 t yet easily comprehensible research data management. The FDOs are co
 nnected by a provenance-centered graph. Users can define generic work
 flows\, which are reproducible by design\, making this d
 ata lake implementation ideally suited for science.\n\nDomain: Computer Sc
 ience and Applied Mathematics
END:VEVENT
END:VCALENDAR
