On Generating SHACL Shapes from Collective Collection of Plant Trait Data

Authors: Saleh, Dadan Ridwan; Kartika, Yulia Aris; Akbar, Zaenal; Krisnadhi, Adila Alfa; Fatriasari, Widya
Information
Journal: ACM International Conference Proceeding Series
Publisher: Association for Computing Machinery
Pages: 326-330
Publication Year: 2022
ISBN: 978-145039791-9
Source Type: Scopus
Abstract
Collective data collection has become common in various domains, including biodiversity science. Multiple individuals work on the same biological samples or specimens, using various scientific tools to measure different characteristics. Moreover, the measurements are typically governed by different data collection procedures and protocols. Integrating the data and guaranteeing its quality has therefore become a significant issue. One solution is to adopt the RDF (Resource Description Framework) data model in combination with a language for validating RDF graphs, such as SHACL (Shapes Constraint Language). The RDF data model provides flexibility in accommodating multiple data schemas, while SHACL uses sets of conditions, so-called shapes, to validate RDF data graphs. The remaining challenge is an effective method for defining SHACL shapes that can be used to validate any given RDF data. This work introduces a semi-automatic, database-driven solution for generating SHACL shapes. The solution relies on the database's internal structure and the values of its data items. The solution was applied to a traits database of natural fiber plants in Indonesia, where a large number of individual shapes were successfully generated. Furthermore, a qualitative evaluation indicated that the shapes were of appropriate quality. This work contributes to increasing the quality of biodiversity data collections, which has become an essential factor in Big Biodiversity Data processing. © 2022 ACM.
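
To illustrate the general idea of deriving SHACL shapes from a database's structure and data values, here is a minimal sketch. It is not the authors' method, which the abstract does not detail. The table and column names, the `ex:` namespace, the SQL-to-XSD type mapping, and the rule of taking value ranges from the observed minimum and maximum are all assumptions made for this example.

```python
# Hypothetical column metadata, standing in for what a database catalog
# might expose. Table and column names are illustrative assumptions.
SCHEMA = {
    "table": "PlantTrait",
    "columns": [
        {"name": "speciesName", "sql_type": "VARCHAR", "nullable": False},
        {"name": "fiberLength", "sql_type": "FLOAT", "nullable": True},
    ],
}

# Hypothetical observed values per column, used for value-based constraints.
VALUES = {"fiberLength": [1.2, 3.5, 2.8]}

# Assumed mapping from SQL column types to XSD datatypes.
XSD_TYPES = {"VARCHAR": "xsd:string", "FLOAT": "xsd:double", "INTEGER": "xsd:integer"}

PREFIXES = (
    "@prefix sh: <http://www.w3.org/ns/shacl#> .\n"
    "@prefix xsd: <http://www.w3.org/2001/XMLSchema#> .\n"
    "@prefix ex: <http://example.org/> .\n\n"
)


def property_shape(col, values):
    """Build one sh:property block from column metadata and observed values."""
    datatype = XSD_TYPES[col["sql_type"]]
    lines = [f"    sh:path ex:{col['name']} ;", f"    sh:datatype {datatype} ;"]
    if not col["nullable"]:
        # A NOT NULL column becomes a mandatory property.
        lines.append("    sh:minCount 1 ;")
    if values:
        # The observed range becomes an inclusive bound (an assumed heuristic).
        lines.append(f'    sh:minInclusive "{min(values)}"^^{datatype} ;')
        lines.append(f'    sh:maxInclusive "{max(values)}"^^{datatype} ;')
    return "  sh:property [\n" + "\n".join(lines) + "\n  ]"


def node_shape(schema, values):
    """Render a SHACL node shape for one table as a Turtle string."""
    props = [property_shape(c, values.get(c["name"], [])) for c in schema["columns"]]
    header = (
        f"ex:{schema['table']}Shape a sh:NodeShape ;\n"
        f"  sh:targetClass ex:{schema['table']} ;\n"
    )
    return PREFIXES + header + " ;\n".join(props) + " .\n"


shape_ttl = node_shape(SCHEMA, VALUES)
print(shape_ttl)
```

A shape produced this way would still need review by a domain expert, which is one reason such approaches are usually described as semi-automatic: ranges taken from existing data may be too tight or may encode measurement errors.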