How to Design FDO Profiles and Kernel Attributes?

An Investigation Along Processing, Grammars and Automata

Authors

DOI:

https://doi.org/10.52825/ocp.v5i.1392

Keywords:

FAIR Digital Objects, Types, JSON, Context Free Grammars, Automata

Abstract

A major goal for FDOs is to enable machine readability, interpretability and actionability for data and metadata of digital objects. This paper examines, how this can be achieved at the level of processing, grammars, derived languages and push-down automata, and describes formal requirements for the definition of FDO records.

After a description of FDOs and what requirements are necessary that they can be processed by machines the importance of types for the processing of bitstreams as well as key-value-pairs in attributes is highlighted and the different options to represent values in attributes of FDOs are investigated.

Machine actionability describes the knowledge of machines about how to process the object. This requires to process the grammars behind the languages that can be used for attribute definitions to determine the type of the value and the grammars that generate these languages. A way out of a possibly unlimited need for different machines able to read different languages is presented on the level of grammars.

Even if these findings are on an abstract level, the outcome has direct consequences for the way how FDO records, data types or profiles have to be defined, which standardization agreements are needed and thus how attributes, types or profiles have to be implemented. It can be seen as a theoretical guideline for standarization and for implementation.

Downloads

Download data is not yet available.

References

I. Anders et al., “Fdo forum fdo requirement specifications,” version 3.0, Zenodo, Mar. 2023. DOI: 10.5281/zenodo.7782262. [Online]. Available: https://doi.org/10.5281/zenodo.7782262.

D. L. Parnas, J. E. Shore, and D. Weiss, “Abstract types defined as classes of variables,” Proceedings of the 1976 conference on Data : Abstraction, definition and structure, pp. 149–154, 1976. DOI: 10.1145/800237.807133.

C. Blanchi, M. Hellström, L. Lannom, A. Pfeil, U. Schwardmann, and P. Wittenburg, “Implementation of Attributes, Types, Profiles and Registries,” Apr. 2024. DOI: 10.5281/zenodo.7825572. [Online]. Available: https://doi.org/10.5281/zenodo.7825572.

“Fdo forum tsig working group: Machine rules for accessing fdos,” Accessed: Jun. 19, 2024. [Online]. Available: https://docs.google.com/document/d/1oR1_YZSL6OJbZh4ozoDowD8x3G23MshRX5q8hLnFiQM/edit.

Parquet. “Overview,” Accessed: Jun. 19, 2024. [Online]. Available: https://parquet.apache.org/docs/overview/.

Avro. “Documentation,” Accessed: Jun. 19, 2024. [Online]. Available: https://avro.apache.org/docs/.

C. Weiland, S. Islam, D. Broeder, I. Anders, and P. Wittenburg, “Fdo machine actionability,” version 2.2, Zenodo, Apr. 2023. DOI: 10.5281/zenodo.7825650. [Online]. Available: https://doi.org/10.5281/zenodo.7825650.

J. E. Hopcroft and J. D. Ullman, Introduction to Automata Theory, 1st. 1979, 3 [HMU06], ISBN: 020102988X.

L. Floridi, The Fourth Revolution - How the Infosphere is Reshaping Human Reality. OxfordUniversity Press, 2014, p. 266, ISBN: 9780199606726.

S. Greibach, “A new normal-form theorem for context-free phrase structure grammars,”Journal of the ACM. 12 (1), pp. 42–52, 1965. DOI: 10.1145/321250.321254.

CORDRA. “Homepage,” Accessed: Jun. 19, 2024. [Online]. Available: https://cordra.org.

L. Lannom, D. Broeder, and G. Manepalli, “Rda data type registries working group output,”Zenodo, Aug. 2018. DOI: 10.15497/A5BCD108-ECC4-41BE-91A7-20112FF77458. [Online].Available: https://doi.org/10.15497/A5BCD108-ECC4-41BE-91A7-20112FF77458.

M.-A. Baazizi, D. Colazzo, G. Ghelli, and C. Sartiani, “Parametric schema inference formassive json datasets,” The VLDB Journal 28, pp. 497–521, 2019. DOI: 10.1007/s00778-018-0532-7.

U. Schwardmann, “Automated schema extraction for PID information types,” in 2016 IEEE International Conference on Big Data (Big Data), IEEE, Dec. 2016. DOI: 10.1109/bigdata.2016.7840957. [Online]. Available: https://hdl.handle.net/21.11101/0000-0002-a987-7.

“Epic data type registry: Web interface,” Accessed: Jun. 19, 2024. [Online]. Available: https://dtr-pit.pidconsortium.net.

T. Weigel et al., “Rda research data collections wg recommendations,” version 1.0, Zenodo, Dec. 2018. DOI: 10.15497/RDA00022. [Online]. Available: https://doi.org/10.15497/RDA00022.

“Epic data type registry for testing: Web interface,” Accessed: Jun. 19, 2024. [Online]. Available: https://dtr-test.pidconsortium.net.

“Epic type api: Web interface,” Accessed: Jun. 19, 2024. [Online]. Available: https://typeapi.lab.pidconsortium.net/.

Published

2025-03-18

How to Cite

Schwardmann, U. (2025). How to Design FDO Profiles and Kernel Attributes? An Investigation Along Processing, Grammars and Automata. Open Conference Proceedings, 5. https://doi.org/10.52825/ocp.v5i.1392
Received 2024-06-21
Accepted 2025-02-15
Published 2025-03-18