Data Formats in DSP-API

As explained in "What Is DSP and DSP-API (previous Knora)?", the DSP stores data in a small number of formats that are suitable for long-term preservation while facilitating data reuse.

The following is a non-exhaustive list of data formats and how their content can be stored and managed by DSP-API:

| Original Format | Format in DSP |
|---|---|
| Text (XML, LaTeX, Microsoft Word, etc.) | Knora resources (RDF) containing Standoff/RDF |
| Tabular data, including relational databases | Knora resources |
| Data in tree or graph structures | Knora resources |
| Images (JPEG, PNG, etc.) | JPEG 2000 files stored by Sipi |
| Audio and video files | Audio and video files stored by Sipi (in archival formats to be determined) |
| PDF | Can be stored by Sipi, but data reuse is improved by extracting the text for storage as Standoff/RDF |
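
The mapping in the table above can be read as a simple lookup from the original format to the way DSP stores it. The following is only an illustrative sketch: the function name `storage_for`, the dictionary `STORAGE_BY_EXTENSION`, and the choice of file extensions are assumptions made for this example and are not part of DSP-API or Sipi.

```python
from pathlib import Path

# Hypothetical lookup table mirroring the rows of the table above.
# The extensions chosen per row are illustrative assumptions,
# not an official list from DSP-API.
STORAGE_BY_EXTENSION = {
    ".xml": "Knora resources (RDF) containing Standoff/RDF",
    ".tex": "Knora resources (RDF) containing Standoff/RDF",
    ".docx": "Knora resources (RDF) containing Standoff/RDF",
    ".csv": "Knora resources",
    ".jpg": "JPEG 2000 files stored by Sipi",
    ".jpeg": "JPEG 2000 files stored by Sipi",
    ".png": "JPEG 2000 files stored by Sipi",
    ".pdf": "Stored by Sipi; text ideally extracted as Standoff/RDF",
}


def storage_for(filename: str) -> str:
    """Return the storage description for a file, based on its extension."""
    suffix = Path(filename).suffix.lower()  # normalise case, e.g. ".PNG" -> ".png"
    return STORAGE_BY_EXTENSION.get(suffix, "not covered by this sketch")


if __name__ == "__main__":
    for name in ["letter.xml", "scan.PNG", "report.pdf", "notes.txt"]:
        print(f"{name}: {storage_for(name)}")
```

Audio and video files are left out of the sketch on purpose, since the table notes that their archival formats are still to be determined.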

Last update: 2021-06-09