Semantic Support for Tables using RDF Record Table

M.L.I. Wigham, H. Rijgersberg, Martine de Vos, J.L. Top

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Tabular datasets are common in many domains, for example science and engineering. These are often not very well specified, and are therefore hard to understand and use. Semantic standards are available to express the meaning and context of the data. However, present standards have their limitations in expressing heterogeneous datasets with several types of measurements, missing data, and irregular structures. Such datasets are abundant in everyday life. We propose the RDF (Resource Description Framework) Record Table vocabulary for semantically modelling tabular data, as a supplement to the existing RDF Data Cube standard. RDF Record Table has a nested structure of records that contain self-describing observations, and is able to cope with irregular, missing and unexpected data. This allows it to escape the constraints of RDF Data Cube and to model complex data, such as that occurring in science and engineering. We demonstrate our Excel add-in for transforming data into the Record Table format. We propose a general approach to integrating tabular data in RDF, and confirm this approach by implementing integration support in the add-in and evaluating this in industrial use cases. This semantic support for tables helps researchers and data analysts to get the most out of available quantitative data with a minimum of effort.
Original languageEnglish
Pages (from-to)128-144
Number of pages1
JournalInternational Journal on Advances in Intelligent Systems
Volume8
Issue number1 and 2
Publication statusPublished - 2015

Fingerprint

Data description
Semantics
Data structures

Cite this

@article{a9f81fc148cc4c7f810e6b9c26b2e8e4,
title = "Semantic Support for Tables using RDF Record Table",
abstract = "Tabular datasets are common in many domains, for example science and engineering. These are often not very well specified, and are therefore hard to understand and use. Semantic standards are available to express the meaning and context of the data. However, present standards have their limitations in expressing heterogeneous datasets with several types of measurements, missing data, and irregular structures. Such datasets are abundant in everyday life. We propose the RDF (Resource Description Framework) Record Table vocabulary for semantically modelling tabular data, as a supplement to the existing RDF Data Cube standard. RDF Record Table has a nested structure of records that contain self-describing observations, and is able to cope with irregular, missing and unexpected data. This allows it to escape the constraints of RDF Data Cube and to model complex data, such as that occurring in science and engineering. We demonstrate our Excel add-in for transforming data into the Record Table format. We propose a general approach to integrating tabular data in RDF, and confirm this approach by implementing integration support in the add-in and evaluating this in industrial use cases. This semantic support for tables helps researchers and data analysts to get the most out of available quantitative data with a minimum of effort.",
author = "M.L.I. Wigham and H. Rijgersberg and {de Vos}, Martine and J.L. Top",
year = "2015",
language = "English",
volume = "8",
pages = "128--144",
journal = "International Journal on Advances in Intelligent Systems",
issn = "1942-2679",
publisher = "ThinkMind",
number = "1 and 2",

}

Semantic Support for Tables using RDF Record Table. / Wigham, M.L.I.; Rijgersberg, H.; de Vos, Martine; Top, J.L.

In: International Journal on Advances in Intelligent Systems, Vol. 8, No. 1 and 2, 2015, p. 128-144.

Research output: Contribution to journalArticleAcademicpeer-review

TY - JOUR

T1 - Semantic Support for Tables using RDF Record Table

AU - Wigham, M.L.I.

AU - Rijgersberg, H.

AU - de Vos, Martine

AU - Top, J.L.

PY - 2015

Y1 - 2015

N2 - Tabular datasets are common in many domains, for example science and engineering. These are often not very well specified, and are therefore hard to understand and use. Semantic standards are available to express the meaning and context of the data. However, present standards have their limitations in expressing heterogeneous datasets with several types of measurements, missing data, and irregular structures. Such datasets are abundant in everyday life. We propose the RDF (Resource Description Framework) Record Table vocabulary for semantically modelling tabular data, as a supplement to the existing RDF Data Cube standard. RDF Record Table has a nested structure of records that contain self-describing observations, and is able to cope with irregular, missing and unexpected data. This allows it to escape the constraints of RDF Data Cube and to model complex data, such as that occurring in science and engineering. We demonstrate our Excel add-in for transforming data into the Record Table format. We propose a general approach to integrating tabular data in RDF, and confirm this approach by implementing integration support in the add-in and evaluating this in industrial use cases. This semantic support for tables helps researchers and data analysts to get the most out of available quantitative data with a minimum of effort.

AB - Tabular datasets are common in many domains, for example science and engineering. These are often not very well specified, and are therefore hard to understand and use. Semantic standards are available to express the meaning and context of the data. However, present standards have their limitations in expressing heterogeneous datasets with several types of measurements, missing data, and irregular structures. Such datasets are abundant in everyday life. We propose the RDF (Resource Description Framework) Record Table vocabulary for semantically modelling tabular data, as a supplement to the existing RDF Data Cube standard. RDF Record Table has a nested structure of records that contain self-describing observations, and is able to cope with irregular, missing and unexpected data. This allows it to escape the constraints of RDF Data Cube and to model complex data, such as that occurring in science and engineering. We demonstrate our Excel add-in for transforming data into the Record Table format. We propose a general approach to integrating tabular data in RDF, and confirm this approach by implementing integration support in the add-in and evaluating this in industrial use cases. This semantic support for tables helps researchers and data analysts to get the most out of available quantitative data with a minimum of effort.

M3 - Article

VL - 8

SP - 128

EP - 144

JO - International Journal on Advances in Intelligent Systems

JF - International Journal on Advances in Intelligent Systems

SN - 1942-2679

IS - 1 and 2

ER -