Projects per year
Abstract
Environmental timeseries data variety is exploding in the Internet of Things era, making data reuse a very demanding task. Data acquisition and integration remains a laborious step of the environmental data lifecycle. Environmental data heterogeneity is a persistent issue, as data are becoming available through different protocols and stored under diverse, custom formats. In this work, we deal with syntactic heterogeneity in environmental timeseries data. Our approach is based on describing different dataset syntaxes using abstract representations, called templates. We designed and implemented EDAM (Environmental Data Acquisition Module), a template framework that facilitates timeseries data acquisition and integration. EDAM templates are written using programming language-agnostic semantics, and can be reused both for input and output, thus enabling data reuse via transformations across different formats. We demonstrate EDAM generality in seven case studies, which involve scraping online data, extracting observations from a relational database, or aggregating historical timeseries stored in local files. Case studies span different environmental sciences domains, including meteorology, agriculture, urban air quality and hydrology. We also demonstrate EDAM for data dissemination, as instructed by output templates. We identified several syntactic interoperability challenges though the case studies, that include managing with differences in formatting observables, temporal and spatial references, and metadata documentation, and addressed them with EDAM. EDAM implementation has been released under an open-source license.
Original language | English |
---|---|
Pages (from-to) | 237-249 |
Journal | Environmental Modelling and Software |
Volume | 117 |
DOIs | |
Publication status | Published - 1 Jul 2019 |
Keywords
- Big data
- Data acquisition
- Environmental timeseries
- Internet of things
- Syntactic interoperability
- Templates
Fingerprint
Dive into the research topics of 'A template framework for environmental timeseries data acquisition'. Together they form a unique fingerprint.Projects
- 1 Finished
-
Environmental data and model interoperability in the internet of things
Samourkasidis, A. (PhD candidate), Tekinerdogan, B. (Promotor) & Athanasiadis, I. (Co-promotor)
19/01/16 → 14/10/19
Project: PhD