Semantic Technologies for Semantic Container

The vast amount of data being produced everyday requires semantics to provide meanings to the data. The recent advances of Semantic Web technologies provides us with standard data formats (i.e., Resource Description Framework – RDF), vocabularies (e.g., PROV-O for provenance) and tools (e.g., SPARQL for querying data) to add semantics to data and structuring the metadata. This use of semantics would in turn allows advances data analysis and processing to the data and its metadata.

Semantics for Diabetes Data

The MyPCH project aims to develop methods and tools to allow diabetes patients to share their data using Semantic Container. As part of the process, we provide a mean to transform the raw JSON data of diabetes patients’ observation data into RDF. The transformation is conducted in two steps: (i) Ontology definition, and (ii) Raw data transformation

We define a MyPCH ontology for the patient’s observation data based on the HL7-FHIR ontology for patient observation. The ontology is designed to maintain a high-degree of compliance with the original FHIR model while reducing the needs for data duplications required in the original FHIR-RDF model. An excerpt of the adapted ontology is shown in the figure below.

Figure: the FHIR-based ontology for the MyPCH project (click to enlarge)

As the next step, we utilize the RML mapping and CaRML engine to transform the raw JSON data into its RDF. The current version of RML mapping is available in Semantic Container. The resulted RDF representation of patients’ data would allow further analysis of the data using SPARQL queries (see also the tutorial for using these technologies in the semcon/sc-sparql container and the following dataflow with concrete diabetes examples). Furthermore, it would allow inferring additional knowledge based on the underlying knowledge with logical reasoning as well as integration of additional background knowledge (e.g., information about devices used for diabetes monitoring) and external knowledge, e.g., WikiData and DBPedia.

Extending SPECIAL Vocabularies for MyPCH user consent

Semantic Containers utilizes RDF-based vocabularies to represent its metadata. It adopts the standard Provenance Ontology (W3C PROV-O) to represent its provenance information and SPECIAL vocabularies to represent user consent, in addition to a custom RDF vocabularies to represent container’s metadata. We have extended the user consent vocabularies (SPECIAL vocabularies) with a set of additional classes specific for Semantic Container use cases using a specific namespace scp: <>. We utilize the vocabulary to automatically check the compliance between user consent of their data and the possible usage of the data by a data processor.

In the MyPCH project, we extend it further with three additional classes for diabetes patients’ data. The classes are assigned as specific types of SPECIAL health data category (svd:Health). These classes are for now scp:Diabetes (i.e., the generic diabetes patients’ data), scp: DiabetesSensor (i.e., insulin data observation from sensors), and scp:InsulinPump (i.e., insulin data observation provided by insulin pumps).

Personium Collaboration

Personium and OwnYourData are proud to announce a partnership for collaborating in Personal Data Store interoperability.

During the MyData 2019 conference (September 2019) in Helsinki, several PDS (Personal Data Store) innovators had come together and discussed openly regarding the interoperability between different PDSs. After the conference, Christoph Fabianek (OwnYourData) and Salman Farmanfarmaian (Freezr) contributed a first draft of the implementation document: CEPS – Common Endpoint for Personal data Stores.

The goal of the initiative is to allow users of different PDSs (right now: Freezr, Personium, Datafund, Data Vault) to use the same app (i.e., data processing capabilities) and switch between data stores. In a first step a simple app (Tally Zoo: maintain a tally chart for frequent tasks and collect your everyday data) was implemented that can connect natively to the individual PDSs. Subsequently, common interfaces for authorization and common operations like read/write should be harmonized across participating organizations to decouple data store development and app development.

Feel free to contact Dixon or Christoph if you have any questions or want to join the effort. We expect to showcase results in the upcoming MyData 2020 events.

Milestone 3/3 – Summer

Semantic Containers for Data Mobility has reached all project goals by June 2019 and is now finalized with the final report. The three prototypes have developed well and Semantic Containers will continue to be used as infrastructure, especially at ZAMG.

As in the previous posts we summarize here our accomplished goals for this last milestone:

Dates, dates, appointments!

  • SemCon was presented at the PyDays 2019 on May 3rd and we submitted a scientific publication at the Semantics 2019 in Karlsruhe.
  • Our final project presentation took place on 17 June 2019 at the MyData Meetup at the WU Vienna (slides are available here). We were happy about a lively participation and lively discussions on the topic “Data as Propery or Human right”. That’s what MeetUps should be like!

  • Another presentation is planned on September 27, where Christoph Fabianek will give a talk at the MyData 2019 with from Denmark. We will continue to work on the topic of Semantic Containers even beyond the end of the funded project period!

In summary, we can look back on 9 successful project months. SemCon was presented around 10 times at events, we had about 60 internal project meetings, we were able to meet our milestones in time and there are 3 prototypes that show how the exchange with data can work simply and safely: by packaging data with usage rights and provenance information together with the necessary processing logic to enable reliable and easy management.