Course unit

Interoperability of data and semantics

Last updated: 18/08/2022


Course Director(s):


General Description:

This course is taught in English because it is a Teaching Unit of the International MSc on Cyber Physical and Social Systems (CPS2): AI and IoT.

This course aims to give students an understanding of the data interoperability issues that arise when existing applications are integrated or when systems evolve. Students will understand and use the main data formats, query languages, and schema definition languages. They will understand the importance of controlled vocabularies and ontologies, and of relaxing data schemas so that additional information can be injected into documents. The course combines theory and practice on the following topics:

  • Generalities: data engineering and the data value chain, ISO/IEC definitions of interoperability, standards and standard-based interoperability
  • Encoding base data types: character encoding, computer number formats, colors, endianness, Base32 and Base64 encoding
  • Data formats: generalities, file types, delimiter-separated values, Extensible Markup Language, JavaScript Object Notation, YAML Ain't Markup Language, lightweight markup languages, compressed formats, multimedia formats
  • Data schemas and semantics: XML Schema, JSON Schema, Heterogeneities and data conflicts, Controlled vocabularies and ontologies, Resource Description Framework, RDFa: Rich structured data markup for web documents, JSON-LD: JSON for Linking Data, Data integration and mediation
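
To give a flavour of the "Encoding base data types" topic listed above, here is a minimal Python sketch. It is not part of the official course material. The sample string and all variable names are illustrative assumptions. It shows how the same text or number can have several byte representations, which is one source of interoperability problems between systems.

```python
import base64
import struct

# Illustrative sample value (an assumption, not taken from the course).
text = "café"

# Character encoding: the same string yields different bytes per encoding.
utf8_bytes = text.encode("utf-8")      # 'é' is encoded on two bytes
latin1_bytes = text.encode("latin-1")  # 'é' is encoded on one byte

# Endianness: the same 32-bit unsigned integer in two byte orders.
value = 1
big_endian = struct.pack(">I", value)     # most significant byte first
little_endian = struct.pack("<I", value)  # least significant byte first

# Base64: arbitrary bytes represented as ASCII text, then decoded back.
encoded = base64.b64encode(utf8_bytes).decode("ascii")
decoded = base64.b64decode(encoded).decode("utf-8")

print(len(utf8_bytes), len(latin1_bytes))
print(big_endian.hex(), little_endian.hex())
print(encoded, decoded)
```

A receiver that assumes the wrong character encoding or byte order will read different data from the same bytes, which is why formats and standards usually state these choices explicitly.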

Key words:

data format semantic interoperability

Number of teaching hours


Fields of study

Computer Science, Information Systems

Teaching language

English

Intended learning outcomes

On completion of the unit, the student will be able to: | Classification level | Priority
Know the main data formats | 1. Knowledge | Essential
Understand and use the different datatypes | 3. Apply | Important
Use the data query languages for the main data formats | 3. Apply | Important
Use the data schema description languages | 3. Apply | Important
Understand the importance of using standardized vocabularies | 5. Summarise | Useful

Learning assessment methods

Percentage ratio of individual assessment | Percentage ratio of group assessment
Written exam: 50 % | Project submission: %
Individual oral exam: % | Group presentation: %
Individual presentation: % | Group practical exercise: %
Individual practical exercise: % | Group report: %
Individual report: 50 % |
Other(s): % |

Programme and content

Type of teaching activity | Content, sequencing and organisation