Nov 2014


ISWC14 paper: a hybrid semantic publishing architecture combining XML and RDF

Here is a short summary of the paper I presented at the latest International Semantic Web Conference (ISWC14) in Riva del Garda, together with my colleague Tony Hammond.

The presentation focused on a hybrid data architecture (XML for storage and querying, RDF for modeling and integration). This emerged as the most practical solution while we were re-engineering the publishing platform at our company (Macmillan S&E) over the last few years.

This is the abstract:

This paper presents recent work carried out at Macmillan Science and Education in evolving a traditional XML-based, document-centric enterprise publishing platform into a scalable, thing-centric and RDF-based semantic architecture. Performance and robustness guarantees required by our online products on the one hand, and the need to support legacy architectures on the other, led us to develop a hybrid infrastructure in which the data is modelled throughout in RDF but is replicated and distributed between RDF and XML data stores for efficient retrieval. A recently launched product – dynamic pages for scientific subject terms – is briefly introduced as a result of this semantic publishing architecture.

The paper is available online; slides from the presentation can be found below.

The ISWC industry track was packed with interesting papers so I think it's worth taking a look at the online proceedings. The uptake of tech outside academia is always revealing of the many real-world difficulties involved in making something fit within pre-existing work practices and legacy technologies. This is especially true of larger companies, where investment in older technologies (and in people who know about them) can be considerable, hence upgrades are costly and need to be evaluated more carefully.

This is the sort of background that led my colleagues and me at Macmillan to opt for a hybrid solution, one that combines the power of an established enterprise MarkLogic installation with more cutting-edge data integration approaches based on RDF.
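To make the idea a bit more concrete, here is a minimal sketch of the replication step, in which data modelled as RDF-style triples is flattened into one XML record per resource for a document store. This is only an illustration and not our production code: the URIs, the `record`/`property` element names and the `triples_to_xml` function are all assumptions made up for this example.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# Hypothetical triples describing one subject term (all URIs are invented).
TRIPLES = [
    ("http://example.org/subjects/photonics", "rdf:type", "ex:Subject"),
    ("http://example.org/subjects/photonics", "skos:prefLabel", "Photonics"),
    ("http://example.org/subjects/photonics", "skos:broader",
     "http://example.org/subjects/optics"),
]


def triples_to_xml(triples):
    """Group triples by subject and return {subject_uri: xml_string}."""
    by_subject = defaultdict(list)
    for subject, predicate, obj in triples:
        by_subject[subject].append((predicate, obj))

    docs = {}
    for subject, pairs in by_subject.items():
        # One XML record per resource; element names are illustrative only.
        record = ET.Element("record", {"about": subject})
        for predicate, obj in pairs:
            prop = ET.SubElement(record, "property", {"name": predicate})
            prop.text = obj
        docs[subject] = ET.tostring(record, encoding="unicode")
    return docs


if __name__ == "__main__":
    for uri, xml_doc in triples_to_xml(TRIPLES).items():
        print(uri)
        print(xml_doc)
```

The point of the sketch is simply that the RDF model stays the source of truth, while the XML records are a derived copy shaped for fast retrieval.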

Subject pages were one of the first products built on top of this architecture, and many more will come. We're still heavily involved in this work, so stay tuned for more in this space.

Soon, we will also be releasing our public ontologies online and making available a new and improved version of the datasets.

Cite this blog post:

Michele Pasin. ISWC14 paper: a hybrid semantic publishing architecture combining XML and RDF. Blog post. Published on Nov. 25, 2014.

See also:


paper  Fitting Personal Interpretation with the Semantic Web: lessons learned from Pliny

Digital Humanities Quarterly, Jan 2017. Volume 11 Number 1


paper  Insights into Nature’s Data Publishing Portal

The Semantic Puzzle (online interview), Apr 2016.


paper  Fitting Personal Interpretations with the Semantic Web

Digital Humanities 2013, University of Nebraska–Lincoln, Jul 2013.


paper  Semantic Web Approaches in Digital History: an Introduction

Lecture slides from the course on digital history, part of the Master's in Digital Humanities at King's College London, Oct 2011.


paper  AquaLog: An ontology-driven question answering system for organizational semantic intranets

Journal of Web Semantics, Sep 2007. Vol. 5, 2, (72-105), Elsevier

paper  PhiloSURFical: browse Wittgensteinʼs Tractatus with the Semantic Web

Wittgenstein and the Philosophy of Information - Proceedings of the 30th International Ludwig Wittgenstein Symposium, Kirchberg, Austria, Aug 2007. pp. 319-335

paper  Supporting Philosophers’ Work through the Semantic Web: Ontological Issues

Fifth International Workshop on Ontologies and Semantic Web for E-Learning (SWEL-07), held in conjunction with AIED-07, Marina Del Rey, California, USA, Jul 2007.


paper  A Task Based Approach to Support Situating Learning for the Semantic Web

International Workshop on Applications of Semantic Web Technologies for E-Learning (SWEL-06), held in conjunction with Adaptive Hypermedia 2006, Dublin, Ireland, Jun 2006.

paper  Paving the way towards the e-humanities: a Semantic Web approach to support the learning of philosophy

Poster paper presented at the 3rd European Semantic Web Conference (ESWC-06), Budva, Montenegro, Jun 2006.


paper  Semantic Learning Narratives

International Workshop on Applications of Semantic Web Technologies for E-Learning (SWEL-05), held in conjunction with KCAP-05, Banff, Canada, Oct 2005.

paper  AquaLog: An Ontology-portable Question Answering Interface for the Semantic Web

2nd European Semantic Web Conference (ESWC05), Heraklion, Crete, Greece, May 2005. pp. 546-562