The presentation focused on an hybrid data architecture (XML for storage&querying, RDF for modeling&integration) which emerged as the most practical solution during the process of re-engineering of the publishing platform which has occurred within our company (Macmillan S&E) in the last years.
This is the abstract:
This paper presents recent work carried out at Macmillan Science and Education in evolving a traditional XML-based, document- centric enterprise publishing platform into a scalable, thing-centric and RDF-based semantic architecture. Performance and robustness guarantees required by our online products on the one hand, and the need to support legacy architectures on the other, led us to develop a hybrid infrastructure in which the data is modelled throughout in RDF but is replicated and distributed between RDF and XML data stores for efficient retrieval. A recently launched product – dynamic pages for scientific subject terms – is briefly introduced as a result of this semantic publishing architecture.
The paper is available online; slides from the presentation can be found below.
The ISWC industry track was packed with interesting papers so I think it's worth taking a look at the online proceedings. The uptake of tech outside academia is always revealing of the many real-world difficulties involved in making something fit within pre-existing work practices and legacy technologies. This is especially true of larger companies, where investment in older technologies (and in people who know about them) can be considerable, hence upgrades are costly and need to be evaluated more carefully.
This is the sort of background that led me and my colleagues at MacMillan to opt for a hybrid solution that combines the power of an established enterprise MarkLogic installation with more cutting edge data integration approaches based on RDF.
Nature.com subject pages were one of the first products built on top of this architecture. And many more will come: we're still heavily involved in this work though, so stay tuned for more stuff in this space.
Soon, we will also be releasing our public ontologies online and making available a new and improved version of the nature.com datasets.
Cite this blog post:
Digital Humanities Quarterly, Jan 2017. Volume 11 Number 1
International Semantic Web Conference (ISWC-14), Riva del Garda, Italy, Oct 2014.
Digital Humanities 2013, University of Nebraska–Lincoln, Jul 2013.
Lecture slides from the Course on digital history, part of the master in Digital Humanities at King's College, London., Oct 2011.
LAP LAMBERT Academic Publishing, Aug 2010.
Journal of Web Semantics, Sep 2007. Vol. 5, 2, (72-105), Elsevier
Wittgenstein and the Philosophy of Information - Proceedings of the 30th International Ludwig Wittgenstein Symposium, Kirchberg, Austria, Aug 2007. pp. 319-335
Fifth International Workshop on Ontologies and Semantic Web for E-Learning (SWEL-07), held in conjunction with AIED-07, Marina Del Rey, California, USA, Jul 2007.
International Workshop on Applications of Semantic Web Technologies for E-Learning (SWEL-06), held in conjunction with Adaptive Hypermedia 2006, Dublin, Ireland, Jun 2006.
Poster paper presented at the 3rd European Semantic Web Conference (ESWC-06), Budva, Montenegro, Jun 2006.
International Workshop on Applications of Semantic Web Technologies for E-Learning (SWEL-05), held in conjunction with KCAP-05, Banff, Canada, Oct 2005.
2nd European Semantic Web Conference (ESWC05), Heraklion, Crete, Greece, May 2005. pp. 546-562