dbpedia – Parerga und Paralipomena http://www.michelepasin.org/blog At the core of all well-founded belief lies belief that is unfounded - Wittgenstein Fri, 30 Nov 2018 16:48:44 +0000 en-US hourly 1 https://wordpress.org/?v=5.2.11 13825966 Exploring scholarly publications using DBPedia concepts: an experiment http://www.michelepasin.org/blog/2018/11/23/exploring-scholarly-publications-via-dbpedia/ Fri, 23 Nov 2018 17:18:48 +0000 http://www.michelepasin.org/blog/?p=3254 This post is about a recent prototype I developed, which allows to explore a sample collection of Springer Nature publications using subject tags automatically extracted from DBPedia.

DBpedia is a crowd-sourced community effort to extract structured content from the information created in various Wikimedia projects. This structured information resembles an open knowledge graph (OKG) which is available for everyone on the Web.

Datasets

The dataset I used is the result of a collaboration with Beyza Yaman, a researcher working with the DBpedia team in Leipzig, who used the SciGraph datasets as input to the DBPedia-Spotlight entity-mining tool.

By using DBPedia-Spotlight we automatically associated DBpedia subjects terms to a  subset of abstracts available in the SciGraph dataset (around 90k abstract from 2017 publications).

The prototype allows to search the Springer Nature publications using these subject terms.

Also, DBpedia subjects include definitions and semantic relationships (which we are currently not using, but one can imagine how they could be raw material for generating more thematic ‘pathways’).

Results: serendipitous discovery of scientific publications

The results are pretty encouraging: despite the fact that the concepts extracted sometimes are only marginally relevant (or not relevant at all), the breadth and depth of the DBpedia classification makes the interactive exploration quite interesting and serendipitous.

You can judge for yourself: the tool is available herehttp://hacks2019.michelepasin.org/dbpedialinks

The purpose of this prototype is to evaluate the quality of the tagging and generate ideas for future applications. So any kind of feedback or ideas is very welcome!

We are working with Beyza to write up the results of this investigation as a research paper. The data and software is already freely available on github.

A couple of screenshots:

Eg see the topic ‘artificial intelligence

Screen Shot 2018-11-23 at 17.15.07.png

One can add more subjects to a search in order to ‘zoom in’ into a results set, eg by adding ‘China’ to the search:

Screen Shot 2018-11-23 at 17.16.38

Implementation details

 

 

]]>
3254
DBpedia rocks http://www.michelepasin.org/blog/2007/09/12/dbpedia-rocks/ Wed, 12 Sep 2007 09:39:24 +0000 http://people.kmi.open.ac.uk/mikele/blog/?p=250 It’s not the only semweb repository out there, but for sure it’s the more interesting. The whole wikipedia has been translated into RDF and made queryable through SPARQL.. lots of potential mashups waiting to be discovered! At the moment i’m looking at integrating the philosophy KB i’ve created with information from there… but I hope there’ll be time to experiment too…

 

]]>
250