Mar 2026
I watched the "Vibe Coding with Open Alex" video recently, and it's been rattling around in my head since.
A researcher who used to need a vendor's tools or a specialist analyst can now point an LLM at a CSV of publications and get something meaningful in an afternoon. NotebookLM, Cursor, ChatGPT with code interpreter - these tools have commoditised a lot of what used to be specialist craft. Knowing how to wrangle data, write the right pandas operations, build a decent visualisation: it's still a skill, but it's no longer a bottleneck.
The community is noticing. People at ISSI or STI who wouldn't have called themselves coders are now shipping notebooks.
The traditional pitch - "we make the data accessible and usable" - is eroding. But it hasn't gone. The remaining advantages are real, just narrower:
The role of a data vendor is genuinely shifting. Clients increasingly don't want a database to query - they want answers and frameworks.
That means becoming more opinionated: providing benchmarks, recommended metrics, pre-classified outputs. Less neutral data pipe, more methodology partner. The value-add is less about retrieval and more about knowing what to measure and why.
Something interesting is also happening with risk. Clients used to question their own capability to analyse data. Now many feel they have that capability - so the concern has shifted to whether the underlying data is trustworthy enough to build on. That's a subtle inversion, and it increases scrutiny on data quality. Pressure for some vendors, opportunity for others.
The vendors who thrive in five years will be the ones who made their data legible to AI workflows, not just to human analysts.
Cite this blog post:
Comments via Github:
2026
2025
paper The Dimensions API: a domain specific language for scientometrics research
Frontiers in Research Metrics and Analytics, Oct 2025. https://doi.org/10.3389/frma.2025.1514938
paper Enhancing the Accessibility of ORCID Public Data, now additionally hosted on Google BigQuery
4th International Conference on the Science of Science and Innovation, Copenhagen, Denmark, Jun 2025.
2023
2022
International Conference on Science, Technology and Innovation Indicators (STI 2022), Granada, Sep 2022.
2019
Second biennial conference on Language, Data and Knowledge (LDK 2019), Leipzig, Germany, May 2019.
2018
2017
paper Data integration and disintegration: Managing Springer Nature SciGraph with SHACL and OWL
Industry Track, International Semantic Web Conference (ISWC-17), Vienna, Austria, Oct 2017.
paper Using Linked Open Data to Bootstrap a Knowledge Base of Classical Texts
WHiSe 2017 - 2nd Workshop on Humanities in the Semantic web (colocated with ISWC17), Vienna, Austria, Oct 2017.
paper Fitting Personal Interpretation with the Semantic Web: lessons learned from Pliny
Digital Humanities Quarterly, Jan 2017. Volume 11 Number 1
2016
paper Insights into Nature’s Data Publishing Portal
The Semantic Puzzle (online interview), Apr 2016.
2015
paper Learning how to become a linked data publisher: the nature.com ontologies portal.
5th Workshop on Linked Science 2015, colocated with ISWC 2015., Bethlehem, USA, Sep 2015.
paper ResQuotes.com: Turn your Notes and Highlights into Research Ideas
Force11 - Research Communications and e-Scholarship conference, Oxford, UK, Jan 2015.
2014
International Semantic Web Conference (ISWC-14), Riva del Garda, Italy, Oct 2014.
2013
New Technologies in Medieval and Renaissance Studies, (forthcoming). (part of the 'Envisioning REED in the Digital Age' collection)
2012
NeDiMaH workshop on ontology based annotation, held in conjunction with Digital Humanities 2012, Hamburg, Germany, Jul 2012.
2011
paper Browsing highly interconnected humanities databases through multi-result faceted browsers
Digital Humanities 2011 , Stanford, USA, Jun 2011.
2010
paper How do philosophers think their own discipline? Reports from a knowledge elicitation experiment
European Philosophy and Computing conference, ECAP10, Munich, Germany, Oct 2010.
paper Data integration perspectives from the London Theatres Bibliography project
Annual Conference of the Canadian Society for Digital Humanities / Société pour l'étude des médias interactifs (SDH-SEMI 2010), Montreal, Canada, Jun 2010.
2009
paper Laying the Conceptual Foundations for Data Integration in the Humanities
Proc. of the Digital Humanities Conference (DH09), Maryland, USA, Jun 2009. pp. 211-215
2007
2006