The Italian government recently announced a new portal containing data on public spending: http://soldipubblici.gov.it. This is obviously great news; however, the website is still in beta, so in what follows I'd like to put forward a few (hopefully constructive) comments and suggestions for how it could and should be developed further.
Incidentally, I recently ran into Ian Makgill from spendnetwork.com/, a London startup funded by the Open Data Institute that aims to use open public data to create the first comprehensive and publicly available repository for government transaction data.
We ended up chatting about the situation with open government data in Italy. To be honest, I'm no expert on the matter, but a couple of names quickly came to mind.
First, the excellent OpenPolis association. Their mission is to enable free access to public information on political candidates, elected representatives, and legislative activity, thus promoting transparency and the democratic participation of Italian citizens.
One of their most successful projects is Open Parliament (similar in scope to theyworkforyou.com in the UK). More recently, the Open Bilanci platform was created to allow citizens to search and compare the budgets and expenses of municipalities (local boroughs) in Italy.
Second, the ongoing work done by the Open Knowledge Foundation, which also has an Italian chapter. For example, one of its long-standing projects, openspending.org, contains references to several datasets about Italy's public spending.
Another useful resource is the Italia open data census, a community-driven initiative to compare the progress made by different cities and local areas in releasing open data.
It should be clear by now that the people behind soldipubblici.gov.it are not the only ones working to increase transparency and democracy by releasing open data.
What's not clear at all, though, is whether these different groups are talking to each other - which would seem like the most obvious thing to do before embarking on a new enterprise like this. This is especially important since soldipubblici.gov.it is strikingly similar (in scope) to the aforementioned Open Bilanci portal. I'm sure that both the folks at the OKFN and openpolis.org would be interested in getting their hands on this data to integrate it with their existing services.
Nonetheless, it's great to hear that more is happening in this space. Even more so because it's the Italian government who's taking responsibility for it (as it should be).
That's the good news. The bad news is that, from a data perspective, there isn't much you can do with soldipubblici.gov.it in this beta release. If one wants to put this data to use, there are various key elements missing, I believe. Here are a few ideas:
There is no way to browse or review the data. Search is good, but if you have no idea what to search for (e.g., simply because you don't know what it's called), then you're fundamentally stuck. The system actually features a more advanced 'semantic' search, which essentially augments the scope of the keywords you enter via synonyms and related terms. That's nice, but it's also no substitute for a good old-fashioned yellow-pages-like category browser. You know, just to get a sense of what's in the box before opening it.
You can't download the data. To be fair, the FAQs clearly state that this feature is still being worked on. Fine - I guess they're talking about some nifty mechanism to select, collect, and download specific datasets one is interested in. However, I do wonder why one cannot download the entire dataset already. At the end of the day, that data is A) already made available via the current user interface; B) public and (in theory) already available on a different website called SIOPE (though "available" is a big word - I should probably say buried).
The visualization app is nice but very limited. Data doesn't become information unless you give it some meaningful context. This tool is a great idea, but it would be enormously more useful if you could decide yourself what to plot on the graph (e.g., which years, which datasets) depending on your research questions - that is, your context. Moreover, you want to be able to make comparisons between different datasets, etc. All things that Open Bilanci already does pretty well.
There's no data about the beneficiaries of public expenses. I'm not sure what the challenges are here, or whether this is feasible at all. But it would be great to have this extra piece of information, for transparency's sake. For example, on spendnetwork.com you can easily see the list of suppliers for the London Borough of Ealing expenses.
This is an inspiring start, and I can't wait to see it developed further - especially if it gets developed with real end users in mind! To that end, it's useful to bring up what the OKFN has declared to be the key features of data openness:
- Availability and access: The data must be available as a whole and at no more than a reasonable reproduction cost, preferably by downloading over the internet. The data must also be available in a convenient and modifiable form.
- Reuse and redistribution: The data must be provided under terms that permit reuse and redistribution, including intermixing with other datasets. The data must be machine-readable.
- Universal participation: Everyone must be able to use, reuse, and redistribute — there should be no discrimination against fields of endeavor or against persons or groups. For example, 'non-commercial' restrictions that would prevent 'commercial' use, or restrictions of use for certain purposes (e.g., only in education), are not allowed.
Personally, I can't stress strongly enough how useful it would be to have access to the raw data. Excel, CSV, a REST API, or even better, a Linked Data API. A data-level access point would turn this nice-looking but essentially siloed website into an open resource that thousands of data journalists or data scientists (of any kind) could build upon.
Are you looking forward to seeing this happen? I certainly am!
Cite this blog post:
Comments via Github:
2025
paper Enhancing the Accessibility of ORCID Public Data, now additionally hosted on Google BigQuery
4th International Conference on the Science of Science and Innovation, Copenhagen, Denmark, Jun 2025.
2019
paper Modeling publications in SN SciGraph 2012-2019
Workshop on Scholarly Digital Editions, Graph Data-Models and Semantic Web Technologies, Université de Lausanne, Jun 2019.
2013
New Technologies in Medieval and Renaissance Studies, (forthcoming). (part of the 'Envisioning REED in the Digital Age' collection)
2010
paper Review of Interontology conference 2010
Humana Mente, Journal of Philosophical Studies, 13, May 2010. Issue 13