Barcelona's push to digitise its urban heritage and public infrastructure records has collided with a problem that archivists, technologists and municipal officials are increasingly unwilling to ignore: thousands of duplicate images clogging the city's civic databases, distorting search results and quietly corroding the credibility of projects worth millions of euros in public funding.
The issue has moved from technical footnote to policy concern over the past six months, as the Ajuntament de Barcelona's Smart City programme and the Institut Municipal d'Informàtica — the city's in-house digital authority — have scaled up their open-data repositories ahead of a 2027 compliance deadline set by the European Commission's open-data directive. Duplicate imagery, particularly in cadastral records and heritage photo archives, has emerged as one of the most stubborn barriers to meeting that deadline.
Why Duplicate Images Have Become a Governance Problem
The core issue is not cosmetic. When an image of a building facade in the Eixample district appears under three different reference codes in the city's patrimoni database, it creates cascading errors — incorrect attribution, inflated asset counts, and flawed urban planning analysis. The Centre de Cultura Contemporània de Barcelona, whose public image archives feed into several municipal projects, has reportedly been working with technical staff to audit duplication rates in shared digital collections, though the full scope of the problem has not been made public.
Professionals at the Escola Superior d'Arxivística i Gestió de Documents, based on the Universitat Autònoma de Barcelona campus in Bellaterra, have described the problem as structural rather than accidental. The root cause lies in how successive digitisation waves — each using different scanning standards and metadata schemas — fed into systems that were never designed to cross-reference one another. Each department uploaded independently. Nobody checked for overlap.
The Barcelona Supercomputing Centre at the Parc de Recerca Biomèdica de Barcelona on Carrer del Doctor Aiguader has been piloting perceptual hashing tools — a technique that generates a fingerprint for each image to detect near-identical copies — as part of a broader computational research collaboration. The technique is already used in commercial content moderation at scale, and advocates argue it could be adapted for civic archive use at relatively low cost.
City Hall Response and What Comes Next
Mayor Jaume Collboni's administration has framed the broader digitisation agenda as a cornerstone of Barcelona's post-pandemic competitiveness strategy, allocating funds through the 2026 municipal budget for infrastructure modernisation across city departments. Officials at the Institut Municipal d'Informàtica have acknowledged in public communications that image data quality is a priority workstream, though specific budget lines for deduplication tooling have not been confirmed in documents reviewed by The Daily Barcelona.
The Museu Nacional d'Art de Catalunya, which manages one of the largest publicly accessible image collections in Catalonia, faces its own version of the problem. Its collections management system contains digitised works from multiple acquisition periods, and curators have flagged that duplicate entries — sometimes differing only in resolution or colour profile — create confusion for researchers using its online portal from Avinguda de la Reina Maria Cristina.
Experts working in the field say the fix is not simply a matter of running software. Governance frameworks matter more. Image deduplication requires decisions about which version of a record is canonical, who holds authority to delete, and how audit trails are preserved — questions that are as much legal and institutional as technical. Under Catalan archival law, deletion of public records requires documented justification, adding another layer of complexity.
The practical upshot for 2026 and into 2027 is that Barcelona's open-data ambitions will be tested by whether city departments can agree on shared metadata standards before the European Commission's compliance window closes. Technical staff at the Institut Municipal d'Informàtica are expected to publish updated data-quality guidelines before the end of the third quarter. For researchers, journalists and urban planners who rely on municipal image databases, the immediate advice from archivists is straightforward: cross-reference any image retrieved from city repositories against at least one secondary source before treating it as authoritative.