Barcelona's municipal digital infrastructure is carrying a hidden weight. Across the city's publicly funded databases — from the Institut Municipal d'Informàtica to the photographic holdings of the Arxiu Fotogràfic de Barcelona on Plaça de Pons i Clerch — tens of thousands of duplicate image files have accumulated over more than a decade of digitisation drives, creating storage bottlenecks, indexing errors and, in some cases, public records that simply cannot be found through standard search tools.
The issue has moved from an internal technical irritant to a matter of public debate because of Barcelona City Hall's 2025–2027 Digital Transformation Plan, which earmarks investment in open-data infrastructure and sets a hard target of making 80 percent of municipal visual archives searchable and publicly accessible by the end of 2027. Meeting that deadline, technicians and archivists say, requires resolving the duplicate problem first — a task that has proven far more complicated than anyone anticipated when the digitisation programmes launched in the early 2010s.
What the Experts Are Saying
Specialists working in digital heritage and records management have been vocal. At a symposium hosted by the Universitat Pompeu Fabra in the Ciutadella campus in late June 2026, researchers presented findings showing that image duplication rates in mid-sized European municipal archives commonly run between 15 and 30 percent of total holdings — a figure that carries real financial consequences when cloud storage and manual cataloguing costs are factored in. Barcelona's archives are not believed to be an outlier on that scale, though city officials have not yet published a specific internal audit figure.
The conversation has broadened beyond archivists. Urban planners relying on the city's Cartografia i GIS division for neighbourhood-level visual documentation — particularly for contested redevelopment zones in Poblenou's 22@ district and along the port-adjacent blocks of Barceloneta — report that duplicate georeferenced images create version-control confusion when teams pull records for planning reports. The practical cost is measured in staff hours: re-verification work that interrupts project timelines.
Mayor Jaume Collboni's administration has signalled awareness of the technical dimension without yet committing to a standalone remediation budget line. The Digital Transformation Plan references the need for automated deduplication tooling as part of broader data hygiene, but critics within the city's technology advisory circles argue that framing it as a minor sub-task underestimates the scope of the work involved. Those voices — speaking through professional bodies rather than named on the record — have pushed for a dedicated project team rather than absorbing the work into existing IT maintenance rotations.
Institutions Under Pressure to Act
The Arxiu Nacional de Catalunya, based in Sant Cugat del Vallès and the primary repository for Catalan public records above the municipal level, completed its own internal deduplication audit in March 2026 and concluded that eliminating redundant files reduced its active image index by roughly 18 percent while improving retrieval speeds across its public portal. That outcome has been cited repeatedly by Barcelona municipal technicians as a benchmark worth replicating — and as evidence that the investment pays off faster than sceptics suggest.
Closer to the city centre, the Centre de Cultura Contemporània de Barcelona on Carrer de Montalegre has been piloting AI-assisted image similarity detection across its digital exhibition archive since January 2026. The pilot, running on a six-month trial basis, is understood to be producing preliminary results that the CCCB's digital team plans to present publicly later this summer. That presentation will likely become a reference point for the municipal conversation, given the CCCB's role as a civic institution with direct links to both the culture and technology policy networks feeding into city hall.
For residents and researchers who use Barcelona's open-data portals, the practical advice from digital archivists is straightforward: when pulling images from the Ajuntament's open-data portal or the Arxiu Fotogràfic's online catalogue, cross-reference file metadata dates and identifiers carefully, since the same image may appear under multiple catalogue entries with different provenance labels. The deduplication work, whenever it is formally resourced, will eventually clean those records — but until a project start date is confirmed and staffed, the inconsistencies remain live in the public-facing systems.