Francesco Tosoni at UNESCO Paris

UNESCO Paris, 2026 • 10th anniversary of Software Heritage

Photo by Elarionne via Wikimedia Commons CC BY-SA 4.0 Source
Research Contractor

Francesco Tosoni

PhD, Computer Science

|

I make data more efficient, more accessible, and more sustainable. My work blends lossless compression algorithms with open data infrastructures to create greener computing solutions.

Based at Sant'Anna School of Advanced Studies in Pisa, Italy.

Lossless Compression Open Data Green Algorithms Matrix Computation String Indexing Wikimedia Volunteer

About

I'm a computer science researcher and a Wikimedian passionate about making open data more efficient, accessible, and sustainable. My work sits at the intersection of lossless compression algorithms, open data infrastructures, and green computing.

By day, at Sant'Anna School of Advanced Studies I research compressed data structures, indexing and retrieval techniques for the Software Heritage archive, the "Library of Alexandria" of code. By weekends, I keep contributing to open-data, collaborative projects like Wikidata.

My core research area is lossless compressionI completed my PhD at the University of Pisa under the supervision of Professors P. Ferragina and G. Manzini, focusing on computation-friendly compression - techniques that allow us to operate directly on compressed data without decompression overhead. The challenge is to develop tools that make data processing more energy-efficient, too. Compression is not just about saving space on disk: the real challenge is to adapt compression schemes so that they allow to operate directly in main memory (without compression overhead) and in time proportional to the compressed representation size.

I'm actively involved in:

  • Software Heritage - Making source code archival more efficient and accessible
  • Wikimedia projects - Technical contributions to Wikidata and Meta-Wiki
  • Green algorithms - Developing energy-aware compression techniques

Connect

Publications & Profiles

Research Activity

As an algorithmist, he primarily specialised in lossless data compression. Since July 2024, he has been working on optimizing the compression and efficient indexing of large code archives in collaboration with the Software Heritage team.

Current Research Focus

  • Compressed formats for matrices and trie structures
  • Sparse matrix formats supporting matrix-vector multiplications (SpMV) in the compressed domain
  • Energy-efficient computation on compressed data

Pronunciation: For those familiar with the IPA, his name is pronounced [fraŋ'ʧesko to'zoːni].

Contact & Location

Institutional Affiliation

Sant'Anna School of Advanced Studies

Research Contractor

Email

francesco@santannapisa.it

(obfuscated for spam protection)

Location

Sant'Anna School of Advanced Studies

L'EMbeDS room

p.zza Martiri della Libertà 33

56127 Pisa PI, Italy

Map

View on OpenStreetMap