MURAL - Maynooth University Research Archive Library



    MeteoSaver v1.0: a machine-learning based software for the transcription of historical weather data


    Muheki, Derrick, Vercruysse, Bas, Chandrasekar, Krishna Kumar Thirukokaranam, Verbruggen, Christophe, Birkholz, Julie M., Hufkens, Koen, Verbeeck, Hans, Boeckx, Pascal, Lampe, Seppe, Hawkins, Ed, Thorne, Peter, Ntumba, Dominique Kankonde, Moulasa, Olivier Kapalay and Thiery, Wim (2026) MeteoSaver v1.0: a machine-learning based software for the transcription of historical weather data. Geoscientific Model Development, 19. pp. 3213-3255. ISSN 1991-9603

    [thumbnail of gmd-19-3213-2026.pdf]
    Preview
    Text - Published Version
    Available under License Creative Commons Attribution Non-commercial Share Alike.

    Download (40MB) | Preview

    Abstract

    Archives of observed weather data present unique opportunities for scientists to obtain long time series of the historical climate for many regions of the world. Unfortunately, most of these observational records are to-date available only on paper, and thus require digitization and transcription to facilitate analysis of climatic trends. Here we present a new open-source software, MeteoSaver, that uses machine learning (ML) algorithms to transcribe handwritten records of historical weather data. MeteoSaver version 1.0 processes images of tabular sheets alongside user-defined configuration settings, performing transcription through five sequential steps: (i) image pre-processing, (ii) table and cell detection, (iii) transcription, (iv) quality assessment and quality control, and (v) data formatting and upload. As an illustration and evaluation of the software, we apply MeteoSaver to ten pictured sheets of handwritten temperature and precipitation observations from the Democratic Republic of the Congo. The results show that 95 %–100 % of the daily temperature values can be transcribed, of which a median of 74.4 % reached the highest internal quality flag and 74 % matches with the manually transcribed record, yielding a median mean absolute error of 0.3 °C. These results illustrate that MeteoSaver can be applied to a range of handwriting styles and varying tabular dimensions, paper sizes, and maintenance conditions, highlighting its potential for transcribing tabular meteorological observations from multiple regions, especially if the sheets have a consistent format. Overall, our open-source software can help address the challenges of limited available hydroclimatic data within many regions of the world, by helping to save millions of handwritten records of historical weather data presently stored in archives, and expedite research on the climate and environmental changes in data scarce regions.
    Item Type: Article
    Additional Information: This research has been supported by the Fonds Wetenschappelijk Onderzoek (grant nos. 11M8825N and 11M8823N), the HORIZON EUROPE European Research Council (grant no. 101076909), and the European Union's Horizon 2020 (grant agreement no. 101081369).
    Keywords: MeteoSaver; machine-learning; software; transcription; historical weather data;
    Academic Unit: Faculty of Social Sciences > Geography
    Faculty of Social Sciences > Research Institutes > Irish Climate Analysis and Research Units, ICARUS
    Item ID: 21535
    Identification Number: 10.5194/gmd-19-3213-2026
    Depositing User: ICARUS Geography
    Date Deposited: 12 May 2026 11:44
    Journal or Publication Title: Geoscientific Model Development
    Publisher: Copernicus Publications
    Refereed: Yes
    Related URLs:
    Use Licence: This item is available under a Creative Commons Attribution Non Commercial Share Alike Licence (CC BY-NC-SA). Details of this licence are available here

    Downloads

    Downloads per month over past year

    Origin of downloads

    Altmetric Badge

    Repository Staff Only (login required)

    Item control page
    Item control page