Which of the following descriptions best represent the overarching design of your forecasting model?
- Machine learning-based weather prediction.
What techniques did you use to initialise your model? (For example: data sources and processing of initial conditions)
Our forecasts are initialized from two distinct reanalysis products.
- The atmospheric variables are initialized from the ERA5T reanalysis using the 10 ensemble members of the ERA5 Ensemble of Data Assimilations (EDA).
- The ocean variables are initialized using the ORAS6 ocean and sea-ice reanalysis. Because ORAS6 becomes available with a latency of about 8–12 days (depending on the alignment between the forecast start time and the 5-day assimilation window of the reanalysis), a separate deterministic machine-learning model was trained to bridge this short gap. This autoregressive model, which uses surface ocean and sea-ice conditions forced by the operational atmospheric analysis, provides the most accurate possible ocean initial conditions during this delay period. Unlike the atmosphere, the ocean's initial conditions are identical for all ensemble members.
If any, what data does your model rely on for real-time forecasting purposes?
For real-time forecasting, AIFSthalassa uses two consecutive states (t, t–24 h), similar to AIFSgaia and AIFShera. As described above, the atmospheric input comes from ERA5T and its associated EDA, while the ocean and sea ice input comes from an ORAS6-like model. Input variables are:
- Temperature, humidity, geopotential height, and horizontal winds at 1000, 925, 850, 700, 600, 500, 400, 300, 250, 200, 150, 100, 50 hPa.
- Surface variables: 10m winds, 2m-temperature, mean sea level pressure, total column water vapour.
- Ocean variables: sea surface temperature, sea surface height, sea surface salinity, and sea surface velocity (zonal and meridional components).
- Forcing variables: orography, land-sea mask, insolation.
What types of datasets were used for model training? (For example: observational datasets, reanalysis data, NWP outputs or satellite data)
The model was trained on the ERA5 and ORAS6 reanalyses from 1993 – 2024 with 6-hourly temporal resolution on an o96 grid (~1°). In addition to the input variables, the model predicts precipitation and cloud cover as diagnostic outputs.
Please provide an overview of your final ML/AI model architecture (For example: key design features, specific algorithms or frameworks used, and any pre- or post-processing steps)
AIFSTalassa follows the AIFS-CRPS architecture described by Lang et al. (2024b) and summarized in the AIFSgaia documentation. The main differences with AIFSgaia are:
• The embedding dimension of the processor is 1536 for approximately 500M trainable model parameters.
• Training is run for 300k iterations.
In AIFSthalassa, there is no explicit coupling between the ocean and atmospheric components. The fields are trained jointly in a single model with a common loss, at the same spatiotemporal resolution.
Have you published or presented any work related to this forecasting model? If yes, could you share references or links?
AIFSthalassa is based on Lang et al., AIFS-CRPS: Ensemble forecasting using a model trained with a loss function based on the Continuous Ranked Probability Score (2024b), http://arxiv.org/abs/2412.15832. The specific treatment of the ocean variables is not yet described in a peer-reviewed publication.
Before submitting your forecasts to the AI Weather Quest, did you validate your model against observational or independent datasets? If so, how?
Model verification follows the same procedure as for AIFSgaia, using ERA5 data for out-of-sample evaluation and one March-May of this year for sanity checks after fine-tuning.
Did you face any challenges during model development, and how did you address them?
The technical challenges were largely identical to those encountered in AIFSgaia, primarily related to automating forecast generation and submission. A forced ML ocean model has been developed to obtain the updated ocean initial conditions that enable us to run the forecast system in real time.
Are there any limitations to your current model that you aim to address in future iterations?
Like AIFSgaia, AIFSthalassa tends to produce an ensemble spread narrower than the observed variability. Future work will focus on improving ensemble calibration.
Are there any other AI/ML model components or innovations that you wish to highlight?
AIFSthalassa is the first model of the AIFS family that explores a prognostic representation of the surface ocean and sea ice as sources of S2S predictability.
Who contributed to the development of this model? Please list all individuals who contributed to this model, along with their specific roles (e.g., data preparation, model architecture, model validation, etc) to acknowledge individual contributions.
AIFSthalassa was developed by the AIFS Team at ECMWF, encompassing efforts in data preparation, model architecture, code development, high-performance computing, and operational deployment.