Data accompanying our submission to Climatic Change, titled "Progress on Climate Action: a Multilingual Machine Learning Analysis of the Global Stocktake".
The dataset contains the embeddings (.zip of pickle files), the associated document items (also a .zip of pickle files), the keywords and paragraphs most closely associated with each topic in the final model (.xlsx), the reduced 2D embeddings with all selected paragraphs (.csv, UTF-8 encoded), and an overview of the metadata per source (.csv, UTF-8 encoded).
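
As a minimal sketch of how files in these formats could be read with the Python standard library: the function names, the paths passed to them, and the assumption that pickled members end in .pkl or .pickle are all hypothetical and not taken from the dataset, so adjust them to the actual contents. The unpickled objects may also require the libraries they were created with (for example NumPy) to be installed. Pickle files can execute code when loaded, so only unpickle archives obtained from a trusted source.

```python
import csv
import pickle
import zipfile


def load_pickles_from_zip(zip_path):
    """Return {member name: unpickled object} for each pickle in the archive.

    Assumption: pickled members end in .pkl or .pickle; other members are skipped.
    """
    objects = {}
    with zipfile.ZipFile(zip_path) as archive:
        for name in archive.namelist():
            if name.endswith((".pkl", ".pickle")):
                with archive.open(name) as handle:
                    objects[name] = pickle.load(handle)
    return objects


def read_utf8_csv(csv_path):
    """Read a UTF-8 encoded CSV file into a list of dicts keyed by column header."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))
```

Passing encoding="utf-8" explicitly avoids relying on the platform default, which matters here because the paragraphs are multilingual.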