by Aida Usmanova and Ricardo Usbeck
This paper is part of the proceedings of the first ClimateNLP workshop at ACL 2024. The paper is available here.
Following the introduction of the European Sustainability Reporting Standard (ESRS), companies will have to adapt to a new policy and provide mandatory sustainability reports. However, implementing such reports entails a challenge, such as the comprehension of a large number of textual information from various sources. This task can be accelerated by employing Large Language Models (LLMs) and ontologies to effectively model the domain knowledge. In this study, we extended an existing ontology to model ESRS Topical Standard for disclosure. The developed ontology would enable automated reasoning over the data and assist in constructing Knowledge Graphs (KGs). Moreover, the proposed ontology extension would also help to identify gaps in companies’ sustainability reports with regard to the ESRS requirements.Additionally, we extracted knowledge from corporate sustainability reports via LLMs guided with a proposed ontology and developed their KG representation.
- Create a virtual environment
python -m venv venv
and install all requirementspip install -r requirements.txt
- The
data/
folder contains sustainability reports. In this study we used pre-processed reports from Bronzini et.al 2024 - Create
results/
folder containingprocessed/
andraw/
subfolders - Extract triples with
src/extract_triples.py
and generate a knowledge graph from the report withsrc/generate_kg.py
. Before running set up your OpenAI key in both files. - Once KG is generated, you can visualize it with
src/visualize_kg.py