This repository contains datasets and supporting scripts from the study "RETROFIT-LAT: A Comprehensive Dataset for Energy Efficiency Investments in Latvia". It provides detailed information on residential building energy efficiency projects funded by the Latvian Environmental Investment Fund (LEIF). The data focuses on retrofitting and solar panel installations, offering insights into energy performance improvements and sustainability measures.
- About the Dataset
- Structure of the Repository
- Key Features of the Dataset
- Data Preprocessing and Privacy
- Potential Applications
- Limitations
- Citation
- Acknowledgements
The repository contains two primary datasets:
- EF_comp Dataset: Focuses on building retrofitting, including energy performance, CO2 emissions, and energy-saving measures before and after renovations.
- Sol_pan_comp Dataset: Details solar panel installations, including electricity consumption, energy production, and CO2 emissions reductions.
These datasets cover projects implemented from 1870 to 2022 across Latvia, providing a rich source for evaluating the impacts of energy-saving initiatives.
- data/
  - EF_comp.csv: Retrofitting data for buildings.
  - Sol_pan_comp.csv: Data on solar panel projects.
- scripts/
  - Python scripts for data preprocessing and analysis.
- docs/
  - Detailed documentation on dataset structure and feature descriptions.
- README.md: Overview of the repository.
- LICENSE: License information for the dataset and scripts.
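As a starting point, the two CSV files can be inspected without knowing their schema in advance. The sketch below is a minimal example that assumes the files are comma-delimited and UTF-8 encoded, which this README does not confirm; `load_rows` and `summarize` are illustrative helper names, not part of the repository's scripts.

```python
import csv
from pathlib import Path

# Directory name taken from the repository layout; adjust if you cloned elsewhere.
DATA_DIR = Path("data")


def load_rows(csv_path):
    """Read a CSV file into a list of dicts keyed by the header row.

    Assumes comma-delimited, UTF-8 encoded input (an assumption, not a
    documented property of the dataset).
    """
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def summarize(rows):
    """Return the row count and the column names, assuming no particular schema."""
    columns = list(rows[0].keys()) if rows else []
    return len(rows), columns


if __name__ == "__main__":
    for name in ("EF_comp.csv", "Sol_pan_comp.csv"):
        path = DATA_DIR / name
        if path.exists():
            count, columns = summarize(load_rows(path))
            print(f"{name}: {count} rows, columns: {columns}")
        else:
            print(f"{name}: not found at {path}")
```

Printing the column names first is a cheap way to check the real feature names against the documentation in docs/ before writing any analysis code.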
The EF_comp dataset contains:
- General building information: Location, year of construction, and building size.
- Energy performance: Energy class, energy consumption, and CO2 emissions before and after retrofitting.
- Structural details: Floor height, number of floors, and insulation metrics.
- Energy-saving measures: Heat loss coefficients, energy-saving percentages, and compliance with nearly zero-energy building (nZEB) standards.
The Sol_pan_comp dataset includes:
- Project details: Year, region, and funding specifics.
- Energy metrics: Pre- and post-installation energy consumption, inverter capacity, and solar electricity production.
- Environmental impact: Reductions in primary energy use and CO2 emissions.
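Both datasets pair a pre-intervention value with a post-intervention value, so a relative saving can be derived from any such pair. The following is a minimal sketch of that arithmetic; the field names `energy_before` and `energy_after` are hypothetical placeholders, and the formula is the usual relative-reduction definition, not necessarily the one used to compute the dataset's own energy-saving percentages.

```python
def saving_percent(before, after):
    """Relative reduction from `before` to `after`, expressed in percent."""
    if before <= 0:
        raise ValueError("the pre-intervention value must be positive")
    return 100.0 * (before - after) / before


# Invented record with hypothetical field names, for illustration only.
record = {"energy_before": 200.0, "energy_after": 120.0}

print(saving_percent(record["energy_before"], record["energy_after"]))  # 40.0
```

A negative result would indicate that consumption or emissions increased after the intervention, which is worth flagging rather than clipping to zero.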
To ensure privacy and comply with data protection standards:
- Personally identifiable information (PII) such as addresses and project numbers has been removed.
- Location data has been pseudo-anonymized (e.g., town1, county1).
- Specific dates have been generalized to the year level.
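The three privacy steps listed above can be sketched as follows. This is an illustration of the general technique under stated assumptions, not the preprocessing code used for this dataset: the column names (`address`, `project_number`, `town`, `approval_date`) and the sample rows are invented for the example.

```python
from datetime import date

# Hypothetical names of columns holding personally identifiable information.
PII_COLUMNS = ("address", "project_number")


def pseudonymize(values, prefix):
    """Replace each distinct value with prefix + counter, in order of first appearance."""
    mapping = {}
    result = []
    for value in values:
        if value not in mapping:
            mapping[value] = f"{prefix}{len(mapping) + 1}"
        result.append(mapping[value])
    return result


def anonymize(rows):
    """Drop PII columns, pseudonymize the town, and keep only the year of the date."""
    towns = pseudonymize([row["town"] for row in rows], "town")
    cleaned = []
    for row, town in zip(rows, towns):
        kept = {k: v for k, v in row.items() if k not in PII_COLUMNS}
        kept["town"] = town
        kept["year"] = kept.pop("approval_date").year
        cleaned.append(kept)
    return cleaned


# Invented sample rows; none of these values come from the dataset.
rows = [
    {"address": "Example St 1", "project_number": "P-001",
     "town": "Alpha", "approval_date": date(2019, 5, 14)},
    {"address": "Example St 2", "project_number": "P-002",
     "town": "Beta", "approval_date": date(2020, 9, 1)},
    {"address": "Example St 3", "project_number": "P-003",
     "town": "Alpha", "approval_date": date(2021, 2, 20)},
]

for cleaned_row in anonymize(rows):
    print(cleaned_row)
```

Because the same town always maps to the same label, records can still be grouped by location without revealing the real place name. The mapping itself would need to be kept private or discarded for the pseudonyms to offer any protection.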
The datasets can be utilized for:
- Analyzing the impact of building renovations on energy efficiency and CO2 emissions.
- Evaluating the effectiveness of solar panel installations in reducing energy consumption.
- Identifying trends and patterns in energy efficiency across different building types and regions.
- Supporting policy development and decision-making in energy sustainability.
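A regional comparison like the one mentioned above might start with a simple grouped average. The sketch below uses only the standard library; the keys `region` and `saving_percent` and the sample values are hypothetical, so substitute the actual column names from the dataset documentation.

```python
from collections import defaultdict
from statistics import mean


def mean_by_group(rows, group_key, value_key):
    """Average `value_key` for each distinct value of `group_key`."""
    groups = defaultdict(list)
    for row in rows:
        # CSV readers yield strings, so convert to float before averaging.
        groups[row[group_key]].append(float(row[value_key]))
    return {group: mean(values) for group, values in groups.items()}


# Invented rows with hypothetical column names, for illustration only.
sample = [
    {"region": "county1", "saving_percent": "40"},
    {"region": "county1", "saving_percent": "50"},
    {"region": "county2", "saving_percent": "30"},
]

print(mean_by_group(sample, "region", "saving_percent"))
```

With a small number of buildings per group, such averages can be dominated by a few projects, so it helps to report the group sizes alongside them.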
The datasets have the following limitations:
- The datasets are limited to 1,010 buildings, potentially affecting the generalizability of findings.
- Some features are underrepresented or incomplete, such as region-specific energy class distributions.
If you use this dataset in your research, please cite this repository.