This project is a web scraping pipeline that extracts product data from Leroy Merlin’s website.
It uses a modular approach with dedicated functions to fetch product categories, subcategories, pages, and individual items, then saves the results in CSV format.
## Features

- Scrape product categories, subcategories, and paginated pages
- Extract items from each page using custom parsing functions
- Save results as structured CSV files inside the `output/` folder
- Skip already-scraped pages to avoid duplicates
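The fetch-and-parse step can be sketched with the project's two dependencies. Note that the CSS selectors and the `get_items` signature below are illustrative assumptions, not Leroy Merlin's actual markup or the project's real parser:

```python
from bs4 import BeautifulSoup  # third-party: beautifulsoup4


def get_items(html):
    """Extract (name, price) pairs from a product listing page.

    The class names "product-card", "name", and "price" are placeholders;
    the real site uses different markup.
    """
    soup = BeautifulSoup(html, "html.parser")
    items = []
    for card in soup.select("div.product-card"):
        name = card.select_one("span.name")
        price = card.select_one("span.price")
        if name and price:  # skip malformed cards
            items.append((name.get_text(strip=True), price.get_text(strip=True)))
    return items
```

In the real pipeline the HTML would come from `requests.get(url).text` before being handed to the parser.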
## Requirements

This project works with Python 3.8+. External dependencies must be installed with pip:
```
pip install requests beautifulsoup4
```

## Project Structure

```
├── main.py                   # Entry point
├── script/
│   ├── leroymerlin.py        # get_products, get_pages, get_items
│   └── util.py               # save_csv, get_last_path_parts
├── credential.example.py     # Example API credential file
├── output/                   # Scraped CSV files (auto-generated)
└── README.md
```
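Judging from the file tree, `main.py` wires these helpers together in nested loops: categories, then pages, then items. A rough sketch of that flow, written with the helpers passed in as parameters so it can be read in isolation (the real signatures in `script/leroymerlin.py` may differ):

```python
def run_pipeline(get_products, get_pages, get_items, save_csv):
    """Hypothetical orchestration mirroring what main.py appears to do.

    Each argument is one of the project's helper functions; their exact
    signatures here are assumptions based on the names in the tree.
    """
    for category_url in get_products():          # top-level categories
        for page_url in get_pages(category_url):  # paginated listing pages
            items = get_items(page_url)           # parsed products
            if items:                             # nothing to save for empty pages
                save_csv(page_url, items)
```

Passing the helpers in makes the control flow testable without network access; the actual entry point presumably imports them from `script/` directly.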
## Setup

Clone the repository:

```
git clone https://github.com/harivonyR/LeroyMerlyn_scraping
cd LeroyMerlyn_scraping
```

Copy the example credential file:

```
copy credential.example.py credential.py
```

Open `credential.py` and paste your PILOTERR API key:

```python
x_api_key = "paste your API key here!"
```

Install the dependencies and run the scraper:

```
pip install requests beautifulsoup4
python main.py
```

## Notes

- The scraper automatically skips files that already exist in `output/`.
- If a subcategory has no pagination, the scraper moves on.
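The skip-already-scraped behavior amounts to checking for the output CSV before fetching. A minimal sketch (the helper name `should_scrape` is hypothetical; the real check lives inside the project's scraping loop):

```python
import os


def should_scrape(csv_path):
    """Return True only if no CSV has been written for this page yet.

    Illustrative helper: the project performs an equivalent existence
    check against output/ before re-scraping a page.
    """
    return not os.path.exists(csv_path)
```

Checking for the file up front means an interrupted run can simply be restarted and will resume where it left off.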