Skip to content

Navigation Menu

Appearance settings

OpenIsraeliSupermarkets

Explore
By company size
By use case
By industry
View all solutions
Topics
- AI
- DevOps
- Security
- Software Development
- View all
Explore
- GitHub Sponsors
  Fund open source developers
- The ReadME Project
  GitHub community articles
Repositories
- Enterprise platform
  AI-powered developer platform
Available add-ons
Pricing

Search code, repositories, users, issues, pull requests...

Search

Clear

Search syntax tips

Provide feedback

We read every piece of feedback, and take your input very seriously.

Include my email address so I can be contacted

Saved searches

Use saved searches to filter your results more quickly

Name

Query

To see all available qualifiers, see our documentation.

Appearance settings

You signed in with another tab or window. Reload to refresh your session. You signed out in another tab or window. Reload to refresh your session. You switched accounts on another tab or window. Reload to refresh your session.

Dismiss alert

Open Israeli Supermarket

We utilize the data published by the Israeli supermarket to help the community.

8 followers
Israel
erlichsefi@gmail.com

Overview
Repositories
Projects
Packages
People

More

Overview
Repositories
Projects
Packages
People

README.md

V0: Current State

Historical Data:

Retained in Kaggle, ensuring no data loss.
Future possibility to load data into an on-demand API.

On-Demand:

Basic API to get data like the old interface, which makes the old way of accessing data obsolete.

Infrastructure:

Running on small unstable intance.

Project Roadmap

Next Goal: Public Supermarket Data Website

Goal:

GA application to access supermarket data.

Potential customers:

Any customer currently accessing the website directly.

How:

Running on a stable infrastructure (remote server).
API accessible to support a few dozen customers.
Reliable data processing.
Data served according to each chain’s schema format.
Only daily data avaliable in the API.
Understanding minimal hosting requirements.

Additional Goals:

Hosting a hackathon to solve DS-related challenges:
- Transform Kaggle dataset into a hackathon challenge.
- Organize and run the hackathon.
Cloud Infrastructure:
- Create a landing page (AWS required).

V2: Supporting Formatted Interactions

Goal:

Serve one schema format across all chains.
Support search pipeline from natural language queries.

Customers:

GenAI applications.

V3: Supporting Product Comparison

Goal:

Enable price comparison across supermarket chains.

Customers:

Comparison applications.
Price comparison platforms.

How:

Minimal data retention for in-day changes.
Expose an Alternatives API that:
- Finds equivalent products across different chains.

V4: Supporting Time-Series Analysis

Goal:

Provide price predictions and trend detection.

Customers:

Price prediction models.
Trend detection applications.

How:

Load historical data from Kaggle.

V4: Exposing Web Page and User Access

Goal:

Provide monitoring and visibility for data pipeline status and quality.

Customers:

Internal stakeholders and users.

How:

Monitoring Page:
- Based on MongoDB collection, create a page to track data pipeline status.
Data Quality Metrics Page:
- Based on MongoDB collection, publish metrics about data quality.

Work Breakdown & Definition of Done (DoD)

V1: Reducing Latency

Scrapers:
- Support writing XML to a stream (no IO to hard drive).
- Use queues to populate scrapers with data nodes (one queue per chain).
Scrapers:
- Write execution status into MongoDB instead of JSON files.
- Create a new collection for all scraper execution statuses.
Parsers:
- Pull data from the queue instead of reading from the hard drive.
- Write parsed results to MongoDB (one collection per file type and chain, keeping all versions).
- Investigate latency reduction strategies.
Stability Improvements:
- Perform scraper stress tests (analyze impact of high CPU allocation on timeouts).
- Error analysis to ensure correct data publication to Kaggle.
Kaggle Dump:
- Develop a process to transform MongoDB data into a Kaggle dataset and publish it.

V2: Data Analysis

Schema Linking:
- Solve schema linking between supermarket chains.
- Compute product prices across chains.
Promotions:
- Implement a method to apply promotions to product prices.
Schema Transformation:
- Given a CSV, execute schema transformations.
- Evaluate transformations on random Kaggle data versions.
- Plan for processing all Kaggle data into MongoDB.

V2: Search

Schema Engine:
- Implement a search engine that allows text searches across any textual column.
- Merge and pre-filter lists based on search queries.

V3: Time-Series Data

Kaggle Load:
- Load Kaggle datasets into MongoDB.

Additional Action Item

Create README:
- Document project state and roadmap clearly for contributors.

Pinned Loading

israeli-supermarket-scarpers israeli-supermarket-scarpers Public

A python package with client to scrape the israeli supermarkets data

Jupyter Notebook 21 5
israeli-supermarket-parsers israeli-supermarket-parsers Public

A python package with parsers to process the supermarkets data

Python 5 3
daily-publish-supermarket-data daily-publish-supermarket-data Public

Daily Cron job to publish the supermarkets data to Kaggle

Python

Repositories

Loading

Type

Select type

All Public Sources Forks Archived Mirrors Templates

Language

Select language

All Jupyter Notebook Python

Sort

Select order

Last updated Name Stars

Showing 5 of 5 repositories

israeli-supermarket-scarpers Public
A python package with client to scrape the israeli supermarkets data

OpenIsraeliSupermarkets/israeli-supermarket-scarpers’s past year of commit activity

Jupyter Notebook 21 5 0 0 Updated Jul 16, 2025
daily-publish-supermarket-data Public
Daily Cron job to publish the supermarkets data to Kaggle

OpenIsraeliSupermarkets/daily-publish-supermarket-data’s past year of commit activity

Python 0 MIT 0 0 2 Updated Jul 9, 2025
israeli-supermarket-parsers Public
A python package with parsers to process the supermarkets data

OpenIsraeliSupermarkets/israeli-supermarket-parsers’s past year of commit activity

Python 5 3 1 0 Updated Jul 8, 2025
entity-matching Public
Playing around with entity matching over supermarket items

OpenIsraeliSupermarkets/entity-matching’s past year of commit activity

0 MIT 0 0 0 Updated Jun 1, 2025
.github Public

OpenIsraeliSupermarkets/.github’s past year of commit activity

0 0 0 0 Updated Feb 15, 2025

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Uh oh!

There was an error while loading. Please reload this page.

Most used topics

Loading…

Uh oh!

There was an error while loading. Please reload this page.

Footer

© 2025 GitHub, Inc.

Footer navigation

Terms
Privacy
Security
Status
Docs
Contact

You can’t perform that action at this time.