You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi there! This is recommender system project prepared for data [ContentWise Impressions](https://github.com/ContentWise/contentwise-impressions) using:
15
-
-`lightning` for model training
16
-
-`hydra` for configuration
17
-
-`clearml` for ML cycle control
18
-
- (optionally) aws s3 as remote storage
14
+
This repository provides a modular template for building recommender systems in Python using **implicit feedback data**. It is designed to streamline experimentation of recommendation models with a modern ML stack.
15
+
16
+
### 🔧 Tech Stack
17
+
-**PyTorch Lightning** – for scalable and structured model training
18
+
-**Hydra** – for flexible configuration management
19
+
-**ClearML** – for experiment tracking and ML workflow orchestration
20
+
-**(Optional) AWS S3** – for storing datasets and models remotely
21
+
22
+
### 📦 Dataset
23
+
24
+
As an example, this template uses the [ContentWise Impressions](https://github.com/ContentWise/contentwise-impressions) dataset, which contains real-world implicit feedback data.
25
+
26
+
### 🚀 Use Cases
27
+
28
+
- Rapid prototyping of recommender systems
29
+
- Benchmarking implicit models
30
+
- Educational purposes (learning modern ML tools in practice)
31
+
32
+
---
33
+
34
+
More details about setup, usage, and customization can be found in the sections below.
35
+
19
36
20
37
## Prerequisites
21
38
22
-
To make use of this repository, you have to:
23
-
-[download](https://github.com/ContentWise/contentwise-impressions) ContentWise data, concretally `CW10M` directory, and place it at `cache/CW10M/` (cache directory is included in .gitignore)
24
-
- set up a ClearML server and, optionally, AWS S3 storage
39
+
To make use of this repository, follow these steps:
40
+
41
+
1.**Download the dataset**
42
+
Download the [ContentWise Impressions dataset](https://github.com/ContentWise/contentwise-impressions), specifically the `CW10M` directory.
43
+
Place it in the following path: `cache/data-cw10m/`
44
+
45
+
2.**Set up external services**
46
+
- Configure your connection to a ClearML server for experiment tracking.
47
+
- (Optional) Set up access to AWS S3 if you want to use remote storage for data or models.
0 commit comments