Shub3am/RecordScrape

RecordScape

A powerful visual browser automation platform for recording interactions and automating data extraction through intelligent session replay.

Features

  • 🎥 Visual Recording: Record your browser interactions in real-time
  • 🎯 Element Selection: Visually select DOM elements for data extraction
  • 🔄 Automated Replay: Replay sessions in headless mode for data scraping
  • ⏰ Scheduling: Set up periodic scraping with customizable intervals
  • 📊 Dashboard: Modern RecordScape web interface for managing sessions and schedules
  • 💾 Data Export: Export scraped data in JSON/CSV formats

🎥 Demo

▶ Full video:
https://twitter.com/Shubh3m/status/2027349108256887131

Record once → Automate forever.

Architecture

┌─────────────────────────────────────────────────────────┐
│              RecordScape Web Dashboard                   │
│              (Flask + HTML/CSS/JS)                       │
└─────────────────┬───────────────────────────────────────┘
                  │
                  ▼
┌─────────────────────────────────────────────────────────┐
│                   Flask API Server                       │
│  /api/sessions, /api/schedules, /api/data              │
└────┬────────────────────────────────────────────┬───────┘
     │                                             │
     ▼                                             ▼
┌─────────────────┐                    ┌──────────────────┐
│   Recorder      │                    │    Replayer      │
│  (Selenium UI)  │                    │  (Headless)      │
└────┬────────────┘                    └────┬─────────────┘
     │                                      │
     ▼                                      ▼
┌─────────────────────────────────────────────────────────┐
│              Storage Layer (SQLite)                      │
│  Sessions, Schedules, Extracted Data                    │
└─────────────────────────────────────────────────────────┘
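The storage layer can be sketched with stdlib sqlite3. The table names and columns below are illustrative assumptions for this README, not the actual schema in vpr/storage.py:

```python
import json
import sqlite3

# Illustrative schema only -- the real tables in vpr/storage.py may differ.
SCHEMA = """
CREATE TABLE IF NOT EXISTS sessions (
    id        INTEGER PRIMARY KEY,
    name      TEXT NOT NULL,
    start_url TEXT NOT NULL,
    actions   TEXT NOT NULL          -- recorded steps, stored as JSON
);
CREATE TABLE IF NOT EXISTS schedules (
    id               INTEGER PRIMARY KEY,
    session_id       INTEGER NOT NULL REFERENCES sessions(id),
    interval_minutes INTEGER NOT NULL
);
CREATE TABLE IF NOT EXISTS extracted_data (
    id         INTEGER PRIMARY KEY,
    session_id INTEGER NOT NULL REFERENCES sessions(id),
    payload    TEXT NOT NULL          -- scraped values, stored as JSON
);
"""

def save_session(conn, name, start_url, actions):
    """Persist a recorded session; actions is a list of step dicts."""
    cur = conn.execute(
        "INSERT INTO sessions (name, start_url, actions) VALUES (?, ?, ?)",
        (name, start_url, json.dumps(actions)),
    )
    conn.commit()
    return cur.lastrowid

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)
sid = save_session(conn, "demo", "https://example.com",
                   [{"type": "click", "selector": "#submit"}])
```

Serializing the action list to JSON keeps the schema flat while still allowing arbitrary step types.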

Installation

  1. Clone the repository:
     git clone <repository-url>
     cd RecordBrowser
  2. Install dependencies:
     pip install -r requirements.txt
  3. Run the application:
     python app.py
  4. Open your browser to http://localhost:5000

Usage

Recording a Session

  1. Click "Record New Session" in the dashboard
  2. Enter the URL you want to scrape
  3. A browser window opens; perform your actions (navigate, click, scroll)
  4. Use the visual selector overlay to mark elements for extraction
  5. Click "Stop Recording" to save the session
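A recorded session boils down to the starting URL, the sequence of actions you performed, and the selectors you marked for extraction. The field names below are a hypothetical sketch of that structure, not the recorder's actual on-disk format:

```python
import json

# Hypothetical session layout -- field names are illustrative only.
session = {
    "name": "product-listing",
    "start_url": "https://example.com/products",
    "actions": [
        {"type": "navigate", "url": "https://example.com/products"},
        {"type": "click", "selector": "button.load-more"},
        {"type": "scroll", "y": 1200},
    ],
    "extract": [
        {"label": "title", "selector": "h2.product-title"},
        {"label": "price", "selector": "span.price"},
    ],
}

# Round-trip through JSON, as a storage layer would.
serialized = json.dumps(session, indent=2)
restored = json.loads(serialized)
```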

Replaying & Extracting Data

  1. Find your session in the "Saved Sessions" section
  2. Click "Replay" to run it once manually
  3. Extracted data appears in "Data Exports"
  4. Download as JSON or CSV
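Both export formats are plain stdlib work once the rows are extracted. The row shape here is a hypothetical example, not actual replayer output:

```python
import csv
import io
import json

# Hypothetical extracted rows -- real exports come from the replayer.
rows = [
    {"title": "Widget A", "price": "9.99"},
    {"title": "Widget B", "price": "14.50"},
]

def to_json(rows):
    """Serialize rows as pretty-printed JSON."""
    return json.dumps(rows, indent=2)

def to_csv(rows):
    """Serialize rows as CSV with a header taken from the first row's keys."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0]))
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()

json_out = to_json(rows)
csv_out = to_csv(rows)
```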

Scheduling Periodic Scraping

  1. Click "Schedule" on any saved session
  2. Set the frequency (minutes, hours, days)
  3. The scheduler automatically runs the session and saves data
  4. View scheduled jobs in the "Schedules" section
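The project schedules these jobs with APScheduler; the stdlib sketch below only illustrates the idea of an interval trigger re-running a session, and is not the project's scheduler:

```python
import threading

class IntervalJob:
    """Minimal repeating job -- illustrates what an interval trigger does
    for scheduled sessions; the project itself uses APScheduler."""

    def __init__(self, interval_seconds, func):
        self.interval = interval_seconds
        self.func = func
        self._timer = None

    def _tick(self):
        self.func()
        self.start()          # re-arm for the next interval

    def start(self):
        self._timer = threading.Timer(self.interval, self._tick)
        self._timer.daemon = True
        self._timer.start()

    def stop(self):
        if self._timer:
            self._timer.cancel()

# Stand-in for replaying a session and saving its data.
results = []
job = IntervalJob(0.05, lambda: results.append("replayed"))
job.start()
```

A real scheduler would also persist jobs so they survive restarts, which is what the schedules table and APScheduler handle in the project.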

Project Structure

RecordBrowser/
├── app.py                 # Flask application & API
├── vpr/                   # Visual Page Recorder package
│   ├── __init__.py
│   ├── recorder.py        # Session recording engine
│   ├── replayer.py        # Headless replay engine
│   ├── storage.py         # Database management
│   └── scheduler.py       # Background job scheduler
├── static/
│   ├── css/
│   │   └── style.css      # Dashboard styles
│   └── js/
│       └── app.js         # Dashboard JavaScript
├── templates/
│   └── index.html         # Dashboard HTML
└── requirements.txt

Technologies

  • Backend: Python, Flask, Selenium WebDriver
  • Frontend: HTML5, CSS3, Vanilla JavaScript
  • Database: SQLite
  • Scheduling: APScheduler
  • Browser Automation: Selenium + WebDriver Manager

License

MIT

About

A visual GUI tool for easy web scraping.
