A full-stack application that uses Retrieval-Augmented Generation (RAG) to analyze GitHub repositories and answer questions about their code.
- Index any public GitHub repository
- Process and chunk code files respecting function/class boundaries
- Generate embeddings for code chunks and store them in a vector database
- Ask natural language questions about the repository
- Get AI-generated answers grounded in the relevant code context
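The repository's actual chunker isn't shown here, but a minimal sketch of boundary-aware chunking for Python files, using only the standard `ast` module, might look like this (`chunk_python_source` is an illustrative name, not a function from this codebase):

```python
import ast

def chunk_python_source(source: str) -> list[str]:
    """Split Python source into chunks at top-level function/class boundaries."""
    tree = ast.parse(source)
    lines = source.splitlines()
    chunks = []
    for node in tree.body:
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef)):
            # lineno/end_lineno are 1-based and inclusive, hence the -1 slice start.
            chunks.append("\n".join(lines[node.lineno - 1:node.end_lineno]))
    return chunks
```

Chunking at definition boundaries keeps each embedded chunk semantically coherent, which tends to improve retrieval quality over fixed-size windows.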
- Frontend: React, TypeScript, Tailwind CSS, React Query
- Backend: Python, FastAPI
- Vector Database: Qdrant (in-memory for development)
- Embedding Model: OpenAI Text Embedding API
- LLM: OpenAI GPT-3.5 Turbo
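At query time, the retrieval step embeds the question and ranks stored chunks by vector similarity. The app uses OpenAI embeddings with Qdrant; the toy sketch below substitutes a bag-of-words "embedding" and plain cosine similarity purely to illustrate the ranking idea (all names here are illustrative, not from the codebase):

```python
import math
from collections import Counter

def embed(text: str) -> Counter:
    # Toy bag-of-words vector; the real app calls the OpenAI Embedding API.
    return Counter(text.lower().split())

def cosine(a: Counter, b: Counter) -> float:
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def top_k(query: str, chunks: list[str], k: int = 2) -> list[str]:
    # Rank chunks by similarity to the query; Qdrant does this at scale.
    q = embed(query)
    return sorted(chunks, key=lambda c: cosine(q, embed(c)), reverse=True)[:k]
```

The top-ranked chunks are then passed to the LLM as context for answer generation.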
- Node.js and npm
- Python 3.9+
- OpenAI API key
- Clone this repository
- Install frontend dependencies:

  ```bash
  npm install
  ```

- Install backend dependencies:

  ```bash
  cd backend
  pip install -r requirements.txt
  ```
- Create a `.env` file in the backend directory:

  ```
  OPENAI_API_KEY=your-openai-api-key
  ```
- Start the backend server:

  ```bash
  cd backend
  uvicorn main:app --reload
  ```
- In another terminal, start the frontend:

  ```bash
  npm run dev
  ```
- Open your browser to `http://localhost:5173`
- Enter a GitHub repository URL in the input field
- Click "Process Repository" to start indexing
- Wait for the indexing to complete (this may take some time for large repositories)
- Ask questions about the repository in the chat interface
- View AI-generated answers with references to specific code files
- The in-memory vector database does not persist data between server restarts
- Large repositories can take considerable time to index
- The chunking algorithm may not perfectly respect code boundaries in all languages
- The quality of answers depends on the OpenAI model used
- Persistent vector database storage
- Support for private GitHub repositories
- More sophisticated code parsing and chunking
- Multi-user support with authentication
- Caching of previously processed repositories
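One way the planned repository caching could work (a hypothetical `RepoIndexCache`, not part of the current codebase): key cached indexes by repository URL plus commit SHA, so re-processing is skipped until the repository changes.

```python
from typing import Any, Callable

class RepoIndexCache:
    """Cache processed repository indexes, keyed by (repo URL, commit SHA)."""

    def __init__(self) -> None:
        self._cache: dict[tuple[str, str], Any] = {}

    def get_or_build(self, url: str, commit: str, build: Callable[[], Any]) -> Any:
        # Only invoke the expensive build step on a cache miss.
        key = (url, commit)
        if key not in self._cache:
            self._cache[key] = build()
        return self._cache[key]
```

Keying on the commit SHA means a new push naturally invalidates the entry, while repeat questions about an unchanged repository reuse the existing index.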
MIT