Welcome to the Personal Knowledge Assistant project! This guide walks you through setting up both the backend and frontend components of the system, a retrieval-augmented generation (RAG) platform for querying books and personal knowledge.
- Document Loading: Processes PDF documents using PyPDFLoader
- Text Chunking: Splits documents into manageable chunks using RecursiveCharacterTextSplitter
- Embedding Generation: Converts chunks into vector representations using HuggingFaceEmbeddings
- Vector Storage: Stores embeddings in a FAISS vector store for efficient retrieval
- Query Rewriting: Rewrites the original query to be more effective for retrieval
- Base Retrieval: Retrieves an initial set of relevant documents from the vector store
- Contextual Compression: Applies filtering and extraction to improve retrieval quality
- Document Evaluation: Evaluates each retrieved document for relevance and reliability
- Score Calculation: Combines relevance and reliability into a confidence score
- Confidence Routing: Routes the query to different processing paths based on confidence (see the sketch after this list):
  - High Confidence (>0.7): Uses direct knowledge refinement
  - Medium Confidence (0.3-0.7): Uses hybrid approach
  - Low Confidence (<0.3): Falls back to web search
- Knowledge Strip Decomposition: Breaks documents into individual "knowledge strips"
- Strip Relevance Scoring: Scores each strip's relevance to the query
- Strip Filtering: Filters strips based on relevance threshold
- Search Query Generation: Creates optimized search queries
- DuckDuckGo Search: Performs web search using DuckDuckGo
- Result Processing: Extracts and processes relevant information from search results
- Prompt Template: Assembles a prompt with context, confidence level, and query
- Conversation Memory: Maintains chat history for contextual responses
- LLM Generation: Generates final response using Groq LLM (Mistral model)
- Response Formatting: Formats response based on confidence level with appropriate caveats
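To make the thresholds concrete, here is a minimal sketch of the confidence routing. How relevance and reliability are actually combined in the project is not specified in this guide, so the simple average below is only an illustration:

```python
def confidence_score(relevance: float, reliability: float) -> float:
    # How the two scores are combined is an assumption here; a simple
    # average is used purely for illustration.
    return (relevance + reliability) / 2


def route_query(confidence: float) -> str:
    """Map a confidence score to one of the three processing paths."""
    if confidence > 0.7:
        return "direct_knowledge_refinement"  # high confidence
    elif confidence >= 0.3:
        return "hybrid"                       # medium confidence
    else:
        return "web_search_fallback"          # low confidence
```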
- Confidence-Based Routing: Intelligently routes queries based on document relevance
- Knowledge Strip Decomposition: Extracts and filters relevant information pieces
- Dynamic Web Search Fallback: Uses web search when document knowledge is insufficient
- Document Evaluation: Explicitly evaluates document relevance and reliability
- Contextual Compression: Uses embeddings-based filtering and LLM extraction to improve retrieval quality
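The contextual compression step can be assembled with LangChain's compression retriever. A rough sketch, assuming the FAISS index already exists on disk; the index path, similarity threshold, and model names are placeholders rather than the project's actual settings:

```python
from langchain_community.vectorstores import FAISS
from langchain_groq import ChatGroq
from langchain_huggingface import HuggingFaceEmbeddings
from langchain.retrievers import ContextualCompressionRetriever
from langchain.retrievers.document_compressors import (
    DocumentCompressorPipeline,
    EmbeddingsFilter,
    LLMChainExtractor,
)

embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-mpnet-base-v2")
vector_store = FAISS.load_local(
    "data/embeddings", embeddings, allow_dangerous_deserialization=True
)
llm = ChatGroq(model="llama3-8b-8192")  # placeholder model; reads GROQ_API_KEY from the environment

# Drop chunks that are not similar enough to the query, then let the LLM
# extract only the relevant passages from the chunks that survive.
embeddings_filter = EmbeddingsFilter(embeddings=embeddings, similarity_threshold=0.75)
llm_extractor = LLMChainExtractor.from_llm(llm)
compressor = DocumentCompressorPipeline(transformers=[embeddings_filter, llm_extractor])

retriever = ContextualCompressionRetriever(
    base_compressor=compressor,
    base_retriever=vector_store.as_retriever(search_kwargs={"k": 5}),
)
docs = retriever.invoke("What does the author say about deliberate practice?")
```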
Before starting, ensure you have the following tools installed:
- Python 3.9+ for the backend
- Node.js 18+ for the frontend
- Git (optional)
- PDF books you want to include in your knowledge base
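Create the project directories: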
mkdir personal-knowledge-assistant
cd personal-knowledge-assistant
mkdir backend frontend
Organize your project directory as follows:
backend/
├── app/
│ ├── main.py
│ ├── api/
│ │ ├── __init__.py
│ │ ├── routes/
│ │ │ ├── __init__.py
│ │ │ └── chat.py
│ │
│ ├── core/
│ │ ├── __init__.py
│ │ ├── config.py
│ │ └── security.py
│ │
│ ├── db/
│ │ ├── __init__.py
│ │ └── vector_store.py
│ │
│ ├── models/
│ │ ├── __init__.py
│ │ └── schemas.py
│ │
│ ├── services/
│ │ ├── __init__.py
│ │ ├── rag.py
│ │ └── llm.py
│ │
│ └── utils/
│ ├── __init__.py
│ └── text_processing.py
│
├── data/
│ └── embeddings/
│
├── ingest.py
├── requirements.txt
└── .env
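Set up a Python virtual environment: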
cd backend
python -m venv venv
source venv/bin/activate # On Windows: venv\Scripts\activate
# Create requirements.txt with the following content
Add the following dependencies to your requirements.txt:
fastapi
uvicorn
pydantic
pydantic-settings
langchain
langchain-groq
langchain-community
langchain-huggingface
faiss-cpu
python-dotenv
pypdf
sentence-transformers
Then install the dependencies:
pip install -r requirements.txt
Create a .env file in the backend directory:
GROQ_API_KEY=your_groq_api_key_here
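The settings module at app/core/config.py typically loads this key (along with the tunable values referenced in the customization section below) via pydantic-settings. A minimal sketch; the exact fields and defaults in the real config.py may differ:

```python
# app/core/config.py (sketch)
from pydantic_settings import BaseSettings, SettingsConfigDict


class Settings(BaseSettings):
    # Loaded from the .env file created above.
    GROQ_API_KEY: str

    # Defaults mirror the examples in the customization section of this guide.
    LLM_MODEL: str = "llama3-8b-8192"
    EMBEDDING_MODEL: str = "sentence-transformers/all-mpnet-base-v2"
    CHUNK_SIZE: int = 1000
    CHUNK_OVERLAP: int = 200
    TOP_K_RESULTS: int = 5

    model_config = SettingsConfigDict(env_file=".env")


settings = Settings()
```

Next, create the empty __init__.py files so each directory is treated as a Python package: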
touch app/__init__.py
touch app/api/__init__.py
touch app/api/routes/__init__.py
touch app/core/__init__.py
touch app/db/__init__.py
touch app/models/__init__.py
touch app/services/__init__.py
touch app/utils/__init__.py
Place your PDF books in a directory and ingest them:
mkdir books
# Copy your PDF books into the books directory
python ingest.py --dir books
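ingest.py is the script invoked above. A rough sketch of what it does, following the ingestion pipeline described at the start of this guide (the index path, model name, and chunk settings are illustrative and should match your config.py):

```python
# ingest.py (sketch)
import argparse
from pathlib import Path

from langchain_community.document_loaders import PyPDFLoader
from langchain_community.vectorstores import FAISS
from langchain_huggingface import HuggingFaceEmbeddings
from langchain_text_splitters import RecursiveCharacterTextSplitter


def main() -> None:
    parser = argparse.ArgumentParser(description="Ingest PDFs into the vector store")
    parser.add_argument("--dir", required=True, help="Directory containing PDF files")
    args = parser.parse_args()

    # Load every PDF in the given directory.
    documents = []
    for pdf_path in sorted(Path(args.dir).glob("*.pdf")):
        documents.extend(PyPDFLoader(str(pdf_path)).load())

    # Split the documents into overlapping chunks.
    splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200)
    chunks = splitter.split_documents(documents)

    # Embed the chunks and persist the FAISS index.
    embeddings = HuggingFaceEmbeddings(model_name="sentence-transformers/all-mpnet-base-v2")
    vector_store = FAISS.from_documents(chunks, embeddings)
    vector_store.save_local("data/embeddings")  # illustrative path


if __name__ == "__main__":
    main()
```

With the books ingested, start the backend server: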
uvicorn app.main:app --reload --host 0.0.0.0 --port 8000
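The uvicorn command expects app/main.py to expose a FastAPI instance named app. A minimal sketch with CORS enabled for the Next.js dev server; it assumes chat.py defines an APIRouter named router, so adjust it to match the provided code:

```python
# app/main.py (sketch)
from fastapi import FastAPI
from fastapi.middleware.cors import CORSMiddleware

from app.api.routes import chat  # assumes chat.py defines `router = APIRouter(...)`

app = FastAPI(title="Personal Knowledge Assistant")

# Allow the Next.js dev server to call the API during development.
app.add_middleware(
    CORSMiddleware,
    allow_origins=["http://localhost:3000"],
    allow_credentials=True,
    allow_methods=["*"],
    allow_headers=["*"],
)

# The /api prefix matches the NEXT_PUBLIC_API_URL used by the frontend.
app.include_router(chat.router, prefix="/api")
```

With the backend running, set up the frontend: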
cd ../frontend
npx create-next-app@latest .
# Select Yes for TypeScript
# Select Yes for ESLint
# Select Yes for Tailwind CSS
# Select Yes for src/ directory
# Select Yes for App Router
# Select Yes for import alias
npm install lucide-react react-markdown
npx shadcn-ui@latest init
# Select Default for style
# Select Default for baseColor
# Select Yes for CSS variables
# Use App dir structure
# Select src/components for components directory
# Select @/components for import alias
# Select Yes for React Server Components
# Select Yes for tailwind.config.ts
# Select @/lib/utils for utils
# Install the required components
npx shadcn-ui@latest add button textarea card
Create a .env.local file in the frontend directory:
NEXT_PUBLIC_API_URL=http://localhost:8000/api
Replace the contents of the following files with the provided code:
src/app/page.tsx
src/app/layout.tsx
src/app/globals.css
tailwind.config.ts
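Then start the development server: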
npm run dev
Your application should now be running at http://localhost:3000.
- Navigate to http://localhost:3000 in your web browser.
- Ask questions about the books you've ingested.
- The application will search through the book content and provide relevant answers.
If you encounter issues with the vector store:
rm -rf data/vector_store
python ingest.py --dir books
If the frontend can't connect to the backend:
- Ensure the backend is running on port 8000.
- Check that CORS is properly configured.
- Verify your .env.local file has the correct API URL.
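To check the backend independently of the frontend, you can call the API directly. The route and payload below are assumptions based on the file names in this guide (chat.py, schemas.py), so adjust them to match your actual endpoint:

```python
# Quick connectivity check against the backend (endpoint and payload are assumptions).
import json
import urllib.request

payload = json.dumps({"query": "What is this book about?"}).encode("utf-8")
request = urllib.request.Request(
    "http://localhost:8000/api/chat",  # assumed route; match your chat.py
    data=payload,
    headers={"Content-Type": "application/json"},
)
with urllib.request.urlopen(request) as response:
    print(response.status, response.read().decode("utf-8"))
```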
If you encounter authentication errors:
- Double-check your Groq API key in the .env file.
- Ensure your HuggingFace token has the necessary permissions.
To change the LLM model, edit app/core/config.py:
LLM_MODEL: str = "your-preferred-model" # e.g., "llama3-8b-8192" for a smaller model
Edit app/core/config.py to customize the RAG behavior:
CHUNK_SIZE: int = 1000 # Increase for larger contexts
CHUNK_OVERLAP: int = 200 # Adjust to reduce information loss at chunk boundaries
TOP_K_RESULTS: int = 5 # Increase for more comprehensive context
Edit app/core/config.py to use a different embedding model:
EMBEDDING_MODEL: str = "your-preferred-embedding-model" # e.g., "sentence-transformers/all-mpnet-base-v2"
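Because the vectors stored in FAISS are tied to the model that produced them, re-run ingestion after changing EMBEDDING_MODEL (delete the existing index first, as shown in the troubleshooting steps above). Wherever the store is built or loaded, the setting is typically passed straight through, roughly like this (the settings import path is an assumption):

```python
# Sketch: constructing embeddings from the configured model name.
from langchain_huggingface import HuggingFaceEmbeddings

from app.core.config import settings  # assumed import path for the Settings instance

embeddings = HuggingFaceEmbeddings(model_name=settings.EMBEDDING_MODEL)
```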