zzstoatzz/raggy

raggy

A Python library for scraping and document processing.

Installation

pip install raggy

For additional features:

pip install "raggy[scrapling]"  # Enhanced web scraping via Scrapling
pip install "raggy[chroma]"     # ChromaDB support
pip install "raggy[tpuf]"       # TurboPuffer support
pip install "raggy[pdf]"        # PDF processing

Read the docs

What is it?

A Python library for:

  • scraping the web to produce rich documents
  • putting these documents in vectorstores
  • querying the vectorstores to find documents similar to a query
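To make the store-then-query idea concrete, here is a minimal, standard-library-only sketch. It does not use raggy's API: `ToyVectorstore`, `embed`, and `cosine` are hypothetical names invented for illustration, and the bag-of-words "embedding" is a stand-in for the real embedding models that a backend such as ChromaDB or TurboPuffer would rely on.

```python
import math
import re
from collections import Counter


def embed(text: str) -> Counter:
    """Toy embedding (assumption): lowercase bag-of-words token counts."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[token] * b[token] for token in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class ToyVectorstore:
    """Hypothetical in-memory store; not a raggy class."""

    def __init__(self) -> None:
        self._items: list[tuple[str, Counter]] = []

    def add(self, documents: list[str]) -> None:
        # Embed each document once, at insertion time.
        for doc in documents:
            self._items.append((doc, embed(doc)))

    def query(self, text: str, n_results: int = 1) -> list[str]:
        # Rank stored documents by similarity to the query embedding.
        q = embed(text)
        ranked = sorted(self._items, key=lambda item: cosine(q, item[1]), reverse=True)
        return [doc for doc, _ in ranked[:n_results]]


store = ToyVectorstore()
store.add(
    [
        "Install the package with pip",
        "Vectorstores hold document embeddings",
        "Scraping turns web pages into documents",
    ]
)
top = store.query("how do I install with pip", n_results=1)
print(top)
```

In a real pipeline the documents would come from a scraper and the embeddings from a model; see the docs for the loaders and vectorstore classes raggy actually provides.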

Tip

See this example to chat with any website, or this example to chat with any GitHub repo.

License and Dependencies

Important

This project is licensed under the MIT License - see the LICENSE file for details.

The optional [scrapling] extra installs Scrapling, which is licensed under the BSD-3-Clause license. By using this optional feature, you agree to comply with Scrapling's license terms.

Contributing

We welcome contributions! See our contributing guide for details.