A Python library for scraping and document processing.
pip install raggy
For additional features:
pip install raggy[scrapling] # Enhanced web scraping via Scrapling
pip install raggy[chroma] # ChromaDB support
pip install raggy[tpuf] # TurboPuffer support
pip install raggy[pdf] # PDF processing
Read the docs
A Python library for:
- scraping the web to produce rich documents
- putting these documents in vectorstores
- querying the vectorstores to find documents similar to a query
Important
This project is licensed under the MIT License - see the LICENSE file for details.
When installing the optional [scrapling]
dependency, please note that Scrapling is licensed under the BSD-3-Clause license. By using this optional feature, you agree to comply with Scrapling's license terms.
We welcome contributions! See our contributing guide for details.