Releases: mindsdb/aipdf

v0.0.6.3

12 Jun 00:53
b849772

Full Changelog: v0.0.6.2...v0.0.6.3

v0.0.6.2

29 May 22:45
38650dc

Full Changelog: v0.0.6...v0.0.6.2

v0.0.6

19 May 09:13
2035ba4

TL;DR

Concurrency Control

  • Introduced a limit on concurrent LLM API calls across both sync and async functions via the AIPDF_MAX_CONCURRENT_REQUESTS environment variable.
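
The release note does not say how the limit is enforced internally. A common way to cap concurrent requests is a semaphore sized from the environment variable; the following is a minimal sketch under that assumption. The function names, the simulated request, and the default of 5 are hypothetical and not part of aipdf.

```python
import asyncio
import os

# The variable name comes from the release note; the fallback of 5 is an assumption.
MAX_CONCURRENT = int(os.environ.get("AIPDF_MAX_CONCURRENT_REQUESTS", "5"))


async def fake_llm_call(page_number, semaphore, stats):
    """Hypothetical stand-in for a single LLM API request."""
    async with semaphore:  # waits here once `limit` calls are already in flight
        stats["active"] += 1
        stats["peak"] = max(stats["peak"], stats["active"])
        await asyncio.sleep(0.01)  # simulate network latency
        stats["active"] -= 1
        return f"page {page_number} done"


async def process_pages(page_count, limit=MAX_CONCURRENT):
    """Run one simulated call per page, never more than `limit` at a time."""
    semaphore = asyncio.Semaphore(limit)
    stats = {"active": 0, "peak": 0}
    results = await asyncio.gather(
        *(fake_llm_call(n, semaphore, stats) for n in range(page_count))
    )
    return results, stats["peak"]


if __name__ == "__main__":
    results, peak = asyncio.run(process_pages(10, limit=3))
    print(len(results), "pages processed; peak concurrency:", peak)
```

For a thread-based (sync) code path, threading.BoundedSemaphore can play the same role.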

Full Changelog: v0.0.5...v0.0.6

v0.0.5

01 May 10:52
38b6ec9

TL;DR

Performance Improvements

  • Pages are now evaluated for visual content before being passed to the LLM.
  • Non-visual pages are processed with traditional text parsing.
  • Pass use_llm_for_all=True to override this and send every page to the LLM.
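
The routing described above can be sketched as follows. The release note does not state how visual content is detected, so the Page type and the image-count heuristic here are assumptions made for illustration; only the use_llm_for_all flag name comes from the note.

```python
from dataclasses import dataclass


@dataclass
class Page:
    """Hypothetical page summary; not an aipdf type."""
    number: int
    text: str
    image_count: int  # assumed signal for "visual content"


def needs_llm(page, use_llm_for_all=False):
    """Send a page to the LLM if forced, or if it appears to contain visuals."""
    return use_llm_for_all or page.image_count > 0


def route_pages(pages, use_llm_for_all=False):
    """Map each page number to the processing path it would take."""
    return {
        page.number: "llm" if needs_llm(page, use_llm_for_all) else "text"
        for page in pages
    }


if __name__ == "__main__":
    pages = [Page(1, "Plain paragraph", 0), Page(2, "Chart caption", 2)]
    print(route_pages(pages))
    print(route_pages(pages, use_llm_for_all=True))
```

Skipping the LLM for text-only pages saves one API call per such page, which is presumably where the performance gain comes from.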

Dependency Cleanup

  • Replaced pdf2image with PyMuPDF for image conversion.
  • This removes the need for poppler, which simplifies installation.
  • Dependencies are cleaned up and version-pinned to avoid conflicts (e.g., with MindsDB).

Enhanced Configurability

  • Added support for additional OpenAI parameters (e.g., temperature).
  • API key can now be set via the AIPDF_API_KEY environment variable.
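
One way these two options could fit together is sketched below. Only the AIPDF_API_KEY variable name and the temperature example come from the release note; the helper names, the placeholder model name, and the rule that an explicit argument takes precedence over the environment are assumptions.

```python
import os


def resolve_api_key(api_key=None):
    """Return an explicit key if given, else fall back to AIPDF_API_KEY.

    The precedence order is an assumption, not documented aipdf behavior.
    """
    key = api_key or os.environ.get("AIPDF_API_KEY")
    if not key:
        raise ValueError("No API key given and AIPDF_API_KEY is not set")
    return key


def build_request_kwargs(model="example-model", **openai_params):
    """Merge extra parameters (e.g., temperature) into the request arguments.

    "example-model" is a placeholder, not a real model name.
    """
    kwargs = {"model": model}
    kwargs.update(openai_params)
    return kwargs


if __name__ == "__main__":
    os.environ.setdefault("AIPDF_API_KEY", "demo-key")
    print(resolve_api_key())
    print(build_request_kwargs(temperature=0.2))
```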

Async OCR Support

  • Introduced ocr_async() to make asynchronous OpenAI API calls.
  • Complements the existing multi-threaded ocr() function.
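
To illustrate the difference between the two styles, the sketch below runs the same per-page work once with a thread pool and once with asyncio. It does not call aipdf; transcribe_page and transcribe_page_async are hypothetical stand-ins for a per-page API request, and the signatures of the real ocr() and ocr_async() functions are not shown in this release note.

```python
import asyncio
from concurrent.futures import ThreadPoolExecutor


def transcribe_page(page_number):
    """Hypothetical blocking call, one per page."""
    return f"# Page {page_number}"


async def transcribe_page_async(page_number):
    """Hypothetical non-blocking call, one per page."""
    await asyncio.sleep(0)  # yield control, as a real network call would
    return f"# Page {page_number}"


def run_threaded(page_numbers, workers=4):
    """Multi-threaded style: blocking calls spread across worker threads."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(transcribe_page, page_numbers))  # keeps input order


async def run_async(page_numbers):
    """Async style: all calls scheduled on one event loop."""
    results = await asyncio.gather(
        *(transcribe_page_async(n) for n in page_numbers)
    )
    return list(results)  # gather also keeps input order


if __name__ == "__main__":
    print(run_threaded([1, 2, 3]))
    print(asyncio.run(run_async([1, 2, 3])))
```

The async form is usually the better fit when the caller already runs inside an event loop, such as an async web server.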

Code Quality & Testing

  • Refactored functions for better readability and maintainability.
  • Added unit and integration tests to ensure reliability.

Full Changelog: v0.0.4...v0.0.5

v0.0.4

14 Oct 23:59
17a6847

Full Changelog: v0.0.3...v0.0.4

v0.0.3

11 Oct 18:19

Full Changelog: v0.0.2...v0.0.3

v0.0.2

09 Oct 16:46
ae9041d
Update ocr.py

no max tokens

v0.0.1

04 Oct 17:01
c0a16b0