feat: add PDF Keyword Highlighter script (closes #478) #522

SurfyPenguin · 2025-12-30T21:17:47Z

PR Title

feat: add PDF Keyword Highlighter script (closes #478 )

Summary

Added a new command-line Python script that highlights specified keywords in PDF files using PyMuPDF, complete with a dedicated folder, README, and entry in the main repository README.

Description

This pull request implements a fully featured PDF keyword highlighter as requested in issue #478, creating a new highlighted output file while keeping the original unchanged.

The changes are as follows:

Created new folder PDF Highlighter Script/ with pdf_highlight.py and a README.md
Implemented efficient keyword highlighting using page.get_text("words") for fast text extraction
Supported multiple keywords, optional case-sensitive search (-s flag), and punctuation stripping for accurate matching (e.g., "keyword;" matches "keyword")
Printed per-page and total highlight statistics in a formatted table
Updated root README.md to add the new script entry in alphabetical order

Checks

in the repository

Made no changes that degrades the functioning of the repository
Gave each commit a better title (unlike updated README.md)

in the PR

Followed the format of the pull_request_template
Made the Pull Request in a small level (for the creator's wellfare)
Tested the changes you made

Thank You,

Amartya Anand

…ink to main branch

SurfyPenguin added 3 commits December 31, 2025 02:27

feat: add PDF Keyword Highlighter script (closes wasmerio#478)

f717529

fix: correct folder name in README

e8f8a5d

fix: update usage examples to use correct filename and point README l…

a26518a

…ink to main branch

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: add PDF Keyword Highlighter script (closes #478) #522

feat: add PDF Keyword Highlighter script (closes #478) #522

SurfyPenguin commented Dec 30, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Uh oh!

feat: add PDF Keyword Highlighter script (closes #478) #522

Are you sure you want to change the base?

feat: add PDF Keyword Highlighter script (closes #478) #522

Conversation

SurfyPenguin commented Dec 30, 2025

PR Title

Summary

Description

The changes are as follows:

Checks

in the repository

in the PR

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant