feat: add error handling for scrapers with expected results #1449

Luis-manzur · 2025-06-16T21:59:36Z

This pull request introduces error handling for scrapers that are expected to return results but fail to do so. The changes include updates to the CHANGES.md file to document the new feature, as well as modifications to the AbstractSite class in juriscraper to implement the functionality.

Documentation Updates:

CHANGES.md: Added a note under "Features" about the new error handling for scrapers with expected results.

Code Enhancements:

juriscraper/AbstractSite.py:
- Added a new should_have_results attribute in the __init__ method to indicate whether a scraper is expected to return results.
- Updated the _check_sanity method to log an error if should_have_results is True and no results are returned, while maintaining a warning for cases where results are not required.

grossir

For this to be useful you will have to go through the scrapers one by one and identify those that "should_have_results" and set that attribute to true

Luis-manzur · 2025-06-17T15:54:29Z

For this to be useful you will have to go through the scrapers one by one and identify those that "should_have_results" and set that attribute to true

I was wondering if we could do separate issues to not overload this PR.

…-are-found

flooie · 2025-06-17T18:22:50Z

I think I agree with @Luis-manzur that adding this field should be a separate PR.

grossir · 2025-06-17T20:27:32Z

I'd prefer all the changes to be together for the reasons below, but feel free to approve it and merge it as is

the PR is not cluttered as it is right now, it's a few lines in a single file
changes don't really affect anything without changing the relevant scraper files, so there is not much to review here
you will need to open, link to the issue (or another one), and review another PR instead of doing it now, which is more clerical work

…etect failing scrapers when no results are found

…n-no-results-are-found' into 1447-detect-failing-scrapers-when-no-results-are-found

flooie · 2025-06-23T17:02:46Z

Howdid you find which ones to update?

…g scrapers when no results are found

…failing scrapers when no results are found

flooie · 2025-07-02T15:54:58Z

@Luis-manzur can you resolve conflicts and respond to my question?

Luis-manzur · 2025-07-02T19:04:59Z

to Identify the sites that needed this update I looked up inside the code of each one looking that the there were no filtering before or after the first request, and confirmed going into the court page. also I left outside sites that the court page don't need any filtering but they clear the opinion list each month/year.

…-are-found # Conflicts: # CHANGES.md # juriscraper/opinions/united_states/state/tenn.py

…-are-found # Conflicts: # CHANGES.md

feat: add error handling for scrapers with expected results

f8930bb

Luis-manzur requested review from flooie and grossir June 16, 2025 21:59

Luis-manzur assigned flooie Jun 16, 2025

Luis-manzur added this to Case Law Sprint Jun 16, 2025

Luis-manzur linked an issue Jun 16, 2025 that may be closed by this pull request

Detect failing scrapers when no results are found #1447

Open

grossir reviewed Jun 17, 2025

View reviewed changes

Luis-manzur moved this to PRs to Review in Case Law Sprint Jun 17, 2025

Merge branch 'main' into 1447-detect-failing-scrapers-when-no-results…

41a2ef6

…-are-found

Luis-manzur added 2 commits June 23, 2025 12:14

feat: add should_have_results flag to admin and federal scrapers to d…

e611af3

…etect failing scrapers when no results are found

Merge remote-tracking branch 'origin/1447-detect-failing-scrapers-whe…

62bb6e7

…n-no-results-are-found' into 1447-detect-failing-scrapers-when-no-results-are-found

Luis-manzur added 2 commits June 23, 2025 14:49

feat: add should_have_results flag to state scrapers to detect failin…

02eb9a6

…g scrapers when no results are found

feat: add should_have_results flag to territories scrapers to detect …

12fd9d0

…failing scrapers when no results are found

flooie assigned Luis-manzur and unassigned flooie Jul 2, 2025

Merge branch 'main' into 1447-detect-failing-scrapers-when-no-results…

1433309

…-are-found # Conflicts: # CHANGES.md # juriscraper/opinions/united_states/state/tenn.py

Luis-manzur assigned flooie and unassigned Luis-manzur Jul 2, 2025

Merge branch 'main' into 1447-detect-failing-scrapers-when-no-results…

e975bbd

…-are-found # Conflicts: # CHANGES.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

feat: add error handling for scrapers with expected results #1449

feat: add error handling for scrapers with expected results #1449

Uh oh!

Luis-manzur commented Jun 16, 2025

Uh oh!

grossir left a comment

Uh oh!

Luis-manzur commented Jun 17, 2025

Uh oh!

flooie commented Jun 17, 2025

Uh oh!

grossir commented Jun 17, 2025

Uh oh!

flooie commented Jun 23, 2025

Uh oh!

flooie commented Jul 2, 2025

Uh oh!

Luis-manzur commented Jul 2, 2025

Uh oh!

Uh oh!

Uh oh!

feat: add error handling for scrapers with expected results #1449

Are you sure you want to change the base?

feat: add error handling for scrapers with expected results #1449

Uh oh!

Conversation

Luis-manzur commented Jun 16, 2025

Documentation Updates:

Code Enhancements:

Uh oh!

grossir left a comment

Choose a reason for hiding this comment

Uh oh!

Luis-manzur commented Jun 17, 2025

Uh oh!

flooie commented Jun 17, 2025

Uh oh!

grossir commented Jun 17, 2025

Uh oh!

flooie commented Jun 23, 2025

Uh oh!

flooie commented Jul 2, 2025

Uh oh!

Luis-manzur commented Jul 2, 2025

Uh oh!

Uh oh!