937 alaska and alaskactapp missing opinions #1476

Open
wants to merge 8 commits into
base: main

Conversation

Luis-manzur

This pull request enhances the functionality of the alaska scraper by adding a fallback mechanism to retrieve PDF download URLs from an alternate case page when they are not available in the main table. Additionally, the CHANGES.md file has been updated to document this improvement.

@Luis-manzur Luis-manzur requested a review from flooie July 1, 2025 22:01
@Luis-manzur Luis-manzur linked an issue Jul 1, 2025 that may be closed by this pull request
@Luis-manzur Luis-manzur moved this to PRs to Review in Case Law Sprint Jul 1, 2025
Comment on lines 58 to 63
if not self.test_mode_enabled():
    # If there is no link in the first column, find it inside the case page
    url = self.retrieve_pdf_from_alternate_page(row)
    if not url:
        continue
continue

This code doesn't work. The unconditional `continue` on line 63 means the secondary fetch still runs, but its result is never used.

Also I think we need a placeholder value for the test page here.
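
A minimal sketch of the control flow these two comments seem to ask for, assuming the fallback result should be used whenever it is non-empty and that test mode should get a placeholder URL rather than a network fetch. The names `resolve_download_url`, `fetch_alternate`, and `PLACEHOLDER_URL` are hypothetical and only for illustration; they are not the scraper's actual API.

```python
from typing import Callable, Optional

# Hypothetical placeholder for test mode, so example files still yield
# a case without any request to the secondary page.
PLACEHOLDER_URL = "https://example.com/placeholder.pdf"


def resolve_download_url(
    primary_url: Optional[str],
    fetch_alternate: Callable[[], Optional[str]],
    test_mode: bool,
) -> Optional[str]:
    """Return the URL to use for a row, or None if the row should be skipped."""
    if primary_url:
        # The main table already has a link; no secondary fetch needed.
        return primary_url
    if test_mode:
        return PLACEHOLDER_URL
    # Skip the row only when the fallback also comes back empty.
    return fetch_alternate() or None
```

In the row loop, the caller would `continue` only when this returns `None`, so a successful secondary fetch is no longer discarded.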

@flooie flooie left a comment

By default, this scraper scrapes the entire page, which is way too much. We should limit it to the last 30 days; the backscraper can handle anything older.

It would be no wonder if this is failing, since we hit hundreds of secondary pages every time this crawler runs. Let's tighten this up and make it much nicer; otherwise I think we will get blocked soon.

It looks like the current version just skips every row that has a citation, so those opinions never get added. So I'm not going to disable it.
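
One way the 30-day limit could look, as a rough sketch. It assumes each row's date is already parsed into a `datetime.date`; `within_window` and its `days` parameter are hypothetical names, not existing scraper code.

```python
from datetime import date, timedelta


def within_window(decided: date, today: date, days: int = 30) -> bool:
    """True if `decided` falls within the last `days` days, inclusive."""
    cutoff = today - timedelta(days=days)
    return cutoff <= decided <= today


# Example: keep only recent rows before doing any secondary fetch.
today = date(2025, 7, 2)
rows = [date(2025, 6, 15), date(2025, 5, 1), date(2025, 6, 2)]
recent = [d for d in rows if within_window(d, today)]
```

Filtering before the secondary fetch is what keeps the number of case-page requests small; older rows would be left to the backscraper.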

flooie commented Jul 2, 2025

Also, I think there are two other pages that should be scraped as well.

https://appellate-records.courts.alaska.gov/CMSPublic/Home/AppellateOpinions lists three pages:

Memorandum opinions, I think.
These should be treated as unpublished Supreme Court opinions (_u).
https://appellate-records.courts.alaska.gov/CMSPublic/Home/MOJOpinions?isCOA=False

Published orders
https://appellate-records.courts.alaska.gov/CMSPublic/Home/PublishedOrders?isCOA=False
We should collect these, as they are definitely in the Pacific Reporter.

And the slip opinions that we currently collect.
https://appellate-records.courts.alaska.gov/CMSPublic/Home/AppellateOpinions
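
For reference, the three pages could be captured in a small table like the sketch below. The URLs come from the comment above; the `OpinionSource` type and the status strings are assumptions for illustration. In particular, treating published orders and slip opinions as "Published" is inferred from the comment, not confirmed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class OpinionSource:
    label: str
    url: str
    status: str  # assumed precedential status label


BASE = "https://appellate-records.courts.alaska.gov/CMSPublic/Home"

SOURCES = [
    # Memorandum opinions: suggested above as unpublished (_u).
    OpinionSource("Memorandum opinions", f"{BASE}/MOJOpinions?isCOA=False", "Unpublished"),
    # Published orders: said above to appear in the Pacific Reporter.
    OpinionSource("Published orders", f"{BASE}/PublishedOrders?isCOA=False", "Published"),
    # Slip opinions: the page the scraper currently collects.
    OpinionSource("Slip opinions", f"{BASE}/AppellateOpinions", "Published"),
]
```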

@flooie flooie assigned Luis-manzur and unassigned flooie Jul 2, 2025
@Luis-manzur

I'll create a new issue for this.

@Luis-manzur

Everything should be done except the disposition

@Luis-manzur Luis-manzur assigned flooie and unassigned Luis-manzur Jul 7, 2025
Labels
None yet
Projects
Status: PRs to Review
Development

Successfully merging this pull request may close these issues.

alaska and alaskactapp missing opinions
2 participants