fix(ala): by fetching detailed publication data from new API endpoint #1759

Luis-manzur · 2026-01-12T17:00:19Z

for more information, see https://pre-commit.ci

…hanged-significantly' into 1758-the-ala-api-structure-has-changed-significantly # Conflicts: # juriscraper/opinions/united_states/state/ala.py

flooie · 2026-01-12T20:26:51Z

Take another look at this. Your data doesnt look right to me

flooie

Look at your output. it's not quite right.

juriscraper/opinions/united_states/state/ala.py

flooie · 2026-01-12T20:53:47Z

tests/examples/opinions/united_states/ala_example.json

-               {
-                  "publicationItemUUID":"93C503F6-7E90-40CF-9C0F-FA9C20A8A036",
-                  "docketEntryUUID":"73CC07D7-54C4-4BBA-9973-C1F4F5E5E3A3",
-                  "caseInstanceUUID":"F46D02FF-367A-46FF-


this file doesnt match what I expect, can you update the json to match the new api endpoint.

I updated all three example files, but just using the second API call json to do the testing

for more information, see https://pre-commit.ci

juriscraper/opinions/united_states/state/ala.py

flooie

a few more things

flooie · 2026-01-13T17:06:26Z

juriscraper/opinions/united_states/state/ala.py

+    def _download(self, request_dict=None):
+        """Download the publication list and then fetch detailed publication data.
+
+        The initial API returns a list of publications, but we need to fetch
+        the detailed publication endpoint to get full case information.
+        """
+        if self.test_mode_enabled():
+            return super()._download(request_dict)
+
+        # First, get the list of publications
+        html = super()._download(request_dict)
+
+        # Get the publicationUUID from the initial response
+        releases = html["_embedded"]["results"]
+        publication_uuid = releases[0].get("publicationUUID")

-        # Processes only the first result to scrape the most recent data.
-        item = self.json["_embedded"]["results"][0]
+        # Fetch detailed publication data
+        self.url = f"{self.base_url}/courts/{self.court_str}/cms/publication/{publication_uuid}"
+        return super()._download(request_dict)


maybe something like this

def _download(self, request_dict=None): """Download the publication list and then fetch detailed publication data. The initial API returns a list of publications, but we need to fetch the detailed publication endpoint to get full case information. """ if self.test_mode_enabled(): return super()._download(request_dict) resp = super()._download(request_dict) releases = resp["_embedded"]["results"] publication_uuid = releases[0].get("publicationUUID") self.url = f"{self.base_url}/courts/{self.court_str}/cms/publication/{publication_uuid}" self.json = super()._download(request_dict)

and drop the item = self.html since this is json

I feel like this isnt resolved

…ficantly # Conflicts: # CHANGES.md

flooie · 2026-01-13T21:27:55Z

juriscraper/opinions/united_states/state/ala.py

+            return super()._download(request_dict)
+
+        # First, get the list of publications
+        html = super()._download(request_dict)


maybe this should be called resp and not html since its not html

flooie · 2026-01-13T21:28:49Z

juriscraper/opinions/united_states/state/ala.py

+                r"\((?:Appeal from ([^:]+):\s*([^)]+)|[^;]+;\s*([^:]+Appeals):\s*([^)]+))\)",
                name,
            )
            if match:
-                lower_court = match.group("lower_court").strip()
-                lower_court_number = match.group("lower_court_number").strip()
-                # Remove the parenthetical from the name
-                name = name[: match.start()].rstrip()
+                # Groups 1,2 for "Appeal from"; groups 3,4 for Ex parte format
+                lower_court = (match.group(1) or match.group(3) or "").strip()
+                lower_court_number = (
+                    match.group(2) or match.group(4) or ""


why did you strip out the group names

flooie · 2026-01-13T21:29:30Z

juriscraper/opinions/united_states/state/ala.py

+                    name = re.sub(
+                        r"\s*PETITION FOR WRIT OF .+?(?=\(|$)", "", name
+                    ).strip()
+                    name = re.sub(r"\s*\(In re:\s*.+?\)", "", name).strip()


this makes me think we are collecting things we shouldnt.

we dont wnat to remove in re from case names but we dont want to be collecting petitions for writ of anything I think.

…ficantly

…hanged-significantly' into 1758-the-ala-api-structure-has-changed-significantly

for more information, see https://pre-commit.ci

fix(ala): by fetching detailed publication data from new API endpoint

acef09d

Luis-manzur requested review from flooie and grossir January 12, 2026 17:00

Luis-manzur assigned flooie Jan 12, 2026

Luis-manzur added this to Case Law Sprint Jan 12, 2026

Luis-manzur linked an issue Jan 12, 2026 that may be closed by this pull request

The ala API structure has changed significantly. #1758

Open

Luis-manzur moved this to PRs to Review in Case Law Sprint Jan 12, 2026

pre-commit-ci bot and others added 3 commits January 12, 2026 17:00

[pre-commit.ci] auto fixes from pre-commit.com hooks

0bf4130

for more information, see https://pre-commit.ci

fix(ala): update tests files

f71f822

Merge remote-tracking branch 'origin/1758-the-ala-api-structure-has-c…

cf3da21

…hanged-significantly' into 1758-the-ala-api-structure-has-changed-significantly # Conflicts: # juriscraper/opinions/united_states/state/ala.py

flooie requested changes Jan 12, 2026

View reviewed changes

flooie reviewed Jan 12, 2026

View reviewed changes

juriscraper/opinions/united_states/state/ala.py Outdated Show resolved Hide resolved

flooie reviewed Jan 12, 2026

View reviewed changes

juriscraper/opinions/united_states/state/ala.py Outdated Show resolved Hide resolved

flooie reviewed Jan 12, 2026

View reviewed changes

Luis-manzur and others added 2 commits January 12, 2026 18:48

fix(ala): refactor API fetch logic and improve case name parsing

b77f69d

[pre-commit.ci] auto fixes from pre-commit.com hooks

8c7e026

for more information, see https://pre-commit.ci

Luis-manzur requested a review from flooie January 12, 2026 22:53

flooie reviewed Jan 13, 2026

View reviewed changes

juriscraper/opinions/united_states/state/ala.py Outdated Show resolved Hide resolved

flooie requested changes Jan 13, 2026

View reviewed changes

flooie removed the request for review from grossir January 13, 2026 17:06

flooie assigned Luis-manzur and unassigned flooie Jan 13, 2026

Luis-manzur added 3 commits January 13, 2026 14:32

Merge branch 'main' into 1758-the-ala-api-structure-has-changed-signi…

4398977

…ficantly # Conflicts: # CHANGES.md

refactor(ala): simplify _process_html

b3740b7

refactor(ala): clean params and url definition

d26be02

Luis-manzur requested a review from flooie January 13, 2026 19:30

Luis-manzur assigned flooie and unassigned Luis-manzur Jan 13, 2026

flooie reviewed Jan 13, 2026

View reviewed changes

flooie assigned Luis-manzur and unassigned flooie Jan 13, 2026

refactor(ala): Skip petition cases

c52dd84

Luis-manzur requested a review from flooie January 15, 2026 19:47

Luis-manzur assigned flooie and unassigned Luis-manzur Jan 15, 2026

Luis-manzur and others added 4 commits January 15, 2026 20:46

Merge branch 'main' into 1758-the-ala-api-structure-has-changed-signi…

50a9c29

…ficantly

refactor(ala): Use self.json and restore named regex groups

c9d3319

Merge remote-tracking branch 'origin/1758-the-ala-api-structure-has-c…

ad7fc2b

…hanged-significantly' into 1758-the-ala-api-structure-has-changed-significantly

[pre-commit.ci] auto fixes from pre-commit.com hooks

ebe3782

for more information, see https://pre-commit.ci

Uh oh!

fix(ala): by fetching detailed publication data from new API endpoint #1759

Are you sure you want to change the base?

fix(ala): by fetching detailed publication data from new API endpoint #1759

Uh oh!

Conversation

Luis-manzur commented Jan 12, 2026

Uh oh!

flooie commented Jan 12, 2026

Uh oh!

flooie left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

flooie left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants