-
-
Notifications
You must be signed in to change notification settings - Fork 111
Court Websites
Mike Lissner edited this page Aug 11, 2014
·
32 revisions
This page keeps track of the progress of each court and its scraper. It is broken into three sections:
- Federal and Tribal Courts
- State and Territorial Courts
- Technique Notes
This page is a wiki. Please feel free to edit it as you work.
- Most Federal District Courts are going to be handled by the WrittenOpinions report provided by PACER.
- However, some District Courts have an opinions page that may be preferable or contain different material.
- We'll list those here:
- District Court of D.C. (2014 page; set up a year variable in main scraper)
- District Court of D.C. (everything for backscraper)
- Patent Trial and Appeal Board
- U.S. GAO
- Merit Systems Protection Board (Precedential) and Non-precedential
- U.S. Army Court of Criminal Appeals (Opinions, Memorandum Opinions, and Summary Dispositions)
- Navy-Marine Corps Court of Criminal Appeals
- U.S. Air Force Court of Criminal Appeals
- Coast Guard Court of Criminal Appeals
- Armed Services Board of Contract Appeals
- Alabama - TODO (hard) - DONE
-
Alaska - DONE
- xpath examples of child, descendant, following-sibling
- Multiple opinions listed under a single date
- Assumes precedential
-
Arizona - DONE
- Titlecase used on case_names field
- xpath examples of
'contains @id'
- Arizona Court of Appeals
- Division 1 DONE
- Division 2 - TODO (Brian and Deb worked on this but ultimately could not figure it out. Would get fewer old opinions, but a more reasonable page by using their RSS feed.)
- Arizona Tax Court TODO
-
Arkansas - DONE
- Titlecase used on judges field
- xpath examples all use
@class=
- HTML Element converted to string
- Assumes precedential
- Court of Appeals merely points to different URL
-
Arkansas Court of Appeals - DONE
- Extends ark.py
-
California - DONE
- RegEx used for case_names
- Replaces abbreviation of 'People' in case_names
- Handles multiple date formats
- Assumes precedential
- Courts of Appeal scrapers just add 'division' metadata
- Separate page for non-precedential opinions assumes non-precedential
- California Court of Appeals w/various divisions - DONE
- Colorado - DONE
- Court-provided pdf announcement difficult to work with
- Contact in mid-2013 yielded no change
- Need to use state bar page TODO
- State Bar page for Court of Appeals TODO
-
Connecticut - DONE
- Daily page + annual archive TODO
- Archive pages list multiple opinions under one date; see Alaska for example scraper.
- Delaware Supreme Court - DONE
- Delaware Lower Courts - DONE
- District of Columbia - DONE
-
Florida - DONE
- Lists multiple opinions under each date (see alaska)
- Many opinions are accompanied by revised opinions or separate orders that are not hyperlinked with their case name. Somewhat similar to padding strategy in /federal/district/dcd could be used.
-
Georgia - DONE
- Appears to be hand-written HTML with numerous mistakes that make accurate parsing a challenge.
-
Georgia Appeals Court
- May require a POST, otherwise looks easy.
- Hawaii - DONE
- Hawaii Appellate - DONE
- Idaho - DONE
- Idaho Civil - DONE
- Idaho Criminal - DONE
- Idaho Court Appeals Civil - DONE
- Idaho Court Appeals Criminal - DONE
- Idaho Unpublished - DONE
- Illinois Supreme - DONE
- Illinois Appeals - DONE
- Indiana Supreme - DONE
-
Iowa Supreme - DONE
- Releases opinions on Fridays and makes a separate page for each release date.
- Iowa Appeals - DONE
- Kansas Supreme and Appellate - DONE
-
Kentucky Supreme and Appellate - DONE
- This uses the venerable dtsearch on the backend, which will return everything if you search for 'xfirstword'. Unfortunately, the name of the case is not included in this, making it nearly useless. Good news! You can get the case name from a separate system!
-
Louisiana - DONE
- Only provides results by search query, but a complete result set can be found with the following query: "court OR louisiana OR facts OR page OR district OR circuit OR state OR see". Those are among the top non-stopwords in CourtListener
-
Maine - DONE
- Backscrapers to 1997 possible, but errors in handwritten HTML requires attention
- Maryland - DONE
- Maryland Court of Special Appeals - ** DONE**
- Massachusetts - DONE
- Michigan - DONE
- Minnesota - DONE
- Source can be this or this (research needed).
- Oral arg videos
-
Mississippi - DONE
- Completed by Raymond Yee
- Uses date-based URLs and beginning of year files.
- Missouri - DONE
- Supreme Court
- Court of Appeals
- Completed by Ben Cassidy
- Uses date-based URLs
- Montana - DONE
- Has extremely long URL. See file for details.
- Completed by mlr
- Nebraska - DONE
- Nebraska Court of Appeals - DONE
- Nevada DONE
-
New Hampshire - DONE
- Contains a fragment of code, but nothing to get excited about.
-
New Jersey Published - DONE
- Uses dynamic XPath queries to vary the scrapers for this and the NJ Super Ct scraper
- New Jersey Unpublished
- New Jersey Super Court Appellate Division - DONE
- New Mexico - DONE
- New Mexico Court of Appeals - DONE
- New York - DONE
- North Carolina - DONE
-
North Dakota - DONE
- Shows how two example files can be used for extensive testing.
- North Dakota Court of Appeals - DONE
- Ohio - DONE
- Oklahoma - DONE
- Oregon - DONE
- Pennsylvania - DONE
- Rhode Island - DONE
- South Carolina - DONE
- South Dakota - DONE
- Tennessee - DONE
- Tennessee Criminal Appellate Court - DONE
- Tennessee Court of Appeals - DONE
- Texas - DONE
- Texas Appellate Divisions - DONE
- Utah - DONE
- Vermont - DONE
- Virginia - DONE
- Virginia Court of Appeals - DONE
-
Washington - DONE
- Link requires numerous parameters -- see file for details.
-
Washington Court of Appeals - DONE
- Same URL as Wash, but with different parameters.
- West Virginia - DONE
- Wisconsin - DONE
- Wyoming - DONE
- American Samoa
- Guam Backscrapers to 1996 possible.
- Northern Marianas Islands Backscrapers to 1989 possible.
- Puerto Rico (en espanol)
- U.S. Virgin Islands Backscrapers to 2007 possible. Unpublished opinions also available.