Releases: hhursev/recipe-scrapers
15.2.1
15.2.0
Adds support for:
- donalskehan.com (#1237)
- eatthismuch.com (#1237)
- glutenfreeonashoestring.com (#1220)
- miljuschka.nl (#1254)
- theglutenfreeaustrian.com (#1220)
- theloopywhisk.com (#1220)
Bugfixes:
- Fix ingredient grouping retrieval for arla (#1236)
- Remove workaround from kitchenstories (#1240)
- Update/fix scraper for gesund+aktiv (#1238)
- Update/fix scraper for realsimple (#1242)
- Update/fix scraper for sallysblog (#1243)
- Update/fix scraper for tofoo (#1244)
- Update/fix scraper for wikicookbook (#1239)
Misc:
- Drops support for Python 3.8 (#1265)
- Modernize some type hints given that Python 3.9 has become baseline (#1267)
- Adds an enforced order for nutritional expectations in JSON test data (#1203)
- Refactor the
generate.py
scraper-generator script (#1232) - Raise a newly-added
RecipeSchemaNotFound
exception when the schema.org-fill plugin fails to find metadata on the page (#1230) - Require only
ingredients_list
, not alsoingredients
, in the test JSON data (#1211) - Cleanups: remove redundant methods and test expectation data (#1197, #1228, #1255)
Thank you, @jaspervzwi @jknndy! π
14.60.0
Adds support for:
- donalskehan.com (#1237)
- eatthismuch.com (#1237)
- glutenfreeonashoestring.com (#1220)
- miljuschka.nl (#1254)
- theglutenfreeaustrian.com (#1220)
- theloopywhisk.com (#1220)
Bugfixes:
- Fix ingredient grouping retrieval for arla (#1236)
- Remove workaround from kitchenstories (#1240)
- Update/fix scraper for gesund+aktiv (#1238)
- Update/fix scraper for realsimple (#1242)
- Update/fix scraper for sallysblog (#1243)
- Update/fix scraper for tofoo (#1244)
- Update/fix scraper for wikicookbook (#1239)
Misc:
- Drops support for Python 3.8 (#1265)
- Modernize some type hints given that Python 3.9 has become baseline (#1267)
- Adds an enforced order for nutritional expectations in JSON test data (#1203)
- Refactor the
generate.py
scraper-generator script (#1232) - Raise a newly-added
RecipeSchemaNotFound
exception when the schema.org-fill plugin fails to find metadata on the page (#1230) - Require only
ingredients_list
, not alsoingredients
, in the test JSON data (#1211) - Cleanups: remove redundant methods and test expectation data (#1197, #1228, #1255)
Thank you, @jaspervzwi @jknndy! π
15.1.0
Adds support for:
- books.ottolenghi.co.uk (#1227)
- celebratingsweets.com (#1196)
- ingoodflavor.com (#1196)
- kalejunkie.com (#1204)
- kitchendreaming.com (#1196)
- leitesculinaria.com (#1196)
- myvegetarianroots.com (#1196)
- peelwithzeal.com (#1196)
- sandwichtribunal.com (#1196)
Improvements:
- streetkitchen.hu: additional field retrievals (#1183)
- whatsgabycooking.com: ingredient group support (#1210)
Bugfixes:
- streetkitchen.hu: field retrieval fixups (#1183)
Thank you, @bartaldavid @jknndy @tomtjes!
15.0.1
Bugfixes:
- mob.co.uk: rating-rounding and site name retrieval (#1198)
- reishunger.de: HTML and scraping refresh (#1202)
- weightwatchers.*: dynamic category retrieval, language correction (#1213, #1216)
- abstract schema.org: fixup for retrieval of graph lists (#1199)
Misc:
14.59.0
Adds support for:
- books.ottolenghi.co.uk (#1227)
- celebratingsweets.com (#1196)
- ingoodflavor.com (#1196)
- kalejunkie.com (#1204)
- kitchendreaming.com (#1196)
- leitesculinaria.com (#1196)
- myvegetarianroots.com (#1196)
- peelwithzeal.com (#1196)
- sandwichtribunal.com (#1196)
Improvements:
- streetkitchen.hu: additional field retrievals (#1183)
- whatsgabycooking.com: ingredient group support (#1210)
Bugfixes:
- streetkitchen.hu: field retrieval fixups (#1183)
Thank you, @bartaldavid @jknndy @tomtjes!
14.58.3
Bugfixes:
- mob.co.uk: rating-rounding and site name retrieval (#1198)
- reishunger.de: HTML and scraping refresh (#1202)
- weightwatchers.*: dynamic category retrieval, language correction (#1213, #1216)
- abstract schema.org: fixup for retrieval of graph lists (#1199)
Misc:
15.0.0
π π π₯ 15.0.0 π π₯ π
Welcome to version 15.0.0 of recipe-scrapers
, a simple scraping tool for recipe webpages.
This version is a major version upgrade, and contains some breaking changes from the 14.x series.
In particular:
- The
scrape_me
function to the library has been removed from the 15.x series - please migrate to use the recommendedscrape_html
function instead.- This includes removal of the generic
options
arguments that were previously passed torequests
. Please configure the options for your HTTP client (requests
,httpx
,urllib.request
or other) directly. - If you need more time to migrate: don't worry, the 14.x series will continue to be maintained and updated until we're comfortable that the upgraded version is in regular use.
- This includes removal of the generic
To get started and for usage examples, see the README
file.
Features
- Support for America's Test Kitchen, Cook's Country, and Cook's Illustrated have been added (#1060) - thank you, @smilerz!
Scrapers removed
Unfortunately the updated scraper interface is more restrictive, by-design, than 14.x - each scraper accepts an individual HTML page as input, and should not directly make additional HTTP requests. During adaptation to this interface, we dropped support for the following scrapers:
- coop.se
- gousto.co.uk
- kptncook.com
- madewithlau.com
- marleyspoon.com
- monsieur-cuisine.com
- woolworths.com.au
This makes scrapers simpler, and that means they're easier to review and support. But it is a trade-off against the coverage of recipe sites that we can achieve.
Dependencies removed
requests
is no longer a hard dependency of the library, but is available as an optional dependency.responses
is no longer required to run unit tests for the library.
Edit: add missing (and important!) scrapers-removed section.
Edit: section-heading nesting consistency.
14.58.2
This release only affects the unit test suite, and as a result is functionally identical to the 14.58.1 release. This release should be the nearest to the upcoming 15.0.0 release temporally, and relatively similar in terms of feature support.
Misc:
- Unit tests: consider 'ingredient_groups' field optional instead of mandatory (#1184).
Normally I'd thank our contributors in this release-note footer - but unusually, the only change in this release was one of mine, but I can thank @jknndy for code reviewing π
14.58.1
Bugfixes:
- The
nosalty.hu
scraper has been added to theSCRAPERS
supported-hosts dictionary (#1188). - Misleading deprecation notices about
options
had been emitted when using thewild_mode
keyword argument; these no longer occur (#1191). wild_mode
was incorrectly enabled whenscrape_html
was called on recipe URLs for unsupported sites (#1186).
Misc:
- Consistent ordering of JSON keys in test data, to support code review (#1151).
- A deprecation notice for the
scrape_me
function has been added (#1192). - Updates to the
README
based on user support feedback (#1182).
WIth thanks to @jknndy for the consistent JSON key ordering improvements! π