Dataset recipes #153

@lorenzoh

With #151, FastAI.jl is getting high-level interfaces for searching datasets (finddatasets) and loading datasets into task-specific data containers (loaddataset). There is also a new DatasetRecipe that encapsulates configuration for loading a data container and the block information from a path. These recipes can be registered with a dataset so that they can be found using the above high-level functions.

The fastai dataset collection comes with quite a lot of datasets, and only a few have recipes so far. This issue tracks the progress on adding recipes to all the datasets. Contributions of recipe types and recipe configs for datasets are welcome.

See src/datasets/recipes.jl for example recipe implementations and src/datasets/fastairegistry for how recipes are registered. listdatasources() gives you a list of all dataset sources and datasetpath(name) downloads them and returns the download folder.
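To make the idea of registering recipes with a dataset and finding them by block types more concrete, here is a minimal conceptual sketch in Python. It is not FastAI.jl code and does not reflect the actual implementation in src/datasets/recipes.jl: the names Recipe, RecipeRegistry, register and find are hypothetical, blocks are modelled as plain strings, and the "BoundingBoxes" block name is a placeholder assumption. Only the pascal_2007 (Image{2}, LabelMulti) pairing comes from the list in this issue.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Recipe:
    """Hypothetical stand-in for a dataset recipe: a task label plus the blocks it yields."""
    label: str
    blocks: tuple


class RecipeRegistry:
    """Hypothetical registry mapping dataset names to the recipes registered for them."""

    def __init__(self):
        self._recipes = {}

    def register(self, dataset, recipe):
        # A dataset usable for several tasks simply gets several recipes.
        self._recipes.setdefault(dataset, []).append(recipe)

    def find(self, blocks=None):
        # Return (dataset, recipe) pairs; with blocks=None every recipe matches.
        return [
            (dataset, recipe)
            for dataset, recipes in self._recipes.items()
            for recipe in recipes
            if blocks is None or recipe.blocks == tuple(blocks)
        ]


registry = RecipeRegistry()
registry.register(
    "pascal_2007",
    Recipe("multi-label image classification", ("Image{2}", "LabelMulti")),
)
registry.register(
    "pascal_2007",
    # "BoundingBoxes" is a placeholder block name, assumed for illustration only.
    Recipe("object detection", ("Image{2}", "BoundingBoxes")),
)

# Search by block types, analogous in spirit to a block-based dataset search.
matches = registry.find(blocks=("Image{2}", "LabelMulti"))
for dataset, recipe in matches:
    print(dataset, "->", recipe.label)
```

The point of the sketch is only that one dataset can carry several recipes, one per task, and that a search by blocks returns the dataset and recipe pairs whose blocks match.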

Progress

For datasets that can be used for multiple tasks, the tasks are listed below the dataset. Otherwise, a checked dataset means that at least one recipe is already implemented.

  • CUB_200_2011
  • bedroom (not sure how the folders are laid out)
  • caltech_101
  • cifar10
  • cifar100
  • food-101
  • imagenette-160
  • imagenette-320
  • imagenette
  • imagenette2-160
  • imagenette2-320
  • imagenette2
  • imagewang-160
  • imagewang-320
  • imagewang
  • imagewoof-160
  • imagewoof-320
  • imagewoof
  • imagewoof2-160
  • imagewoof2-320
  • imagewoof2
  • mnist_png
  • mnist_var_size_tiny
  • oxford-102-flowers
  • oxford-iiit-pet
  • stanford-cars
  • ag_news_csv
  • amazon_review_full_csv
  • amazon_review_polarity_csv
  • dbpedia_csv
  • giga-fren
  • imdb
  • sogou_news_csv
  • wikitext-103
  • wikitext-2
  • yahoo_answers_csv
  • yelp_review_full_csv
  • yelp_review_polarity_csv
  • biwi_head_pose
  • camvid
  • pascal-voc
  • pascal_2007
    • multi-label image classification ((Image{2}, LabelMulti))
    • object detection
  • pascal_2012
  • siim_small
  • skin-lesion
  • tcga-small
  • adult_sample
  • biwi_sample
  • camvid_tiny
  • dogscats
  • human_numbers
  • imdb_sample
  • mnist_sample
  • mnist_tiny
  • movie_lens_sample
  • planet_sample
  • planet_tiny
  • coco_sample
  • coco-train2017
  • coco-val2017
  • coco-test2017
  • coco-unlabeled2017
  • coco-image_info_test2017
  • coco-image_info_unlabeled2017
  • coco-annotations_trainval2017
  • coco-stuff_annotations_trainval2017
  • coco-panoptic_annotations_trainval2017
