Releases: bio-guoda/preston

0.1.2

14 Jul 01:47

Features

Improvements

  1. introduce a way to (re)process provenance logs via stdin (see #15)
  2. retire unused "resume" update option

Bugs

  1. add hadMember relationships between BioCASe registry, datasets and their associated archives (#14)

0.1.1

02 Jul 22:16

Features

Improvements

  1. stamp the project version into the new resources/preston.properties file and print this string for the version command

Bugs

  1. show the correct output for the version command when running in Docker (related to #11)

0.1.0

28 Jun 19:18

Features

  1. support for remote gzipped tarballs. To help publish many files to existing publication platforms (e.g., Internet Archive, Zenodo), data dirs can be packed into tarballs like preston-aa.tar.gz, where "aa" are the first two characters of the content hashes of the files contained in the tarball. For an example publication, see Poelen, Jorrit H. (2019). A biodiversity dataset graph: Biodiversity Heritage Library (BHL) (Version 0.0.1) [Data set]. Zenodo. http://doi.org/10.5281/zenodo.3251134 and https://archive.org/details/preston-bhl.
  2. command line progress bar. To indicate that content is being downloaded, show a progress bar like:
$ preston clone https://archive.org/download/preston-bhl
[https://archive.org/download/preston-bhl/2a5de79372318317a382ea9a2cef069780b852b01210ef59e06b640a3539cb5a] 100.0% of 78 bytes at 0.07 MB/s completed in < 1 minute
[https://archive.org/download/preston-bhl/89926f33157c0ef057b6de73f6c8be0060353887b47db251bfd28222f2fd801a] 0.8% of 236 MB at 0.43 MB/s ETA: +9 minutes
  3. introduce clone command to explicitly clone a remote Preston repository. Can be chained to include many repositories. E.g.,
$ preston clone https://example.org https://archive.org/download/preston-bhl

would first try to get index / provenance and associated content via https://example.org, then try https://archive.org/download/preston-bhl. This allows content to be distributed across an arbitrary number of remotes.

  4. make local data/tmp dir configurable. The local data and tmp dirs can now be configured with the options --data-dir and --tmp-dir, respectively.
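
The tarball naming scheme described in the first feature above can be illustrated with a minimal Python sketch. It is not Preston's implementation: the function names `tarball_name` and `group_by_tarball` are hypothetical, and the only rule taken from the release note is that the two characters after `preston-` are the first two characters of a content hash. The two hashes are the ones shown in the progress bar example.

```python
from collections import defaultdict


def tarball_name(content_hash: str, prefix: str = "preston") -> str:
    """Return the tarball name for a hex content hash (hypothetical helper)."""
    # Rule from the release note: the name carries the first two hash characters.
    return f"{prefix}-{content_hash[:2]}.tar.gz"


def group_by_tarball(content_hashes):
    """Map each tarball name to the sorted list of hashes it would contain."""
    groups = defaultdict(list)
    for content_hash in content_hashes:
        groups[tarball_name(content_hash)].append(content_hash)
    return {name: sorted(members) for name, members in groups.items()}


hashes = [
    "2a5de79372318317a382ea9a2cef069780b852b01210ef59e06b640a3539cb5a",
    "89926f33157c0ef057b6de73f6c8be0060353887b47db251bfd28222f2fd801a",
]
groups = group_by_tarball(hashes)
for name in sorted(groups):
    print(name, len(groups[name]))
```

With two-character hexadecimal prefixes, content is spread over at most 256 tarballs, which keeps the number of files to upload to a publication platform small.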

Improvements

  1. stop processing when downstream pipe has closed (see #13)
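
As a general illustration of this behaviour (for example, when output is piped into a command that exits early), the following Python sketch stops producing output once the reader has gone away. Preston itself is not written in Python, and `emit_until_closed` is a hypothetical name, so this only shows the idea under those assumptions.

```python
import io


def emit_until_closed(lines, out):
    """Write lines to out and stop quietly once the downstream pipe is closed.

    Returns the number of lines that were written successfully.
    """
    written = 0
    for line in lines:
        try:
            out.write(line + "\n")
            out.flush()
        except BrokenPipeError:
            # The consumer has exited; producing more output would be wasted work.
            break
        written += 1
    return written


# Usage with an in-memory stream that never closes: all lines are written.
buffer = io.StringIO()
count = emit_until_closed(["first", "second", "third"], buffer)
```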

Bugs

0.0.16

14 Jun 20:55

Features

Improvements

  1. support multiple Preston remotes using --remotes to allow cascading retrieval of provenance record history, provenance records and tracked content. Provides a solution for #8. For example, preston history --remotes https://example.org/prov/,https://deeplinker.bio would first attempt to find a Preston provenance history at https://example.org/prov/, then try https://deeplinker.bio.
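
The cascading retrieval described above can be sketched in Python as trying each remote in order and returning the first hit. This is an illustration, not Preston's code: `fetch_from_remotes` is a hypothetical name, the `fetch` callable stands in for an HTTP request so the sketch runs without network access, and the contents of `FAKE_WEB` are made up.

```python
from typing import Callable, Iterable, Optional


def fetch_from_remotes(
    remotes: Iterable[str],
    content_hash: str,
    fetch: Callable[[str], Optional[bytes]],
) -> Optional[bytes]:
    """Return content from the first remote that has it, or None if none do."""
    for remote in remotes:
        url = remote.rstrip("/") + "/" + content_hash
        content = fetch(url)
        if content is not None:
            return content  # stop at the first remote that answers
    return None


# Made-up stand-in for the network; "af135e" is a shortened placeholder,
# not a real content hash.
FAKE_WEB = {"https://deeplinker.bio/af135e": b"provenance"}

remotes = ["https://example.org/prov/", "https://deeplinker.bio"]
result = fetch_from_remotes(remotes, "af135e", FAKE_WEB.get)
```

Because remotes are tried in the order given, listing a nearby or preferred mirror first means later remotes are only contacted when earlier ones do not have the content.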

Bugs

0.0.15

12 Jun 03:33

Features

Improvements

  1. introduce --no-cache option to avoid having to locally cache remote content

Bugs

0.0.14

07 Jun 23:07

Features

Improvements

Bugs

  1. avoid crashing on reaching the end of history for a remote when executing history, ls or verify. Bug introduced in 0.0.13 with the sticky fail-over feature.

0.0.13

07 Jun 22:30

Features

Improvements

  1. allow remotes to serve content for hashes using a pattern like https://example.com/af/13/af135e... in addition to https://example.com/af135e...
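
The two URL layouts mentioned above can be illustrated with a short Python sketch that builds both candidates for a given hash. The function name `candidate_urls` and the order of the candidates are assumptions made for illustration; the release note only states that both patterns are accepted. The hash `af135e` is the truncated placeholder from the note, not a complete hash.

```python
from typing import List


def candidate_urls(remote: str, content_hash: str) -> List[str]:
    """Build the flat and the nested URL for a content hash (hypothetical helper)."""
    base = remote.rstrip("/")
    # Flat layout: <remote>/<hash>
    flat = f"{base}/{content_hash}"
    # Nested layout: <remote>/<chars 1-2>/<chars 3-4>/<hash>
    nested = f"{base}/{content_hash[0:2]}/{content_hash[2:4]}/{content_hash}"
    return [flat, nested]


urls = candidate_urls("https://example.com", "af135e")
for url in urls:
    print(url)
```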

Bugs

0.0.12

13 May 16:11

Features

Improvements

  1. first pass at tracking the Biodiversity Heritage Library OCR texts at the title/item/book level (related to #4)

Bugs

Alpha version

07 May 00:28

Features

Improvements

  1. first pass at enabling remotes to improve data mobility, e.g., preston history --remote https://deeplinker.bio/

Bugs

Alpha version

31 Jan 20:46

Features

Improvements

  1. make the copyTo command compatible with both 3- and 5-level hash directory structures.

Bugs