Skip to content
This repository was archived by the owner on Jul 25, 2025. It is now read-only.

Releases: databricks/lilac

v0.1.1

26 Sep 00:08
Compare
Choose a tag to compare

Overview

  • Embedding computation can now be larger-than-RAM! Computing lots of embeddings will iteratively write to a vector store.
  • JSON and CSV sources are heavily optimized and go through duckdb for parsing.
  • Clustering now supports semantic clustering with embeddings, using DBScan.

New features

Performance

Bug fixes

Full Changelog: https://github.com/lilacai/lilac/compare/v0.1.0...v0.1.1

v0.1.0

21 Sep 12:23
Compare
Choose a tag to compare

New Features

Lilac now supports labeling! For a detailed guide, see Labeling a dataset

Labels can be added for individual rows:

dataset.add_labels(
  'good',
  row_ids=['0003076800f1471f8f4c8a1b2deda742'])

Or for slices of the data:

dataset.add_labels(
  'short',
  filters=[
    (('text', 'text_statistics', 'num_characters'), 'less', 1000)
  ]
)

They can then be exported:

short_rows = list(
  dataset.select_rows(
    ['*', 'short'],
    filters=[
      (('short', 'label'), 'exists')
    ]
  )
)
# Print the first row.
print(short_rows[0])

Output:

{
  '__rowid__': '0003076800f1471f8f4c8a1b2deda742',
  'text': 'If you want to truly experience the magic (?) of Don Dohler, then check out "Alien Factor" or maybe "Fiend", but not this. Alien Factor is actually rather imaginative considering the low budget and it\'s fairly creepy, but "Nightbeast", which I guess is sort of an updating of Alien Factor, is just plain dumb. Actors sleepwalk through their roles, especially Mr. Monotone sheriff, and the monster is some dumb Halloween-mask kind of thing instead of the wildly imaginative (but kind of stupid) looking critters from Alien Factor. A spaceship crashes on Earth and there\'s a critter inside, of course, who runs around vaporizing people. And ripping off arms, etc. And he has a cool ray gun that he uses to vaporize people too, until it gets shot out of his hand. And that\'s really about it. "Alien Factor" beats this mess hands down, if you really want to see a good Don Dohler movie, check that out instead. And RIP Don Dohler, 12/2/06.',
  'label': 'neg',
  '__hfsplit__': 'test',
  'good': {
    'label': 'true',
    'created': datetime.datetime(2023, 9, 20, 10, 16, 15, 545277)
  }
}

Labels can also be added via the UI:

image

What's changed

Bug fixes

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.20...v0.1.0

v0.0.20

20 Sep 20:05
Compare
Choose a tag to compare

Features

Bug fixes

Other

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.19...v0.0.20

v0.0.19

14 Sep 19:18
Compare
Choose a tag to compare

What's Changed

New Features 🎉

Other Changes

New Contributors

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.18...v0.0.19

v0.0.18

06 Sep 14:19
Compare
Choose a tag to compare

New Features

Other Changes

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.17...v0.0.18

v0.0.17

02 Sep 12:48
Compare
Choose a tag to compare

What's Changed

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.16...v0.0.17

v0.0.16

31 Aug 16:30
Compare
Choose a tag to compare

New Features

Other Changes

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.15...v0.0.16

v0.0.15

29 Aug 18:26
Compare
Choose a tag to compare

What's Changed

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.14...v0.0.15

v0.0.14

29 Aug 13:54
cfeb0e8
Compare
Choose a tag to compare

What's Changed

A bug with JavaScript not getting built for the pip package was fixed and released with this version. This includes the change to the searchbox: https://github.com/lilacai/lilac/pull/603

New Contributors

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.13...v0.0.14

v0.0.13

29 Aug 00:47
59f5f44
Compare
Choose a tag to compare

What's Changed

Full Changelog: https://github.com/lilacai/lilac/compare/v0.0.12...v0.0.13