What's Changed
- setup: expand .gitignore to include more Python-related files by @SeanSdahl in #21
- New implementation of wasserstein loss based on the absolute difference by @NinaWie in #22
- Code packaging by @jannisborn in #23
- adjusted paths for new packaging by @zausin33 in #25
- Poster by @jannisborn in #27
- Bump torch from 2.0.1 to 2.2.0 by @dependabot[bot] in #29
- Bump aiohttp from 3.10.5 to 3.10.11 by @dependabot[bot] in #30
- chore: add 'outputs/' to .gitignore by @SeanSdahl in #31
- test: update language modelling tests to enable running on apple silicon by @SeanSdahl in #32
- added code to generate arithmetic expression datasets by @SeanSdahl in #28
- Bump jinja2 from 3.1.4 to 3.1.5 by @dependabot[bot] in #33
- Benchmarking Suite for Loss Functions in T5 Training by @s1k0ra in #36
- Gce by @zausin33 in #39
- Ablation studies by @AnamarijaKozina in #37
- Non-number token loss contributes now to GCE by @ad045 in #41
- Ablation studies without data by @zausin33 in #50
- Integration of base ntl and minimal working example by @Larspennig in #53
- Add Lightweight Runtime Benchmarking for Loss Functions by @s1k0ra in #54
- Multigpu compatibility (#1) by @jannisborn in #57
- Gh page by @zausin33 in #59
- Rebuttal by @zausin33 in #62
- downloaded the cnn dataset and preprocess by @zausin33 in #60
- Profiling by @zausin33 in #61
- added generation file for temperature dataset by @zausin33 in #58
- Gaussian smoothing with NTL by @Larspennig in #64
- Gh page by @zausin33 in #63
New Contributors
- @SeanSdahl made their first contribution in #21
- @dependabot[bot] made their first contribution in #29
- @s1k0ra made their first contribution in #36
- @ad045 made their first contribution in #41
Full Changelog: neurips_mathai...icml25