v0.4.0
What's Changed
- Add note about local venv to the README by @alan-cooney in #27
- Remove old import by @jbloomAus in #21
- Add abstract dataset by @alan-cooney in #35
- Add Pile Uncopyrighted source data by @alan-cooney in #36
- Set all dataclasses to frozen by @alan-cooney in #37
- Add Neel's C4 Dataset by @alan-cooney in #38
- Remove old source data functionality by @alan-cooney in #39
- Add get_dataloader method to source dataset by @alan-cooney in #41
- Add mixed precision manual settings throughout SAE by @alan-cooney in #42
- Make the pile tests more robust by @alan-cooney in #43
- Restrict Python version to <3.12 due to aiohttp compatibility by @HoagyC in #40
- Pin old transformers and tokenizers versions to fix bug with Neel's tokenizer by @ArthurConmy in #24
- Add poe command aliases by @alan-cooney in #44
- Add quick setup for pre-commit hooks by @alan-cooney in #46
- Add batch size option to config by @ArthurConmy in #25
- Fix demo pipeline by @alan-cooney in #50
New Contributors
- @jbloomAus made their first contribution in #21
- @HoagyC made their first contribution in #40
- @ArthurConmy made their first contribution in #24
Full Changelog: v0.3.0...v0.4.0