Releases: LAION-AI/Open-Assistant
Releases · LAION-AI/Open-Assistant
v0.0.1-beta57
What's Changed
- added markdown tooltip by @Anantha-Kandrapu in #2064
- duplex communication in inference by @yk in #2098
- Fix passing parallelism var to worker in Ansible playbook by @olliestanley in #2092
- Add XP progress by @AbdBarho in #2093
- Update Chat UI to latest inference changes by @AbdBarho in #2105
- Inference-deploy by @yk in #2109
- Inference-comma-separated-list-configs by @yk in #2110
- added create_at and parent_id to inference MessageRead by @yk in #2111
- Improve FAQ by @olliestanley in #2114
- Add GitHub OAuth support to inference server by @olliestanley in #2097
- Clean up chat UI by @AbdBarho in #2108
- Add Team page by @AbdBarho in #2103
- Added James Mete (JMete) to team page. by @Jmete in #2117
- Update team.json by @melvinebenezer in #2116
- Minor update on team.json by @melvinebenezer in #2118
- Some changes in teams by @0x22almostEvil in #2120
- Reward Model Training by @dvruette in #2071
- You in teams and more by @0x22almostEvil in #2123
- Add minimath support by @johnflux in #2094
- Add
Zyto /website/src/data/team.json by @Zyphre in #2125 - Fix GitHub auth flow on inference server by @olliestanley in #2121
- Add tlcv2.0_oa by @wannaphong in #1965
- Basic-hf-server by @yk in #2130
- feature: Using Celery for APIs and periodic tasks by @melvinebenezer in #1754
- enable chat via environment variable instead of flag by @yk in #2128
- Boilerplate OAuth website <=> inference by @AbdBarho in #2127
- Add Joak to team by @notmd in #2134
- Evaluation datasets for Reward Model by @shahules786 in #2112
- fix team description by @notmd in #2135
- Add me to team.json by @johnflux in #2132
- Update team.json by @kenhktsui in #2139
- Simplify celery worker use by @andreaskoepf in #2131
- added HF, W&B, and Redmond to sponsors by @yk in #2142
- Display message tree state admin by @Rulios in #2133
- Add multi-turn conversation dataset format by @Vechtomov in #2113
- [#2077] Possbility to revert message deletion by @Shadowner in #2079
- Added ENABLE_CHAT to chat id page and docker-compose by @yk in #2137
- Llama inference by @yk in #2144
- hotfix for max_new_tokens parameter being None by @yk in #2146
- Implemented ability to press ctrl+enter (or cmd+enter) to Review/Subm… by @cooperaustinj in #1237
- Cancelling pending chat messages of user on submission of a new message by @yk in #2141
- Added a timeout for inference messages by @yk in #2147
- Added various people to codeowners for backend and inference by @yk in #2150
- Remove invalid badge from README by @olliestanley in #2149
- Add refresh tokens to inference server auth by @olliestanley in #2129
- Robustification: Specific message type for protocol upgrade, everything else will re-connect by @yk in #2152
- Fixes auth TokenPair change bug (also fix for null logprob from inference server) by @yk in #2155
- Fixes changed access token model for website. Does not handle refresh yet by @yk in #2157
- Updated team page to fit visually more the the rest of the pages, added a button to the call-to-action section which links to the team page. by @theopfr in #2126
- Fix stacking of pos/neg logits for RM compute_metrics by @andreaskoepf in #2138
- building separate llama image for inference by @yk in #2161
- Changed worker config into worker info and unified model input and total lengths by @yk in #2160
- Added hardware info dependencies to oasst-shared to make api tests pass by @yk in #2162
- Derive RewardModel from PreTrainedModel by @andreaskoepf in #2158
- check
per_digit_tokens
exists by @HuangLK in #2166 - Only allow prompter answers to complete parent message by @yk in #2164
- Add first RM training run params by @andreaskoepf in #2167
- Diabling stop sequences in inference, since we are using eos_tokens in most models by @yk in #2168
- Added param configs from eval to pass to the frontend by @yk in #2163
- Fix: Correctly sending error to waiting client on message abort by @yk in #2169
- Chat config UI by @notmd in #2115
- Update pythia 6b RM training config by @andreaskoepf in #2187
- Add @dvruette, @andreaskoepf & @yk as codeowners of /model/ by @andreaskoepf in #2186
- Fix inconsistent sources of max token length by @yk in #2178
- Add GPTNeoXRewardModel by @andreaskoepf in #2182
- Add llama training support by @andreaskoepf in #2055
- Fix output of RankingDataCollator when max_length is None by @andreaskoepf in #2185
- Update website's dockerfile by @AbdBarho in #2189
- Login to inference through website by @AbdBarho in #2176
- rework layout by @notmd in #2172
- Evaluate sampling report using RM by @shahules786 in #2190
New Contributors
- @Anantha-Kandrapu made their first contribution in #2064
- @Zyphre made their first contribution in #2125
- @Rulios made their first contribution in #2133
- @cooperaustinj made their first contribution in #1237
- @HuangLK made their first contribution in #2166
Full Changelog: v0.0.1-beta56...v0.0.1-beta57
v0.0.1-beta56
What's Changed
- Reddit confessions dataset by @shahules786 in #2045
Full Changelog: v0.0.1-beta55...v0.0.1-beta56
v0.0.1-beta55
What's Changed
- Add Zhihu data (#1459) by @wangrui6 in #1863
- Fixed typo in tasks.json (english) by @Snake1Byte in #2037
- Show correct user avatars on leaderboard by @AbdBarho in #2039
- Swap color toggle icons by @AbdBarho in #2042
- Resilient discord image update by @AbdBarho in #2043
- Update docs by @CactiStaccingCrane in #2015
- Trainer fixes by @johnflux in #2040
- Add Vietnamese translation by @CactiStaccingCrane in #2046
- Add support for flash attention by @dvruette in #2033
- Delete duplicate export script by @dvruette in #2052
- refactor(website): convert Header and Loading stories to TypeScript by @Juneezee in #2050
- Choice to add global system-prefix to the assistant during changes by @sanagno in #2053
- Add option to tokenize numbers per digit by @johnflux in #2024
- Admin message list by @notmd in #2057
- Lazy load code highlight by @notmd in #2067
- Add --include-synth and --only-synth filtering options for export.py by @andreaskoepf in #2012
- Updating Russian Translation by @0x22almostEvil in #2069
- Remove repeated assignment in trainer_sft.py by @entropy2333 in #2074
- Add brief documentation for backend data export process by @olliestanley in #2068
- Document message and token format by @johnflux in #2056
- fix: added platform support for ghcr.io images to be run on Apple Sil… by @melvinebenezer in #1763
- Revert "fix: added platform support for ghcr.io images to be run on Apple Sil…" by @yk in #2080
- Add user level to user stats by @AbdBarho in #2083
- Add initial Ansible playbooks for inference server and worker by @olliestanley in #2078
- TV and Movie dialogue corpus by @sedthh in #2058
- 'Language' string extraction from Stats by @0x22almostEvil in #2085
- Update uk-UA translations. by @nmeln in #2091
- More french translation by @Shadowner in #2082
New Contributors
- @wangrui6 made their first contribution in #1863
- @Snake1Byte made their first contribution in #2037
- @Juneezee made their first contribution in #2050
- @entropy2333 made their first contribution in #2074
Full Changelog: v0.0.1-beta54...v0.0.1-beta55
v0.0.1-beta54
What's Changed
- show synthetic badge on message by @notmd in #2014
- Update chat API calls by @AbdBarho in #2016
- fix for invalid avatarUrl error by @owen-duncan-snobel in #1986
- Refresh active users' avatars by @AbdBarho in #2019
- Create oasst_data module by @andreaskoepf in #2009
- Safety pipeline prototype by @shahules786 in #1972
- Show more text in Ranking tasks depending on screen size by @AbdBarho in #2020
- Group samples by sequence length by @sanagno in #2011
- Configure pytest in sft pipeline #1770 by @bethanyconnolly in #1922
- [MRG] [model_training]📝 update model training doc by @HuaizhengZhang in #1946
- Add script to export model to HF or local directory by @andreaskoepf in #2028
- add
include_user
params in /messages/cursor` endpoint by @notmd in #2021 - dataset: NSFW and Self-harm dataset from Reddit by @jjmachan in #1967
- Fix for board by @Shadowner in #2035
New Contributors
- @bethanyconnolly made their first contribution in #1922
- @HuaizhengZhang made their first contribution in #1946
- @jjmachan made their first contribution in #1967
Full Changelog: v0.0.1-beta53...v0.0.1-beta54
v0.0.1-beta53
Full Changelog: v0.0.1-beta52...v0.0.1-beta53
v0.0.1-beta52
What's Changed
- Minor training speed improvements by @dvruette in #1979
- Correct Collator by @sanagno in #1975
- Add a tooltip to the creative and spam label (#1973) by @johnflux in #1988
- show message create date by @notmd in #1978
- force
next-i18next
by @AbdBarho in #1998 - Adding Safety Directory To The Datasets by @Shadowner in #1980
- Added moon icon for when the light mode is active, removed oddly big gap between sign-in and color-mode button by @theopfr in #1989
- fix ranking validity change detection by @notmd in #2000
- Various minor fixes for training pythia-6b by @dvruette in #2003
- Avoid Nan Losses by @sanagno in #2001
- Add SFT params random_offset_probability, label_masking by @andreaskoepf in #2005
- Use browser defaults for markdown lists by @AbdBarho in #1999
- Residual dropout support by @dvruette in #2010
- added async postgres to inference by @yk in #1961
New Contributors
- @Shadowner made their first contribution in #1980
Full Changelog: v0.0.1-beta51...v0.0.1-beta52
v0.0.1-beta51
What's Changed
- disable not_rankable checkbox when in review state by @notmd in #1950
- feat: discord bots by @MrlolDev in #1953
- Show correct discord avatar on leaderboard by @AbdBarho in #1955
- Fix discord bot workflow by @AbdBarho in #1956
- enable
strictBindCallApply
by @AbdBarho in #1959 - Fix leaderboard not loading by @AbdBarho in #1958
- Chat vote UI by @notmd in #1913
- remove global prisma type by @notmd in #1960
- Extended export script by @yk in #1954
- Add iapp_wiki_qa_squad_oa by @wannaphong in #1936
- enable
strictFunctionTypes
by @AbdBarho in #1962 - Fix some null checks by @AbdBarho in #1963
- Add YouTube subtitles HowTo100M dataset #259 by @totuta in #1924
- Fix ordering of images in leaderboard api by @AbdBarho in #1964
- Update hyperparameters to reflect InstructGPT by @dvruette in #1966
- Show user score in header by @AbdBarho in #1969
New Contributors
Full Changelog: v0.0.1-beta50...v0.0.1-beta51
v0.0.1-beta50
What's Changed
- Distributed Sampler by @sanagno in #1869
- Add debug auth between Website & Inference by @AbdBarho in #1893
- Display user avatar in leaderboard by @MrlolDev in #1767
- Update Thai translation by @wannaphong in #1901
- Scroll to message when loaded by @johnflux in #1886
- added CMU Wiki QA parser by @sedthh in #1905
- Store worker compliance check results, add initial scoring algorithm by @olliestanley in #1894
- Add basic load tests by @jackapbutler in #1784
- Show all my messages by @johnflux in #1864
- added Ubuntu QA dataset by @sedthh in #1906
- Updated datasets.md by @sedthh in #1916
- refactor inference message score by @notmd in #1907
- Typo Fixes by @rehmatsg in #1910
- Update notebooks readme with new location for dataset code by @olliestanley in #1920
- View chat list by @AbdBarho in #1915
- Add chat.json for Thai by @wannaphong in #1921
- docs: small header change and add latest youtube vid as blogpost by @andrewm4894 in #1851
- Gutenberg fix by @sedthh in #1914
- fix(backend): fill lang when importing mst by @nero19960329 in #1929
- SFT model eval by @andreaskoepf in #1918
- Update HumanEval and MBPP notebooks by @olliestanley in #1895
- Add not_rankable and reveal_synthetic flags by @andreaskoepf in #1934
- Update Contributor Documentation by @kingychiu in #1171
- Bugs in Markdown. 3. Big numbers in lists are corrupted resolved by @austraCalvin in #1941
- Markdown inline code rendering broken resolved by @austraCalvin in #1945
- feat(backend): enable automatically import message node by @nero19960329 in #1947
- add synthetic badge and all answers wrong checkbox to ranking task by @notmd in #1937
- Added Notebook for Closed Book QA Generator by @horribleCodes in #1933
New Contributors
- @rehmatsg made their first contribution in #1910
- @nero19960329 made their first contribution in #1929
- @austraCalvin made their first contribution in #1941
Full Changelog: v0.0.1-beta49...v0.0.1-beta50
v0.0.1-beta49
What's Changed
- fix layout overflow by @notmd in #1823
- Adding external-link icon by @ArgiesDario in #1827
- [Translation] Improved and completed the Arabic translation for the Website. by @Ali-x98 in #1813
- Fix numbered list alignment by @Slowlife01 in #1833
- Make the colors of the "Message tree states" match by @GuilleHoardings in #1829
- rename openassistant/ -> data/ by @andreaskoepf in #1830
- update some dependencies by @notmd in #1837
- Handle empty message tree state by @AbdBarho in #1838
- Document docker behaviour for development by @AbdBarho in #1839
- Adding Guidelines to translation, and translate it to spanish by @ArgiesDario in #1840
- WikiData Crawler Notebook by @sedthh in #1075
- Fix failing pre-commit on WikiData notebook by @olliestanley in #1844
- 404 [en, ru], edit [en, ru], keeping upd, fixes by @0x22almostEvil in #1842
- Improving UI fot the FAQ page by @cuevantn in #1850
- Inference: Associate chats with user IDs by @olliestanley in #1826
- Fix dark theme problem in the developer guide by @GuilleHoardings in #1855
- Implement separate rate limit for message submission tasks by @olliestanley in #1747
- Copy message text button by @CheckMC in #1859
- Fix link to the data directory that was renamed by @GuilleHoardings in #1862
- Inference: Server code clean up by @olliestanley in #1857
- Add ability to Delete Messages From the List View in User Detail View… by @johnflux in #1860
- Add button to copy code in markdown by @Slowlife01 in #1845
- Add missing Czech translations, fix few typos by @d3nzil in #1867
- Notebook contribution: Film descriptions. Resolves #1705 by @VictorLG98 in #1856
- Various fixes in inference by @yk in #1865
- worker status tracking by @yk in #1866
- [1477] - add voting and reporting api by @layterz in #1848
- Adding Home Page Unit Test by @MalikMAlna in #1858
- Recipes dataset. Resolves #1031 by @dctanner in #1836
- Update Japanese translation by @GuilleHoardings in #1887
- Use utf-8 in find missing locale script by @d3nzil in #1891
- add some additional badges to readme by @andrewm4894 in #1852
- New oasst export dataset loader by @andreaskoepf in #1854
- Add TreeManagerConfiguration.max_prompt_lottery_waiting parameter by @andreaskoepf in #1889
New Contributors
- @Ali-x98 made their first contribution in #1813
- @cuevantn made their first contribution in #1850
- @CheckMC made their first contribution in #1859
- @d3nzil made their first contribution in #1867
- @layterz made their first contribution in #1848
- @dctanner made their first contribution in #1836
Full Changelog: v0.0.1-beta48...v0.0.1-beta49
v0.0.1-beta48
What's Changed
- Display name validation by @markuschue in #1633
- website: Fix stats page responsiveness and add page title by @rjmacarthy in #1799
- Restore help links to documentation of tasks by @horribleCodes in #1800
- Fixed typos and updated translations for Finnish by @minipasila in #1804
- Update Spanish translation by @GuilleHoardings in #1807
- Add Slovak Translation by @hzj5790 in #1787
- Trollboard hide banned by @ArgiesDario in #1786
- Add an introduction guide for developers by @GuilleHoardings in #1806
- Added RM dataset training split and add support for WebGPT and other for RLHF by @theblackcat102 in #1793
- Fix reward model evaluation by @maw501 in #1783
- Fix the path of mtsamples.csv by @totuta in #1691
- Added Hindi and Japanese Translation by @shashankx86 in #1738
- Update Thai translations by @Slowlife01 in #1811
- web: flag explanation: make more readable by @lainisourgod in #1097
- Add message counts grouped by lang tag to system stats query by @kingychiu in #1101
- handle error when submitting task by @notmd in #1802
- Completed and Fixed Slovak Translation by @hzj5790 in #1816
- fix task error message by @notmd in #1817
- Inference: Require user authentication to call chat endpoints by @olliestanley in #1805
New Contributors
- @markuschue made their first contribution in #1633
- @minipasila made their first contribution in #1804
- @hzj5790 made their first contribution in #1787
- @shashankx86 made their first contribution in #1738
Full Changelog: v0.0.1-beta47...v0.0.1-beta48