
feat(openai): added support for o1 reasoning models #1618

Merged
merged 7 commits into from
Dec 9, 2024

Conversation

KeganHollern
Contributor

@KeganHollern KeganHollern commented Dec 7, 2024

Summary

  1. Fixes a bug with systemRoleSupported for the OpenAI endpoint.
  2. Uses max_completion_tokens instead of max_tokens when querying the chat_completions endpoint.

The bugfix is necessary because the o1-preview and o1-mini models do not support "system" message types.
The token configuration change is necessary because o1-preview and o1-mini no longer accept the now-deprecated max_tokens field.
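The switch described here can be sketched as a small helper that picks which field to put in the request body. This is an illustrative sketch, not chat-ui's actual code; the function and type names are hypothetical.

```typescript
// Hypothetical sketch: choose between the deprecated `max_tokens` field and
// the newer `max_completion_tokens` field for a chat_completions request body.
type TokenParams = { max_tokens?: number; max_completion_tokens?: number };

function buildTokenParams(useCompletionTokens: boolean, maxNewTokens: number): TokenParams {
  // o1-family models reject the deprecated `max_tokens` field, so callers
  // targeting them should pass useCompletionTokens = true.
  return useCompletionTokens
    ? { max_completion_tokens: maxNewTokens }
    : { max_tokens: maxNewTokens };
}
```

The resulting object would then be spread into the chat_completions request payload alongside the messages.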

Considerations

I added a new endpoint configuration field called useCompletionTokens and defaulted it to true.
This changes the default chat_completions call to use max_completion_tokens instead of max_tokens.
There may be unintended consequences for users, as this could increase their overall token usage per query (since the default would change).
It may be prudent to start with this field set to false and later switch it to true as part of a planned deprecation of max_tokens altogether.
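For reference, the new field would be set on the endpoint entry of a model's configuration. This is a hypothetical fragment based on the field name described in this PR; the exact placement in chat-ui's config may differ.

```json
{
  "endpoints": [{
    "type": "openai",
    "useCompletionTokens": true
  }]
}
```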

Another consideration: when the system prompt is empty, my change for system messages will leave a {"role": "user", "content": ""} entry. We may want to drop the system message entirely if its content is blank and system messages are not supported.
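The behavior discussed above (downgrading system messages to user messages, and dropping them when blank) could be sketched as follows. Names here are illustrative, not chat-ui's actual implementation.

```typescript
// Hypothetical sketch of system-message handling for models that do not
// support the "system" role (e.g. o1-preview, o1-mini).
type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

function adaptSystemMessages(
  messages: ChatMessage[],
  systemRoleSupported: boolean
): ChatMessage[] {
  if (systemRoleSupported) return messages;
  return messages.flatMap((m): ChatMessage[] => {
    if (m.role !== "system") return [m];
    // Drop blank system prompts entirely rather than emitting an empty user turn.
    if (m.content.trim() === "") return [];
    // Downgrade non-empty system prompts to a user message.
    return [{ role: "user", content: m.content }];
  });
}
```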

Reference

For future readers wanting to introduce o1 reasoning models based on this PR, here is an example model configuration for o1-mini:

{
  "name": "o1-mini",
  "displayName": "o1 mini test",
  "multimodal": false,
  "description": "ChatGPT o1-mini",
  "systemRoleSupported": false,
  "parameters": {
    "max_new_tokens": 2048
  },
  "endpoints": [{
    "type": "openai"
  }]
}

@KeganHollern KeganHollern marked this pull request as ready for review December 7, 2024 02:59
@evalstate
Contributor

This is cool - I had a crack at this a while ago (#1540 (comment)), at which point streaming wasn't supported, so I had to do some extra messing around (I capture token usage too).

@nsarrazin added the enhancement (new feature or request), back (Svelte backend/DB), and models (model performance/reliability) labels Dec 9, 2024
@nsarrazin
Collaborator

Hi! Thanks for the contrib. I made a couple changes:

  • Added an example in the doc
  • Made the parameter false by default (like you said, probably better to not change the default behaviour for now)
  • Fixed the linting

Overall this looks great, tried it locally and it seems to work well. Will merge once the checks are done. 🚀

@nsarrazin nsarrazin merged commit 9542b2c into huggingface:main Dec 9, 2024
4 checks passed