Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Consider supporting a reranker model? #301

Open
qiankunli opened this issue Nov 2, 2024 · 7 comments
Open

Consider supporting a reranker model? #301

qiankunli opened this issue Nov 2, 2024 · 7 comments

Comments

@qiankunli
Copy link

No description provided.

@samos123
Copy link
Contributor

samos123 commented Nov 2, 2024

Do you have a specific
reranker API, model and engine in mind?

OpenAI doesn't provide a reranker API. I am not too familiar with reranking use case. So please provide as much context and details as possible.

@qiankunli
Copy link
Author

qiankunli commented Nov 2, 2024

Do you have a specific reranker API, model and engine in mind?

OpenAI doesn't provide a reranker API. I am not too familiar with reranking use case. So please provide as much context and details as possible.

The reranker model is an important part of RAG (Retrieval-Augmented Generation), and we hope to operate a reranker model (for example, BAAI/bge-reranker-v2-m3) and provide a rerank API.

You can refer to the rerank API provided by Cohere: Rerank API.
@samos123

@samos123
Copy link
Contributor

samos123 commented Nov 5, 2024

Edit: The docs call out that Infinity adheres to Cohere API, which is great. I think maybe that's our shortest route to support reranker models.

@michaelfeil what's the API that Infinity provides? https://github.com/michaelfeil/infinity?tab=readme-ov-file#reranking

I'm thinking maybe we can re-use Infinity support for adding support for reranker models.

@michaelfeil
Copy link
Contributor

Reranker API’s don’t have a common ground in their spec. E.g. LiteLLM and I followed Cohere.

@samos123
Copy link
Contributor

Seems Cohere is also supported in other solutions: https://docs.continue.dev/customize/model-types/reranking

I think supporting a Reranker API would be great for fully local private code completion with continue.dev. We support all endpoints (chat completions, code completion, embeddings) except Reranker.

@qiankunli
Copy link
Author

Seems Cohere is also supported in other solutions: https://docs.continue.dev/customize/model-types/reranking

I think supporting a Reranker API would be great for fully local private code completion with continue.dev. We support all endpoints (chat completions, code completion, embeddings) except Reranker.

I'm hopeful that this plan will be put into action.

@samos123
Copy link
Contributor

samos123 commented Dec 5, 2024

It's definitely on the roadmap and I think I have all the info I would need. Right now our focus is on adding support for PVC support and Prefix Cache aware routing. Afterwards we should be able to tackle this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants