we have a deduplication endpoint that improves developer experience, making it easier to build better agents / AI apps by removing the noise (see the search pipe, which already does this client side using string similarity heuristics; the UX is quite bad because it blocks the UI thread, and when there are tons of results it freezes for 30+ seconds)
should have a way to call it in the SDK, like pipe.dedup(mydata), maybe taking the queryScreenpipe output as input? not sure, please suggest good DX
should use an embedding model, using either candle or onnx (candle is usually less painful but supports fewer models)
most of the code should live in its own file, which is then imported in server.rs
should be as fast as possible without destroying the user's computer (use Metal on Mac, or MKL/CUDA if possible)
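To make the embedding-based idea concrete, here is a minimal sketch of the dedup step itself, assuming the embeddings have already been computed by whichever model is chosen. The function names, the greedy strategy, and the threshold parameter are illustrative assumptions, not existing screenpipe code.

```rust
/// Cosine similarity between two equal-length vectors.
/// Returns 0.0 if either vector has zero norm.
fn cosine_similarity(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let norm_a: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let norm_b: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if norm_a == 0.0 || norm_b == 0.0 {
        0.0
    } else {
        dot / (norm_a * norm_b)
    }
}

/// Greedy dedup (hypothetical helper): an item is kept only if its similarity
/// to every previously kept item is below `threshold`.
/// Returns the indices of the kept items, in input order.
fn dedup_by_embedding(embeddings: &[Vec<f32>], threshold: f32) -> Vec<usize> {
    let mut kept: Vec<usize> = Vec::new();
    for (i, emb) in embeddings.iter().enumerate() {
        let is_duplicate = kept
            .iter()
            .any(|&k| cosine_similarity(&embeddings[k], emb) >= threshold);
        if !is_duplicate {
            kept.push(i);
        }
    }
    kept
}
```

This compares each item only against the items already kept, so the cost grows with the number of distinct results rather than with all pairs. For very large result sets a batched matrix multiply on the GPU would likely be faster, but that depends on the backend chosen.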
any thoughts before starting this?
Embedding Model: jina-embeddings-v3. It supports multiple languages and can be used with ONNX as well.
SDK Action: I'm thinking of taking queryScreenpipe's input fields and letting the backend run the query and return deduplicated results.
Since the SDK is in JS, passing data to Rust might be costly in some cases, so it is better to do this in Rust itself.
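A rough sketch of what that backend-side flow could look like, assuming the search and embedding steps are injected as closures. `DedupQuery`, its fields, and `query_deduplicated` are hypothetical names for illustration; the real queryScreenpipe input has its own fields, and the real embedding call would go through the chosen model runtime.

```rust
/// Hypothetical subset of query fields, plus a dedup threshold.
#[derive(Debug, Clone)]
struct DedupQuery {
    q: String,
    limit: usize,
    /// Cosine similarity at or above which two results count as duplicates.
    similarity_threshold: f32,
}

/// Cosine similarity; returns 0.0 if either vector has zero norm.
fn cosine(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let na: f32 = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let nb: f32 = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if na == 0.0 || nb == 0.0 { 0.0 } else { dot / (na * nb) }
}

/// Runs the search on the backend, embeds each hit, and returns only the hits
/// that are not near-duplicates of an earlier hit. Nothing crosses the JS/Rust
/// boundary except the query and the final, smaller result list.
fn query_deduplicated<S, E>(query: &DedupQuery, search: S, embed: E) -> Vec<String>
where
    S: Fn(&str, usize) -> Vec<String>,
    E: Fn(&str) -> Vec<f32>,
{
    let hits = search(&query.q, query.limit);
    let mut kept: Vec<(String, Vec<f32>)> = Vec::new();
    for hit in hits {
        let emb = embed(&hit);
        let is_duplicate = kept
            .iter()
            .any(|(_, e)| cosine(e, &emb) >= query.similarity_threshold);
        if !is_duplicate {
            kept.push((hit, emb));
        }
    }
    kept.into_iter().map(|(text, _)| text).collect()
}
```

With this shape the SDK call would only need to send the same fields it already sends for a query, plus an optional threshold, and would receive the deduplicated list back.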
New crates required:
Candle (or) ORT - to run the ONNX model.
tokenizers - to tokenize the input text before embedding
Please take a look and let me know if I need to make any changes.
/bounty 150
definition of done: