Release 0.0.15
New features
- Multimodal tensor combination (#332, #355). Combine image and text data into a single vector! Multimodal combination objects can be added as Marqo document fields, which lets you encode text metadata into image vectors. See the usage documentation for details.
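
To make the idea of "a single vector" concrete, here is a minimal conceptual sketch of one common way to combine per-field embeddings: a weighted sum followed by L2 normalisation. It is not Marqo's API or its internal implementation; the function name `combine_vectors`, the field names, and the weights are all hypothetical, and the two-dimensional vectors stand in for real image and text embeddings.

```python
import math


def combine_vectors(vectors, weights):
    """Return the L2-normalised weighted sum of equal-length vectors.

    `vectors` maps a field name to its embedding (a list of floats).
    `weights` maps the same field names to scalar weights.
    Both names are hypothetical and used only for illustration.
    """
    if set(vectors) != set(weights):
        raise ValueError("vectors and weights must have the same field names")

    dims = {len(vec) for vec in vectors.values()}
    if len(dims) != 1:
        raise ValueError("all vectors must have the same length")
    dim = dims.pop()

    # Accumulate the weighted sum component by component.
    combined = [0.0] * dim
    for name, vec in vectors.items():
        weight = weights[name]
        for i, value in enumerate(vec):
            combined[i] += weight * value

    # Normalise so the result has unit length, as is typical for
    # vectors compared with cosine similarity.
    norm = math.sqrt(sum(value * value for value in combined))
    if norm == 0.0:
        raise ValueError("combined vector has zero norm")
    return [value / norm for value in combined]


# Toy embeddings for one document: a text caption and an image.
field_vectors = {"caption": [1.0, 0.0], "image": [0.0, 1.0]}
field_weights = {"caption": 0.1, "image": 0.9}
document_vector = combine_vectors(field_vectors, field_weights)
```

With these weights the image dominates the resulting vector while the caption still nudges it, which is the effect described above as encoding text metadata into an image vector.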
Bug fixes
- Fixed a bug that prevented CLIP's device check from behaving as expected (#337).
- CLIP utils now uses the OpenCLIP default tokenizer so that long text inputs are truncated correctly (#351).
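
For readers unfamiliar with what truncation means here, the sketch below illustrates the general idea for a CLIP-style tokenizer: a sequence that exceeds the fixed context length is cut down, and the final position is overwritten with the end-of-text token so the sequence stays well-formed. This is an illustration only, not the code changed in #351; the function name `pad_or_truncate` is hypothetical, and the context length and special token IDs are assumptions based on the standard CLIP vocabulary.

```python
# Assumed values for a standard CLIP text encoder; treat them as
# placeholders rather than something read from the tokenizer in use.
CONTEXT_LENGTH = 77
SOT_TOKEN = 49406  # start-of-text
EOT_TOKEN = 49407  # end-of-text


def pad_or_truncate(token_ids, context_length=CONTEXT_LENGTH):
    """Wrap token IDs in start/end markers and fit them to a fixed length.

    Hypothetical helper for illustration: long inputs are truncated and
    re-terminated with EOT_TOKEN, short inputs are padded with zeros.
    """
    tokens = [SOT_TOKEN] + list(token_ids) + [EOT_TOKEN]
    if len(tokens) > context_length:
        # Drop the overflow, then restore the end-of-text marker.
        tokens = tokens[:context_length]
        tokens[-1] = EOT_TOKEN
    # Pad short sequences with zeros up to the context length.
    return tokens + [0] * (context_length - len(tokens))


long_input = pad_or_truncate(range(1, 201))  # 200 tokens, too long
short_input = pad_or_truncate([5, 6])        # fits, gets padded
```

Both results have exactly `CONTEXT_LENGTH` entries, and the truncated one still ends with the end-of-text token.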
Contributor shout-outs:
- Thank you to our 2.4k stargazers
- Thank you to @ed-muthiah, @codebrain and others for raising issues.
Release images can be found on Docker Hub.