multimodal huggingface embeddings #17463

logan-markewich · 2025-01-09T03:04:02Z

This PR adds (initial) multimodal support to huggingface embeddings

Its pretty straightforward, a lot of it relies on the sentence transformer model having image processing built in
For example
https://huggingface.co/jinaai/jina-clip-v2/blob/main/custom_st.py

multimodal huggingface embeddings

421af83

dosubot bot added the size:M This PR changes 30-99 lines, ignoring generated files. label Jan 9, 2025

logan-markewich added 3 commits January 9, 2025 09:02

fix types

37e0387

last fix?

ad4506c

fix usage

c85a595

logan-markewich merged commit 0648ceb into main Jan 9, 2025
11 checks passed

logan-markewich deleted the logan/multimodal_huggingface branch January 9, 2025 18:26

Provide feedback