We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
支持ONNX Model的加载和推理
在仅有CPU的环境下,我不得不考虑使用CPU去运行Embedding & Reranker 模型,经过测试,使用onnx拥有相较于 Sentence Transformer更快的处理速度
https://huggingface.co/docs/optimum/v1.3.0/en/onnxruntime/modeling_ort#optimum.onnxruntime.ORTModelForSequenceClassification
https://github.com/onnx/onnx
The text was updated successfully, but these errors were encountered:
Do you have some data to compare ONNX and sentence-transformers?
Sorry, something went wrong.
I don't have a lot of data, just a simple test using internal industry knowledge data
No branches or pull requests
Feature request / 功能建议
支持ONNX Model的加载和推理
Motivation / 动机
在仅有CPU的环境下,我不得不考虑使用CPU去运行Embedding & Reranker 模型,经过测试,使用onnx拥有相较于
Sentence Transformer更快的处理速度
Your contribution / 您的贡献
https://huggingface.co/docs/optimum/v1.3.0/en/onnxruntime/modeling_ort#optimum.onnxruntime.ORTModelForSequenceClassification
https://github.com/onnx/onnx
The text was updated successfully, but these errors were encountered: