Blazing fast inference solution for text embedding models
text-embeddings-inference
$ text-embeddings-inference --model-id sentence-transformers/all-MiniLM-L6-v2
$ text-embeddings-inference --model-id sentence-transformers/all-MiniLM-L6-v2 --port 8080
$ curl http://localhost:8080/embed -X POST -H 'Content-Type: application/json' -d '{"inputs": ["Hello world"]}'
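
The same request can be issued from a script. The following is a minimal sketch using only the Python standard library. It assumes the server was started with `--port 8080` as in the last launch command, and it mirrors the `/embed` route and `{"inputs": [...]}` payload from the curl call. The names `BASE_URL`, `build_embed_request`, and `embed` are hypothetical helpers, not part of the project. The response format is not shown here, so the sketch simply returns the decoded JSON.

```python
import json
import urllib.request

# Assumption: the server is listening on the port passed via --port.
BASE_URL = "http://localhost:8080"


def build_embed_request(texts, base_url=BASE_URL):
    """Build a POST request for the /embed route (hypothetical helper)."""
    body = json.dumps({"inputs": texts}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/embed",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def embed(texts, base_url=BASE_URL, timeout=30):
    """Send the request and return the decoded JSON response as-is."""
    request = build_embed_request(texts, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))
```

With a server running, `embed(["Hello world"])` sends the same body as the curl command. Building the request separately from sending it keeps the payload easy to inspect without a live server.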