I'm not sure how big of a difference it is, but ollama also has an exposed API, I wonder how hard it'd be to use that?
I'm not sure how big of a difference it is, but ollama also has an exposed API, I wonder how hard it'd be to use that?