OpenAI-compatible inference API. Model: code-1b-chat-v2
curl -X POST /v1/chat/completions \
-H "Content-Type: application/json" \
-d '{
"model": "code-1b-chat-v2",
"messages": [{"role":"user","content":"Write a Python fibonacci function."}],
"max_tokens": 200
}'