The Qwen3 Embedding model series is the latest proprietary model of the Qwen family, specifically designed for text embedding and ranking tasks.
Qwen3-Embedding-0.6b (qwen/qwen3-embedding-0.6b) is a embedding model from Alibaba Qwen, released 2025-06-18. Context window: — tokens; max output —. Pricing via AIgateway: input $0.012/M tokens, output $0/M tokens. Call it via https://api.aigateway.sh/v1/embeddings — set model="qwen/qwen3-embedding-0.6b". Best for: RAG, Semantic search, Recommendation.
curl https://api.aigateway.sh/v1/embeddings \
-H "Authorization: Bearer $AIGATEWAY_API_KEY" \
-H "Content-Type: application/json" \
-d '{"model":"qwen/qwen3-embedding-0.6b","input":"the quick brown fox"}'{
"model": "qwen/qwen3-embedding-0.6b",
"input": "Text to embed, or an array of strings for batch."
}{
"object": "list",
"data": [
{
"object": "embedding",
"index": 0,
"embedding": [0.0123, -0.0456, 0.0789, /* ... */]
}
],
"model": "qwen/qwen3-embedding-0.6b",
"usage": { "prompt_tokens": 5, "total_tokens": 5 }
}from openai import OpenAI client = OpenAI(base_url="https://api.aigateway.sh/v1", api_key="sk-aig-...") r = client.embeddings.create(model="qwen/qwen3-embedding-0.6b", input="hello world") print(r.data[0].embedding[:5])