Zhipu AI General Purpose

GLM-4 Flash

Powerful AI. Zero Cost.

128K context, tool calling, and production-ready - all free.

Pricing

Metric Official Price tokencnn.com Price
Input Free Free
Output Free Free

* tokencnn.com applies a 1.5x multiplier on official pricing for gateway and aggregation services.

API Reference

cURL
Python
curl -X POST https://api.tokencnn.com/v1/chat/completions \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "glm-4-flash",
    "messages": [{"role": "user", "content": "Summarize the key points of this document"}],
    "temperature": 0.7
  }'
import openai

client = openai.OpenAI(
    base_url="https://api.tokencnn.com/v1",
    api_key="YOUR_API_KEY"
)

response = client.chat.completions.create(
    model="glm-4-flash",
    messages=[
        {"role": "user", "content": "Write a poem about AI in the style of Li Bai"}
    ],
    temperature=0.8,
    max_tokens=1024
)

print(response.choices[0].message.content)

Model Capabilities

🚀Completely free - zero cost for both input and output tokens
⚡128K token context window for long document processing
🧠Full tool calling and function calling support
🔧Strong Chinese-English bilingual performance
📊Suitable for production workloads at scale - no budget concerns
🎯Open-weight ecosystem with active community support

Ready to Build with GLM-4 Flash?

Get your API key in minutes and start integrating China's best AI models.

Get API Key →