Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
- Updated
Dec 1, 2025 - Python
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型
C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & GLM4(V)
GLM-4.1V-9B-Thinking, designed to explore the upper limits of reasoning in vision-language models. By introducing a "thinking paradigm" and leveraging reinforcement learning, the model significantly enhances its capabilities.
Accessing the GLM-4 translation PDF e-book in the specified language as an MD file using the SDK method.
Add a description, image, and links to the glm4 topic page so that developers can more easily learn about it.
To associate your repository with the glm4 topic, visit your repo's landing page and select "manage topics."