- Hunan University
- Changsha, China
- https://caoyunkang.github.io/
Starred repositories
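Provides a practical interactive interface for LLMs such as GPT/GLM, specially optimized for reading, polishing, and writing papers. Modular design with support for custom shortcut buttons and function plugins; project analysis and self-explanation for Python, C++, and other codebases; PDF/LaTeX paper translation and summarization; parallel querying of multiple LLMs; and local models such as chatglm3. Integrates Tongyi Qianwen, deepseekcoder, iFlytek Spark, ERNIE Bot, llama2, rwkv, claude2, m…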
The official gpt4free repository | various collection of powerful language models | opus 4.6 gpt 5.3 kimi 2.5 deepseek v3.2 gemini 3
🚀🚀 "Large models": train a small 64M-parameter GPT entirely from scratch in just 2 hours!
The largest collection of PyTorch image encoders / backbones. Including train, eval, inference, export scripts, and pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (V…
Official Code for DragGAN (SIGGRAPH 2023)
Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Open-Sora: Democratizing Efficient Video Production for All
A generative world for general-purpose robotics & embodied AI learning.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
Fully open reproduction of DeepSeek-R1
JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
WebUI extension for ControlNet
A set of ready to use Agent Skills for research, science, engineering, analysis, finance and writing.
Annotate better with CVAT, the industry-leading data engine for machine learning. Used and trusted by teams at any scale, for data of any scale.
Transfer learning / domain adaptation / domain generalization / multi-task learning, etc. Papers, code, datasets, applications, tutorials.
An open source implementation of CLIP.
🏛️ Three Departments and Six Ministries · OpenClaw Multi-Agent Orchestration System: 9 specialized AI agents with real-time dashboard, model config, and full audit trails
[CVPR 2025 Best Paper Award] VGGT: Visual Geometry Grounded Transformer
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation" (CVPR'25 Spotlight).
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in PyTorch
Refine high-quality datasets and visual AI models

