Stars
[ICLR 2025] Official Implementation of Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis
🔥 Hierarchical Fine-Grained Image Forgery Detection and Localization (CVPR23 + IJCV24)
A latent text-to-image diffusion model
DALL·E Mini - Generate images from a text prompt
A comprehensive collection of IQA papers
Open-source and strong foundation image recognition models.
DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding
An invisible desktop application to help you pass your technical interviews.
The best way to write secure and reliable applications. Write nothing; deploy nowhere.
HunyuanVideo: A Systematic Framework For Large Video Generation Model
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
Get a job from Xuanwu Lab in 365 days
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
[ECCV 2024] Video Foundation Models & Data for Multimodal Understanding
A unified framework for 3D content generation.
Official PyTorch Code and Models of "RePaint: Inpainting using Denoising Diffusion Probabilistic Models", CVPR 2022
Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
Violence check model for SJTU AI introduction assignment