Stars
[CAAI AIR'24] Bilateral Reference for High-Resolution Dichotomous Image Segmentation
A streamlined and customizable framework for efficient large model (LLM, VLM, AIGC) evaluation and performance benchmarking.
🐍 Geometric Computer Vision Library for Spatial AI
[TPAMI] DDM: A Metric for Comparing 3D Shapes Using Directional Distance Fields
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
[NeurIPS 2025] SpatialLM: Training Large Language Models for Structured Indoor Modeling
Official implementation of the paper "GUAVA: Generalizable Upper Body 3D Gaussian Avatar" [ICCV 2025]
[ICCV 2025] Diffuman4D: 4D Consistent Human View Synthesis from Sparse-View Videos with Spatio-Temporal Diffusion Models
GauSTAR: Gaussian Surface Tracking and Reconstruction (CVPR 2025)
这是一个使用FastAPI和PyWebView构建的完整桌面Web应用示例项目,旨在为教学博客提供一个详细的参考。项目展示了如何结合WebSocket和模板渲染技术,创建一个功能丰富的桌面Web应用。通过PyWebView,应用可以在本地桌面上运行,同时利用FastAPI提供强大的后端支持,实现前后端的无缝交互。
🐜🐀🐒🚶 A toolkit for robust markerless 3D pose estimation
FLAME head tracker for single image reconstruction and monocular video tracking. [Note: This tracker operates offline and is not intended for real-time applications.]
Summary of publicly available ressources such as code, datasets, and scientific papers for the FLAME 3D head model
[SIGGRAPH 2025 (Journal Track)] Facial Appearance Capture at Home with Patch-Level Reflectance Prior.
[NeurIPS 2024] Generalizable and Animatable Gaussian Head Avatar
A list of SaaS, PaaS and IaaS offerings that have free tiers of interest to devops and infradev
PDF Guru Anki是你整个知识世界的“中枢转换器”,与 Anki 的强大记忆引擎无缝融合,能将来自任何地方、任何格式的知识精华,高效、系统、可持续地转化为牢固的长期记忆资产,打造专属自己的个性化Anki知识库,助你高效学习、轻松记忆。
A lightweight 3D rendering engine based on modern OpenGL
Mapping Mediapipe's 52 blendshapes to FLAME's expression coefficients and poses.
A lightweight point-based visualization tool used for inspecting Gaussian data, designing camera motion, and exporting setups for external Gaussian renderers.
[CVPR 2025 Highlight] FRESA: Feedforward Reconstruction of Personalized Skinned Avatars from Few Images
PantoMatrix: Generating Face and Body Animation from Speech
[ICLR 2025 Oral] Official code for "LVSM: A Large View Synthesis Model with Minimal 3D Inductive Bias"





