- Intuitive Surgical, Inc.
- Cupertino, California
- https://www.linkedin.com/in/xingtong-liu-ph-d-b43b27131/
- https://scholar.google.com/citations?hl=en&user=qMxRGQkAAAAJ
Lists (1)
Sort Name ascending (A-Z)
Stars
Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch
Hiera: A fast, powerful, and simple hierarchical vision transformer.
Official repository for "AM-RADIO: Reduce All Domains Into One"
[ICCV'25 Oral] Differentiable Room Acoustic Rendering with Multi-View Vision Priors
PyTorch code and models for the DINOv2 self-supervised learning method.
CoTracker is a model for tracking any point (pixel) on a video.
Automated dense category annotation engine that serves as the initial semantic labeling for the Segment Anything dataset (SA-1B).
[ECCV 2024] Official implementation of the paper "Semantic-SAM: Segment and Recognize Anything at Any Granularity"
Lora beYond Conventional methods, Other Rank adaptation Implementations for Stable diffusion.
[ECCV2024 - Oral, Best Paper Award Candidate] SEA-RAFT: Simple, Efficient, Accurate RAFT for Optical Flow
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
PyTorch implementation of "WILDCAT: Weakly Supervised Learning of Deep ConvNets for Image Classification, Pointwise Localization and Segmentation", CVPR 2017
Desmoking laparoscopy surgery images using an image-to-image translation guided by an embedded Dark channel
A simple method to perform semi-supervised learning with limited data.
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
Tensors and Dynamic neural networks in Python with strong GPU acceleration
[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
✨✨Latest Advances on Multimodal Large Language Models
Implementation of Nougat Neural Optical Understanding for Academic Documents
[ NeurIPS2021] This is an official implementation of our paper "HRFormer: High-Resolution Transformer for Dense Prediction".
A quick guide (especially) for trending instruction finetuning datasets
Accelerate segment anything model inference using Tensorrt 8.6.1.6
Python bindings for the Transformer models implemented in C/C++ using GGML library.
Aligning pretrained language models with instruction data generated by themselves.
A sample app for the Retrieval-Augmented Generation pattern running in Azure, using Azure AI Search for retrieval and Azure OpenAI large language models to power ChatGPT-style and Q&A experiences.
