Skip to content
View kbpark102's full-sized avatar

Block or report kbpark102

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Repository for evaluating Pegasus-1 and video-language foundation models

Python 14 2 Updated Nov 12, 2024

Flask-based web application designed to compare text and image embeddings using the CLIP model.

Python 22 4 Updated Jan 22, 2024

Offical Code for GPT4Video: A Unified Multimodal Large Language Model for lnstruction-Followed Understanding and Safety-Aware Generation

Python 144 6 Updated Oct 30, 2024

Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)

Python 108 12 Updated Jan 23, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 17,472 1,589 Updated Sep 5, 2024

Segment Anything in High Quality [NeurIPS 2023]

Jupyter Notebook 4,201 264 Updated Sep 12, 2025

Awesome things you can do with ChatGPT + Code Interpreter combo 🔥

1,017 57 Updated Dec 10, 2023

AskUp Search ChatGPT Plugin

Python 20 2 Updated May 27, 2023
Python 53 19 Updated May 12, 2025

Visual Blocks for ML is a Google visual programming framework that lets you create ML pipelines in a no-code graph editor. You – and your users – can quickly prototype workflows by connecting drag-…

TypeScript 1,336 181 Updated Mar 20, 2026

Segment Any RGBD

Python 865 52 Updated May 24, 2023

H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/

Python 4,901 519 Updated Mar 20, 2026

Official Pytorch implementation of "Graphit: A Unified Framework for Diverse Image Editing Tasks"

Python 201 10 Updated May 1, 2023

🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.

TypeScript 35,877 9,426 Updated Apr 29, 2025

AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.

Python 182,754 46,222 Updated Mar 23, 2026

Vision Transformer Cookbook with Tensorflow

Python 343 53 Updated Mar 28, 2022

(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"

Python 821 77 Updated Jul 14, 2022

Deep Learning Paper Reading Meeting-Archive

250 36 Updated Jan 12, 2025

A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support

Python 16,677 596 Updated Mar 4, 2026

Simple llama usage example

Python 1 Updated Mar 6, 2023

High-speed download of LLaMA, Facebook's 65B parameter GPT model

Shell 4,142 403 Updated Jun 28, 2023

Easy Docker setup for Stable Diffusion with user-friendly UI

Shell 7,331 1,268 Updated Aug 18, 2024

Let us control diffusion models!

Python 33,760 3,002 Updated Feb 25, 2024

Stable Diffusion web UI

Python 161,977 30,194 Updated Mar 2, 2026

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,132 6,866 Updated Mar 23, 2026

Visualizes Video Quality Metrics (PSNR, SSIM & VMAF) calculated by ffmpeg.exe

951 35 Updated Mar 23, 2026

PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)

Jupyter Notebook 126 12 Updated Feb 24, 2023

Google Research

Jupyter Notebook 37,521 8,364 Updated Mar 18, 2026

An easy library for Python file locking. It works on Windows, Linux, BSD and Unix systems and can even perform distributed locking. Naturally it also supports the with statement.

Python 322 53 Updated Jan 1, 2026

fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing d…

Python 1,836 87 Updated Feb 18, 2026
Next