Stars
Repository for evaluating Pegasus-1 and video-language foundation models
Flask-based web application designed to compare text and image embeddings using the CLIP model.
Official Code for GPT4Video: A Unified Multimodal Large Language Model for Instruction-Followed Understanding and Safety-Aware Generation
Hierarchical Video-Moment Retrieval and Step-Captioning (CVPR 2023)
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
Segment Anything in High Quality [NeurIPS 2023]
Awesome things you can do with ChatGPT + Code Interpreter combo 🔥
Visual Blocks for ML is a Google visual programming framework that lets you create ML pipelines in a no-code graph editor. You – and your users – can quickly prototype workflows by connecting drag-…
H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://docs.h2o.ai/h2o-llmstudio/
Official PyTorch implementation of "Graphit: A Unified Framework for Diverse Image Editing Tasks"
🤖 Assemble, configure, and deploy autonomous AI Agents in your browser.
AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
Vision Transformer Cookbook with TensorFlow
(ICLR 2022 Spotlight) Official PyTorch implementation of "How Do Vision Transformers Work?"
Deep Learning Paper Reading Meeting-Archive
A high-performance, zero-overhead, extensible Python compiler with built-in NumPy support
High-speed download of LLaMA, Facebook's 65B parameter GPT model
Easy Docker setup for Stable Diffusion with user-friendly UI
Stable Diffusion web UI
🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.
Visualizes Video Quality Metrics (PSNR, SSIM & VMAF) calculated by ffmpeg.exe
PyTorch code for “TVLT: Textless Vision-Language Transformer” (NeurIPS 2022 Oral)
Google Research
An easy library for Python file locking. It works on Windows, Linux, BSD and Unix systems and can even perform distributed locking. Naturally it also supports the with statement.
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing d…