Organizations

@Team-Neighborhood

A simulation evaluation platform for DROID

Python 177 25 Updated Mar 16, 2026

Efficient Triton Kernels for LLM Training

Python 6,243 507 Updated Mar 28, 2026

`dattri` is a PyTorch library for developing, benchmarking, and deploying efficient data attribution algorithms.

Python 120 30 Updated Mar 24, 2026

AI Logging for Interpretability and Explainability🔬

Python 140 11 Updated Jun 7, 2024
Python 43 5 Updated Dec 29, 2025
Python 24 1 Updated Dec 2, 2023

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,355 209 Updated Mar 5, 2024

[NeurIPS 2023] ImageReward: Learning and Evaluating Human Preferences for Text-to-image Generation

Python 1,655 90 Updated Oct 29, 2025

Code and documentation to train Stanford's Alpaca models, and generate the data.

Python 30,262 4,002 Updated Jul 17, 2024

Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI

Python 96 11 Updated Feb 22, 2023

An open collection of implementation tips, tricks and resources for training large language models

Python 497 22 Updated Mar 8, 2023

🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.

Python 20,855 2,230 Updated Mar 27, 2026

Code repository supporting the paper "Atlas: Few-shot Learning with Retrieval Augmented Language Models" (https://arxiv.org/abs/2208.03299)

Python 554 72 Updated Nov 28, 2023

The simplest, fastest repository for training/finetuning medium-sized GPTs.

Python 55,733 9,504 Updated Nov 12, 2025

Generating Information-Seeking Conversations from Unlabeled Documents (EMNLP 2022).

Python 11 Updated Jan 6, 2023

Keep Me Updated! Memory Management in Long-term Conversations (Findings of EMNLP 2022)

33 1 Updated Dec 2, 2022

Saving Dense Retriever from Shortcut Dependency in Conversational Search (EMNLP 2022)

Python 18 Updated Nov 24, 2022

Polyglot: Large Language Models of Well-balanced Competence in Multi-languages

485 42 Updated Aug 22, 2023
Python 103 9 Updated Apr 11, 2025

COYO-700M: Large-scale Image-Text Pair Dataset

Python 1,251 38 Updated Nov 30, 2022

Seminar on Large Language Models (COMP790-101 at UNC Chapel Hill, Fall 2022)

314 17 Updated Nov 21, 2022

A latent text-to-image diffusion model

Jupyter Notebook 72,776 10,619 Updated Jun 18, 2024

A library for building and serving multi-node distributed faiss indices.

Python 278 21 Updated Nov 1, 2023

Official PyTorch implementation of GGDR (ECCV 2022)

Python 102 8 Updated Aug 10, 2022

Repository for Realistic Blur Synthesis for Learning Image Deblurring

Python 115 14 Updated Feb 22, 2026

Open-source pre-training implementation of Google's LaMDA in PyTorch. Adding RLHF similar to ChatGPT.

Python 470 75 Updated Feb 24, 2024

🤗 Diffusers: State-of-the-art diffusion models for image, video, and audio generation in PyTorch.

Python 33,186 6,879 Updated Mar 28, 2026

PyTorch code for "Fine-grained Image Captioning with CLIP Reward" (Findings of NAACL 2022)

Python 246 27 Updated Jun 10, 2025

Korean Online That-gul Emotions Dataset

Jupyter Notebook 132 19 Updated Jun 24, 2023