WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.

1,102 46 Updated Sep 27, 2024

Data augmentation for NLP

Jupyter Notebook 4,652 474 Updated Jun 24, 2024

Download single or multiple classes from the Open Images V6 dataset (OIDv6)

Python 46 21 Updated Nov 18, 2020

Topic-Aware Convolutional Neural Networks for Extreme Summarization

Python 376 82 Updated Jun 20, 2023

State-of-the-Art Text Embeddings

Python 18,435 2,767 Updated Mar 12, 2026

JSON formatter and viewer in HTML for Angular

TypeScript 182 71 Updated Apr 11, 2024

Adaptive Experimentation Platform

Python 2,724 367 Updated Mar 20, 2026

💻 Medis is a beautiful, easy-to-use Mac database management application for Redis.

JavaScript 11,770 789 Updated Feb 21, 2024

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,256 32,574 Updated Mar 22, 2026

Apache Superset is a Data Visualization and Data Exploration Platform

TypeScript 71,054 16,831 Updated Mar 22, 2026