Stars
DuckDB is an analytical in-process SQL database management system
Empowering everyone to build reliable and efficient software.
A Python Object-Document-Mapper for working with MongoDB
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.
Apache Spark - A unified analytics engine for large-scale data processing
Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs



