Skip to content
View junhl's full-sized avatar
  • Montreal

Block or report junhl

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Apache Iceberg

Java 8,670 3,105 Updated Mar 28, 2026

PyIceberg

Python 1,024 459 Updated Mar 28, 2026

Graphs for Everyone

Java 16,196 2,584 Updated Mar 20, 2026

DuckDB is an analytical in-process SQL database management system

C++ 37,027 3,042 Updated Mar 28, 2026

Empowering everyone to build reliable and efficient software.

Rust 111,548 14,674 Updated Mar 28, 2026

A Python Object-Document-Mapper for working with MongoDB

Python 4,350 1,231 Updated Mar 10, 2026

🤗 Transformers: the model-definition framework for state-of-the-art machine learning models in text, vision, audio, and multimodal models, for both inference and training.

Python 158,494 32,653 Updated Mar 28, 2026

The Julia Programming Language

Julia 48,536 5,751 Updated Mar 28, 2026

Apache Spark - A unified analytics engine for large-scale data processing

Scala 43,048 29,139 Updated Mar 28, 2026

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

C++ 16,626 4,065 Updated Mar 28, 2026

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs

Scala 8,646 2,030 Updated Mar 27, 2026