This directory contains benchmarks for evaluating CK's search performance against industry standards.
The SWE-bench benchmark evaluates CK's code search and retrieval capabilities using real-world GitHub issues from the SWE-bench dataset.
- Dataset: 2,294 real GitHub issues from popular Python repositories
- Task: Given an issue description, retrieve relevant files that need to be modified
- Baseline: BM25 retrieval (as used in SWE-bench evaluations)
- CK Advantage: Tests hybrid semantic + lexical search vs pure lexical search
See swe-bench/README.md for detailed setup and usage instructions.
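To make the baseline concrete, here is a minimal, self-contained sketch of BM25 ranking in pure Python. The corpus, file names, and query are hypothetical placeholders, and the scoring uses the standard BM25 formula with common defaults (k1=1.5, b=0.75); the actual SWE-bench harness and CK's hybrid search are more involved.

```python
import math
from collections import Counter

def bm25_rank(query, docs, k1=1.5, b=0.75):
    """Rank docs (id -> token list) against a tokenized query using BM25."""
    n = len(docs)
    avgdl = sum(len(toks) for toks in docs.values()) / n
    # Document frequency: how many docs contain each term
    df = Counter()
    for toks in docs.values():
        df.update(set(toks))
    scores = {}
    for doc_id, toks in docs.items():
        tf = Counter(toks)
        score = 0.0
        for term in query:
            if term not in tf:
                continue
            idf = math.log((n - df[term] + 0.5) / (df[term] + 0.5) + 1)
            score += idf * tf[term] * (k1 + 1) / (
                tf[term] + k1 * (1 - b + b * len(toks) / avgdl)
            )
        scores[doc_id] = score
    return sorted(scores, key=scores.get, reverse=True)

# Hypothetical issue text and candidate files (tokens kept trivially simple)
corpus = {
    "auth/session.py": "session token expiry refresh login".split(),
    "db/models.py": "orm model field migration schema".split(),
    "auth/oauth.py": "oauth token provider login redirect".split(),
}
query = "login token expired after refresh".split()
print(bm25_rank(query, corpus)[0])  # → auth/session.py
```

Note that purely lexical matching misses "expired" vs "expiry" here; that vocabulary-mismatch gap is what semantic retrieval aims to close.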
Each benchmark has its own directory with:
- README.md - Detailed documentation
- run.py - Main benchmark runner script
- requirements.txt - Python dependencies
- data/ - Downloaded benchmark data (gitignored)
- results/ - Benchmark results
Benchmark results and performance comparisons are documented in each benchmark's directory.
To add a new benchmark:
- Create a new directory: benchmarks/<benchmark-name>/
- Add a README, run script, and requirements file
- Update this main README with a description
- Ensure large data files are gitignored
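The steps above can be scripted. Below is a hedged sketch that scaffolds the layout described in this README; the benchmark name "my-benchmark" and the stub file contents are placeholders, not part of any existing tooling.

```python
import tempfile
from pathlib import Path

def scaffold_benchmark(root, name):
    """Create the standard benchmark directory layout under root/benchmarks/."""
    bench = Path(root) / "benchmarks" / name
    bench.mkdir(parents=True, exist_ok=True)
    (bench / "README.md").write_text(f"# {name}\n\nTODO: detailed documentation\n")
    (bench / "run.py").write_text("#!/usr/bin/env python3\n# TODO: benchmark runner\n")
    (bench / "requirements.txt").write_text("")
    # Keep large downloaded data and generated results out of version control
    (bench / ".gitignore").write_text("data/\nresults/\n")
    return bench

# Example usage in a throwaway temp directory
with tempfile.TemporaryDirectory() as tmp:
    created = scaffold_benchmark(tmp, "my-benchmark")
    print(sorted(p.name for p in created.iterdir()))
    # → ['.gitignore', 'README.md', 'requirements.txt', 'run.py']
```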