Fix champs-scalar-coupling to include test molecule structures by jvpoulos · Pull Request #70 · openai/mle-bench

jvpoulos · 2025-09-04T11:35:33Z

The prepare script explicitly filters structures to only include train molecules. This is incorrect for this competition --- in the Kaggle competition, structures.csv contains all molecules (both train and test). The test molecules need their structures to make predictions, but they're being filtered out.

Changes:

Modified prepare.py to include both train and test molecules in structures.csv
Updated checksums.yaml to reflect the new structures.csv checksum
Updated assertions to validate both train and test molecules

After fixing the data preparation issue, I used a LightGBM model trained on 25% of the data to make predictions:

 { "competition_id": "champs-scalar-coupling", "score": 0.5823, "gold_threshold": -2.87509, "silver_threshold": -2.03119, "bronze_threshold": -1.90122, "median_threshold": -0.9529, "any_medal": false, "gold_medal": false, "silver_medal": false, "bronze_medal": false, "above_median": false, "submission_exists": true, "valid_submission": true, "is_lower_better": true, "created_at": "2025-09-03T22:56:19.592098", "submission_path": "mlebench/competitions/champs-scalar-coupling/submission.csv" }

…uctures.csv

thesofakillers · 2025-09-08T10:20:51Z

Thank you for catching this. You are correct and there is a mistake in the prepare.py, and your fix seems right.

As explained in the readme in #66 we won't be merging this fix in yet, and will release it as a batch of fixes in a upcoming v2 to be released on openai/preparedness. I've added as tracked in #71. I will try to put you as co-author for when we release the fix.

For submissions to the v1 leaderboard, please proceed as if this issue was not present.

* catalogue issue described in #70 * catalogue #77 * its called frontier-evals now

…ion (openai#78) * catalogue issue described in openai#70 * catalogue openai#77 * its called frontier-evals now

Fix champs-scalar-coupling to include test molecule structures in str…

043a279

…uctures.csv

thesofakillers added a commit that referenced this pull request Sep 8, 2025

catalogue issue described in #70

52e6ad4

thesofakillers mentioned this pull request Sep 8, 2025

Catalogue issue described in #70 re champs-scalar-coupling #71

Merged

thesofakillers closed this Sep 8, 2025

thesofakillers added a commit that referenced this pull request Sep 8, 2025

catalogue issue described in #70 (#71)

4c4a6ff

thesofakillers added a commit that referenced this pull request Oct 8, 2025

Catalogue issue described in #77 re multi-modal-gesture-recogntion (#78)

4011d70

* catalogue issue described in #70 * catalogue #77 * its called frontier-evals now

li-seeker pushed a commit to ycs-atc/mle-bench that referenced this pull request Dec 17, 2025

catalogue issue described in openai#70 (openai#71)

ed65ba2

li-seeker pushed a commit to ycs-atc/mle-bench that referenced this pull request Dec 17, 2025

Catalogue issue described in openai#77 re multi-modal-gesture-recognt…

a2d0405

…ion (openai#78) * catalogue issue described in openai#70 * catalogue openai#77 * its called frontier-evals now

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix champs-scalar-coupling to include test molecule structures#70

Fix champs-scalar-coupling to include test molecule structures#70
jvpoulos wants to merge 1 commit intoopenai:mainfrom
jvpoulos:fix-champs-scalar-coupling-structures

jvpoulos commented Sep 4, 2025 •

edited

Loading

thesofakillers commented Sep 8, 2025

Labels

2 participants

Conversation

jvpoulos commented Sep 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes:

thesofakillers commented Sep 8, 2025

Labels

2 participants

jvpoulos commented Sep 4, 2025 •

edited

Loading