twni2016
:octocat:
Focusing

Highlights

  • Pro

Pinned

  1. neubay Public

    Official code for "Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism"

    Python

  2. self-predictive-rl Public

    Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024

    Jupyter Notebook · 22 stars · 2 forks

  3. Memory-RL Public

    When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)

    Python · 68 stars · 6 forks

  4. pomdp-baselines Public

    Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

    Python · 338 stars · 48 forks

  5. llm-reasoning-uft Public

    Code for "Offline Learning and Forgetting for Reasoning with Large Language Models", TMLR 2025

    Python · 13 stars