twni2016 (Tianwei Ni)

Pinned Loading

neubay neubay Public

Official code for "Long-Horizon Model-Based Offline Reinforcement Learning Without Conservatism"

Python
self-predictive-rl self-predictive-rl Public

Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024

Jupyter Notebook 22 2
Memory-RL Memory-RL Public

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)

Python 68 6
pomdp-baselines pomdp-baselines Public

Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022

Python 338 48
llm-reasoning-uft llm-reasoning-uft Public

Code for Offline Learning and Forgetting for Reasoning with Large Language Models, TMLR 2025

Python 13