Working on Foundation Models @QwenLM
- Qwen, Alibaba
- Beijing, China
- 17:57
(UTC +08:00) - https://lancelqf.github.io/about/
- https://orcid.org/0000-0001-8568-7603
- @qingfeng_lan
- in/qingfenglan
-
-
Elephant Public
Elephant activation function is a novel class of activation functions designed to enhance the resilience of neural networks to catastrophic forgetting.
-
MeDQN Public
The official implementation of Memory-efficient DQN algorithm.
-
Explorer Public
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
-
gym-games Public
A collection of Gymnasium compatible games for reinforcement learning.
-
oat Public
Forked from sail-sg/oat🌾 OAT: A research-friendly framework for LLM online alignment, including preference learning, reinforcement learning, etc.
Python Apache License 2.0 UpdatedMar 22, 2025 -
Jaxplorer Public
Jaxplorer is a Jax reinforcement learning (RL) framework for exploring new ideas.
-
QuantumExplorer Public
A quantum reinforcement learning framework based on PyTorch and PennyLane.
-
gymnax Public
Forked from RobertTLange/gymnaxRL Environments in JAX 🌍
Python Apache License 2.0 UpdatedSep 16, 2022 -
singularity-deffile Public
Singularity definition files for projects.
MIT License UpdatedJul 31, 2019
