Skip to content
View erfanMhi's full-sized avatar

Organizations

@Computational-Intelligence-Fall18 @rasht-school-of-ai @iust-projects

Block or report erfanMhi

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. one-covenant/grail one-covenant/grail Public

    interplanetary intelligence

    Python 21 11

  2. rlvr_pipeline rlvr_pipeline Public

    A composable component orchestrator for Reinforcement Learning from Verifiable Rewards (RLVR) training of Large Language Models on reasoning tasks.

    Python 1

  3. intractai/IntractCodeAPI intractai/IntractCodeAPI Public

    An API designed for code completion and fine-tuning of open-source large language models on internal codebases and documents.

    Python 13 2

  4. base_reinforcement_learning base_reinforcement_learning Public

    This is the code-base that I personally use as the starting point for any reinforcement learning codebase with the purpose of fast experimentation and analysis.

    Python 13 1

  5. flypi flypi Public

    Circuit Analysis for Extracting Components and Connections for XR (Toronto Meta Llama Hackathon)

    Python 7

  6. Deep-Reinforcement-Learning-CS285-Pytorch Deep-Reinforcement-Learning-CS285-Pytorch Public

    Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

    Python 144 11