erfanMhi

Follow

Erfan Miahi erfanMhi

Follow

Working on really hard problems, probably the problem of intelligence.

135 followers · 81 following

Templar
Canada, Toronto
https://www.linkedin.com/in/erfan-miahi-8637a1130/
https://orcid.org/0000-0001-7510-083X
@erfan_mhi
erfan_mhi
in/erfan-miahi-8637a1130

Achievements

Achievements

Organizations

Pinned Loading

one-covenant/grail one-covenant/grail Public

interplanetary intelligence

Python 21 11
rlvr_pipeline rlvr_pipeline Public

A composable component orchestrator for Reinforcement Learning from Verifiable Rewards (RLVR) training of Large Language Models on reasoning tasks.

Python 1
intractai/IntractCodeAPI intractai/IntractCodeAPI Public

An API designed for code completion and fine-tuning of open-source large language models on internal codebases and documents.

Python 13 2
base_reinforcement_learning base_reinforcement_learning Public

This is the code-base that I personally use as the starting point for any reinforcement learning codebase with the purpose of fast experimentation and analysis.

Python 13 1
flypi flypi Public

Circuit Analysis for Extracting Components and Connections for XR (Toronto Meta Llama Hackathon)

Python 7
Deep-Reinforcement-Learning-CS285-Pytorch Deep-Reinforcement-Learning-CS285-Pytorch Public

Solutions of assignments of Deep Reinforcement Learning course presented by the University of California, Berkeley (CS285) in Pytorch framework

Python 144 11