Skip to content

Pull requests: mlc-ai/mlc-llm

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

[Serving] Add speculative decoding
#1539 by KnowingNothing was merged Jan 10, 2024 Loading…
[SpecDecode] Support Eagle in speculative decoding
#2080 by KnowingNothing was merged Apr 12, 2024 Loading…
[Serving] Prefix Cache
#2295 by cyx-6 was merged May 21, 2024 Loading…
PoC implementation of SmoothQuant
#855 by ibsidorenko was closed Jun 24, 2024 Loading…
gradio polish and minigpt cli
#496 by Kathryn-cat was merged Jul 7, 2023 Loading…
[SLM] Batched Llama
#1520 by MasterJH5574 was merged Jan 4, 2024 Loading…
[Serving][Grammar] BNF AST and Parser for EBNF grammar
#1534 by Ubospica was merged Jan 12, 2024 Loading…
Add support for specifying custom model path
#140 by sudeepag was merged May 16, 2023 Loading…
[Model] Initial batching support for Llama
#1048 by MasterJH5574 was merged Oct 14, 2023 Loading…
[iOS] support for multimodal
#524 by Kathryn-cat was merged Jul 17, 2023 Loading…
[MultiGPU] Support pre-sharded model weights
#1096 by Lunderberg was merged Nov 9, 2023 Loading…
[KVCache] Migrate Baichuan model to PagedKVCache
#1854 by tlopex was merged Feb 28, 2024 Loading…
[SLM][AutoLLM] Enable Command Line Weight Conversion
#1170 by zxybazh was merged Nov 2, 2023 Loading…
[SLM] Add support for StableLM architecture
#1701 by rickzx was merged Feb 3, 2024 Loading…
[SLM] Add support for Baichuan2 architecture
#1755 by tlopex was merged Feb 15, 2024 Loading…
[REST] OpenAI compatible Rest API
#1107 by Kartik14 was merged Oct 24, 2023 Loading…
[Serving][Grammar] BNF grammar simplifier and matcher
#1801 by Ubospica was merged Feb 24, 2024 Loading…
[Serving] Support "n" for parallel generation
#1868 by MasterJH5574 was merged Mar 2, 2024 Loading…
[SLM] Integration with Disco sharding.
#1212 by LeshengJin was merged Dec 10, 2023 Loading…
[SLIM] Allow dynamic shape parameters
#1417 by CharlieFRuan was merged Jan 17, 2024 Loading…
[Multimodal Support] Add MiniGPT4
#390 by Kathryn-cat was merged Jun 22, 2023 Loading…
6 tasks done
[SLM] Enable FasterTransformer quantization
#1480 by cyx-6 was merged Jan 2, 2024 Loading…
ProTip! Updated in the last three days: updated:>2025-12-01.