
Expose MLX memory management APIs#98

Merged
polvalente merged 4 commits into elixir-nx:main from dannote:add-memory-management-apis on Feb 22, 2026

Conversation

Contributor

@dannote dannote commented Feb 22, 2026

Adds bindings for MLX's memory management functions:

  • EMLX.memory_info/0 — returns %{active_memory, peak_memory, cache_memory} in bytes
  • EMLX.clear_cache/0 — releases unused GPU memory back to the system
  • EMLX.reset_peak_memory/0 — resets the peak memory counter
  • EMLX.set_memory_limit/1 — sets the memory limit guideline
  • EMLX.set_cache_limit/1 — sets the cache size limit

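A minimal usage sketch of the five functions above (the limit values here are illustrative choices, not defaults from this PR):

```elixir
# Inspect current allocator state (all values in bytes).
info = EMLX.memory_info()
IO.inspect(info.active_memory, label: "active")

# Cap the buffer cache at 512 MiB and set a 4 GiB memory guideline.
EMLX.set_cache_limit(512 * 1024 * 1024)
EMLX.set_memory_limit(4 * 1024 * 1024 * 1024)

# Start a fresh peak measurement and drop any cached buffers.
EMLX.reset_peak_memory()
EMLX.clear_cache()
```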
Why this is needed

Without clear_cache, repeated model inference causes GPU memory to grow unbounded as MLX caches freed buffers. On a 24 GB Apple M5 running ai-forever/FRIDA (823M parameter T5 encoder), memory grows from 3 GB to 18 GB after just 4 inference batches, causing severe system-wide slowdowns as the GPU starts swapping:

Batch 1:  4873ms = 13.1 sent/s
Batch 2:  5619ms = 11.4 sent/s
Batch 3: 31385ms =  2.0 sent/s  ← GPU memory exhausted
Batch 4: 69177ms =  0.9 sent/s

With EMLX.clear_cache() + :erlang.garbage_collect() between batches:

Batch 1: 4517ms = 14.2 sent/s
Batch 2: 4587ms = 14.0 sent/s
Batch 3: 4517ms = 14.2 sent/s
Batch 4: 4556ms = 14.0 sent/s  ← stable
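The pattern behind these numbers can be sketched as follows (`batches` and `embed_batch/1` are hypothetical stand-ins for the caller's data and model invocation, not part of this PR):

```elixir
for batch <- batches do
  _embeddings = embed_batch(batch)

  # Let the BEAM release tensor references first, then return
  # MLX's cached (but unused) buffers to the system.
  :erlang.garbage_collect()
  EMLX.clear_cache()
end
```

The order matters: garbage collection drops the Elixir-side references so the underlying MLX buffers become unused, and `clear_cache/0` then frees them instead of leaving them in MLX's buffer cache.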

All 2117 tests pass (2115 existing + 6 new memory tests).

}

NIF(clear_cache) {
  mlx::core::clear_cache();
Collaborator

Do these return void?

t = Nx.iota({1024, 1024}, type: :f32, backend: EMLX.Backend)
EMLX.eval(EMLX.Backend.from_nx(t))
after_alloc = EMLX.memory_info().active_memory
assert after_alloc > before
Collaborator

This assertion could be more strict since we know the tensor size
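One way to tighten it, assuming the f32 tensor's 1024 × 1024 × 4 bytes is reflected in `active_memory` (using `>=` rather than `==` since the allocator may round allocations up):

```elixir
before = EMLX.memory_info().active_memory

t = Nx.iota({1024, 1024}, type: :f32, backend: EMLX.Backend)
EMLX.eval(EMLX.Backend.from_nx(t))

after_alloc = EMLX.memory_info().active_memory
# 1024 * 1024 f32 elements = exactly 4 MiB; rounding can only add to that.
assert after_alloc >= before + 1024 * 1024 * 4
```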

EMLX.eval(EMLX.Backend.from_nx(t))
Nx.backend_deallocate(t)
EMLX.clear_cache()
info = EMLX.memory_info()
Collaborator

Do we have assertions on what info returns?
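A sketch of what such an assertion could look like, with the key set taken from the PR description:

```elixir
info = EMLX.memory_info()

assert %{active_memory: active, peak_memory: peak, cache_memory: cache} = info
assert is_integer(active) and active >= 0
assert is_integer(peak) and peak >= 0
assert is_integer(cache) and cache >= 0
```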

{EMLX.Backend, device: device}
end

@doc """
Collaborator

These docs could use some examples, even if they aren't doctests per se.
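For instance, the `clear_cache/0` docs could show the batch-loop pattern from the PR description (illustrative rather than a doctest, since the memory numbers are hardware-dependent):

```elixir
@doc """
Releases unused cached GPU buffers back to the system.

## Example

Between inference batches, to keep memory bounded:

    :erlang.garbage_collect()
    EMLX.clear_cache()
"""
```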

@polvalente polvalente merged commit 2d5742d into elixir-nx:main Feb 22, 2026
11 of 12 checks passed
