Only the designers of the .NET runtime can tell you their actual reasoning. But my guess is that memory safety is paramount in .NET, and it would be very expensive to maintain both memory safety and mutable array lengths, not to mention how much more complicated any code dealing with arrays would be.
Consider the simple case:
```csharp
var fun = 0;
for (var i = 0; i < array.Length; i++)
{
    fun ^= array[i];
}
```
To maintain memory safety, every array access must be bounds-checked, while ensuring that the bounds checking isn't broken by other threads (the .NET runtime has much stricter guarantees than, say, the C compiler).
So you need a thread-safe operation that reads data from the array while checking the bounds at the same time. There's no such instruction on the CPU, so your only option is a synchronization primitive of some kind. Your code turns into:

```csharp
var fun = 0;
for (var i = 0; i < array.Length; i++)
{
    lock (array)
    {
        if (i >= array.Length)
            throw new IndexOutOfRangeException(...);
        fun ^= array[i];
    }
}
```
Needless to say, this is hideously expensive. Making the array length immutable gives you two massive performance gains:
- Since the length cannot change, the bounds checking doesn't need to be synchronized. This makes each individual bounds check vastly cheaper.
- ... and you can omit the bounds checking if you can prove the safety of doing so.
In reality, what the runtime actually does ends up being something more like this:
```csharp
var fun = 0;
var len = array.Length; // Provably safe
for (var i = 0; i < len; i++)
{
    // Provably safe, no bounds checking needed
    fun ^= array[i];
}
```
You end up with a tight loop, no different from what you'd have in C - but at the same time, it's entirely safe.
Now, let's see the pros and cons of adding array shrinking the way you want it:
Pros:
- In the very rare scenario where you'd want to make an array smaller, this would mean the array doesn't need to be copied over to change its length. However, it would still require a heap compaction in the future, which involves a lot of copying.
- If you store object references in the array, you might get some benefits from cache locality if the allocation of the array and the items happens to be colocated. Needless to say, this is even rarer than Pro #1.
Cons:
- Any array access would become hideously expensive, even in tight loops. So everyone would use `unsafe` code instead, and there goes your memory safety.
- Every single piece of code dealing with arrays would have to expect that the length of the array can change at any time. Every single array access would need a `try ... catch (IndexOutOfRangeException)`, and everyone iterating over an array would need to be able to deal with the changing size - ever wondered why you can't add or remove items from a `List<T>` you're iterating over?
- A huge amount of work for the CLR team that couldn't be spent on another, more important feature.
There are some implementation details that make this even less of a benefit. Most importantly, the .NET heap has nothing to do with malloc/free patterns. If we exclude the LOH, the current MS.NET heap behaves in a completely different way:
- Allocations are always from the top, like on a stack. This makes allocations almost as cheap as stack allocation, unlike `malloc`.
- Due to the allocation pattern, to actually "free" memory you must compact the heap after doing a collection. This moves objects so that the free spaces in the heap are filled, which makes the "top" of the heap lower, which allows you to allocate more objects in the heap, or just free the memory for use by other applications on the system.
- To help maintain cache locality (on the assumption that objects that are commonly used together are also allocated close to one another, which is quite a good assumption), this may involve moving every single object in the heap that's above the freed space down. So you might have saved yourself a copy of a 100-byte array, but then you have to move 100 MiB of other objects anyway.
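The allocation and compaction pattern above can be sketched as a toy model in C (this is an illustration of the idea, not the actual CLR implementation; all names here are made up):

```c
#include <stddef.h>
#include <string.h>

// Toy bump-pointer heap: allocation just advances a "top" index,
// which is why allocating on a heap like this is nearly as cheap
// as pushing onto a stack.
static unsigned char heap[1024];
static size_t top = 0; // index of the next free byte

static void *bump_alloc(size_t size)
{
    if (top + size > sizeof(heap))
        return NULL; // out of space: a real runtime would collect and compact
    void *p = &heap[top];
    top += size;
    return p;
}

// Compaction: slide the live data above a freed hole downwards,
// lowering "top" so the space can be reused. Note that this moves
// every byte above the hole, however small the hole itself is.
static void compact(size_t hole_start, size_t hole_size)
{
    memmove(&heap[hole_start], &heap[hole_start + hole_size],
            top - (hole_start + hole_size));
    top -= hole_size;
}
```

Freeing a 16-byte hole in the middle forces everything above it to move down - which is exactly why saving the copy of one small array buys you nothing if compaction later moves megabytes of other objects anyway.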
Additionally, as Hans explained very well in his answer, just because the array is smaller doesn't necessarily mean that there's enough space for a smaller array in the same amount of memory, due to the object headers (remember how .NET is designed for memory safety? Knowing the right type of an object is a must for the runtime). But what he doesn't point out is that even if you do have enough memory, you still need to move the array. Consider a simple array:
ObjectHeader,1,2,3,4,5
Now, we remove the last two items:
OldObjectHeader;NewObjectHeader,1,2,3
Oops. We need the old object header to keep the free-space list; otherwise we couldn't compact the heap properly. Now, the old object header could be moved past the array to avoid the copy, but that's yet another complication. This is turning out to be quite an expensive feature for something that no one will ever use, really.
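The bookkeeping problem can be made concrete with a toy model in C (the real CLR header is a sync block index plus a method-table pointer; the struct and type tags below are made up for illustration): shrinking an array in place leaves a tail of freed bytes, and that tail needs a header of its own so a heap walker stepping from header to header doesn't land in garbage.

```c
#include <assert.h>
#include <stddef.h>

// Toy object header: every heap object starts with one so the runtime
// can identify its type and size when walking the heap.
typedef struct {
    int type;   // made-up type tag
    size_t len; // payload length in bytes
} Header;

#define TYPE_ARRAY 1
#define TYPE_FREE  2 // filler "free object" keeping the heap walkable

// Shrink an array in place: the freed tail cannot simply be abandoned.
// It must be turned into a free object with its own header, which only
// works if the freed space is big enough to hold a header at all.
static void shrink_in_place(unsigned char *obj, size_t new_len)
{
    Header *h = (Header *)obj;
    size_t freed = h->len - new_len;
    assert(freed >= sizeof(Header)); // no room for a tail header otherwise
    h->len = new_len;
    Header *tail = (Header *)(obj + sizeof(Header) + new_len);
    tail->type = TYPE_FREE;
    tail->len = freed - sizeof(Header);
}
```

Note the assert: if you shrink by fewer bytes than a header occupies, there is nowhere to put the tail header - the "even if you have enough memory" problem from above.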
And that's all still in the managed world. But .NET is designed to allow you to drop down to unsafe code if necessary - for example, when interoperating with unmanaged code. Now, when you want to pass data to a native application, you have two options - either you pin the managed handle, to prevent it from being collected and moved, or you copy the data. If you're doing a short, synchronous call, pinning is very cheap (though more dangerous - the native code doesn't have any safety guarantees). The same goes for e.g. manipulating data in a tight loop, like in image processing - copying the data is clearly not an option. If you allowed Array.Resize to change the existing array, this would break entirely - so Array.Resize would need to check if there's a handle associated with the array you're trying to resize, and throw an exception if that happens.
More complications, and much harder to reason about (you're going to have tons of fun tracking down a bug that only occurs once in a while, when Array.Resize happens to hit an array that just so happens to be pinned in memory at that moment).
As others have explained, native code isn't much better. While you don't need to maintain the same safety guarantees (which I wouldn't really take as a benefit, but oh well), there are still complications related to the way you allocate and manage memory. Called realloc to shrink a 10-item array to 5 items? Well, it's either going to be copied, or it's still going to occupy the size of a 10-item array, because there's no way to reclaim the left-over memory in any reasonable manner.
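You can see this directly in C. The standard guarantees only that the values up to the new size survive a shrinking realloc; whether the block is moved (copied) or left in place, still occupying the original region internally, is entirely up to the implementation:

```c
#include <stdlib.h>

// Shrink a 10-int block to 5 ints with realloc and report whether the
// surviving values are intact. Returns 1 on success, 0 on failure.
static int demo_shrink(void)
{
    int *arr = malloc(10 * sizeof(int));
    if (arr == NULL)
        return 0;
    for (int i = 0; i < 10; i++)
        arr[i] = i;

    // May copy the data to a new block, or may keep the old one -
    // the caller has no way to tell, and no way to reclaim the tail.
    int *smaller = realloc(arr, 5 * sizeof(int));
    if (smaller == NULL) {
        free(arr); // on failure the original block is still valid
        return 0;
    }

    for (int i = 0; i < 5; i++) {
        if (smaller[i] != i) {
            free(smaller);
            return 0;
        }
    }

    free(smaller);
    return 1;
}
```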
So, to make a quick summary: you're asking for a very expensive feature that would be of very limited benefit (if any) in an extremely rare scenario, and for which a simple workaround exists (writing your own array class). I don't see that passing the bar for "Sure, let's implement this feature!" :)
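That workaround - a wrapper that tracks a logical length separately from the allocated capacity, which is essentially what List<T> does - takes only a few lines in any language. A C sketch (all names made up):

```c
#include <assert.h>
#include <stddef.h>
#include <stdlib.h>

// A "shrinkable array": the allocation keeps its original capacity,
// and shrinking just lowers the logical length - no copying, no heap
// bookkeeping, no moved pointers.
typedef struct {
    int *data;
    size_t length;   // logical length, what users see
    size_t capacity; // real allocation size, never changes here
} IntArray;

static int int_array_init(IntArray *a, size_t capacity)
{
    a->data = malloc(capacity * sizeof(int));
    if (a->data == NULL)
        return 0;
    a->length = capacity;
    a->capacity = capacity;
    return 1;
}

// Shrinking is O(1): just a bounds change.
static void int_array_shrink(IntArray *a, size_t new_length)
{
    assert(new_length <= a->length);
    a->length = new_length;
}

static void int_array_free(IntArray *a)
{
    free(a->data);
    a->data = NULL;
    a->length = a->capacity = 0;
}
```

The trade-off is the same one the runtime would face: the unused tail of the allocation is not returned to the system - but you get the cheap shrink without touching the memory manager at all.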
With `malloc` and `realloc`, sometimes `realloc` moves objects around, sometimes it doesn't. One of the reasons it sometimes does is that the memory manager has a separate pool for small memory blocks. Reducing a large memory block to a small one may then require moving it to the separate pool. But I couldn't say if it's the same for .NET.