find size of heap allocated

rupeshkp728 · May 5, 2011, 5:07am

I want to find the size of the total memory allocated on the heap for the following statement:
int* a = new int[1000];How can I use the sizeof operator for this?
I used:
printf("\t===> %d\n",sizeof(*a));

Is this statement correct?

I have asked the question because when I checked the memory of heap allocated in windbg it shows me the size as fc4 i.e. 4036 which is more then 4000 and not as desired. Any hint as to what may be the cause?

kumaran_5555 · May 5, 2011, 6:19am

This will not take memory from heap. This would be allocated from stack because this is a local variable and you aren't using malloc() / any alloc() functions.

But if you want to see the size of heap, you can check using pmap <PID>

which will display the anonymous pages allocated for this process, which also includes the stack.

---------- Post updated at 03:49 PM ---------- Previous update was at 03:16 PM ----------

This statement will print the size of one integer.

rupeshkp728 · May 5, 2011, 6:20am

hey kumaran in C++ new is used in place of malloc of C and so the memory will be allocated from heap.

Loic_Domaigne · May 5, 2011, 4:21pm

The statement is not correct, you'll get the size of an int on your architecture.

windbg... hmm... that's doesn't sound really Un*x...

Remember that new is an operator provided by your compiler, we don't know what it does... It may for instance add a header for internal bookkeeping; or perhaps the underlying allocation mechanism on your system imposed this... Who knows?

You'll have to look at the interna of the new operator to find it out.

Cheers, Lo�c

Corona688 · May 5, 2011, 5:20pm

Only the heap knows that. But unless you're allocating lots of really tiny things it's going to be pretty close to what you asked for.

alister · May 5, 2011, 6:53pm

I know very little about C++. The following comment is not intended to address the original poster's issue. It's just a brief response to what seems to be a tendency to equate malloc'd memory with the heap.

malloc is not required to allocate from the heap. In fact, quite a few implementations may not do so. Some use mmap exclusively (e.g. OpenBSD). Some may use sbrk for some allocations and mmap for others (e.g. Linux (glibc), FreeBSD).

Where malloc'd memory resides is highly implementation dependent.

Regards,
Alister

Praveen_218 · May 6, 2011, 3:46am

How come????

The arrary is going to be on the heap of the process memory only.
However, it's the pointer variable 'a' is going to be over the stack frame of the function using that as it's local.

---------- Post updated at 01:16 PM ---------- Previous update was at 12:51 PM ----------

@aliter ,
Thanks for the information, that's definitely going to be a direction, I'll need to investigate a bit further for my own shake.

To sum up, do you mean to say, in other words, that it's the implementation of heap which is going to be different in different systems?

That's what I'm aware of and most logical thing to conclude of when two operating systems are developed altogether differently and separately.

Also its never the compiler which actually allocates these heap related object/variables rather the OS memory manager allocates them at the run time only.

Its perfectly okey to have different implementations of the actual resident memory area of a heap based variables. The running process (which owns these variables) just treat them at a memory location that's always upper bound and assumes such memory area altogether grows outwards (the stack grows in the direction opposite to the heap, which is inwards).

The whole thing could also be implemented as to simulate this behavior and that's what many virtual memory implementations do.

Not only that, the whole process memory area could altogether be implemented differently (over different platforms) but they give the same feeling to the running process by exposing similar interfaces and response from the operating platform the process is executing on.

Your information was important to me more because of the fact you have clubbed FreeBSD & Linux implementations together.

Might be Linux code implementing these behaviors find it's root in BSD repository tree only, but I'm just speculating here.

gaurav1086 · May 6, 2011, 5:51am

Hi ,

You can use sizeof() operator to find the size of any data structure by providing the base address of the structure. the implementation of memory allocation are different on different architectures and they may allocate slightly more space than you requested for in order to have alignment of the memory allocated which can increase performance benefits.

Thanks and Regards,
Gaurav.

shamrock · May 6, 2011, 12:21pm

heap is just a moniker given to the dynamic part of the data segment...the part from where memory is allocated at runtime.

Whether mmap or sbrk is used for mem allocation...the point is that it is always allocated from the dynamic portion of the data segment aka the heap.

Not exactly because of the the above.

alister · May 6, 2011, 1:32pm

Incorrect. The "dynamic portion of the data segment" is delimited by the break. The break is manipulated with the brk() and sbrk() system calls. As its name implies, the segment is contiguous. mmap(), however, is allowed to map/allocate memory at random locations in non-contiguous segments beyond the break. Such behavior has nothing to do with the heap.

Perhaps your mmap implementation does not do so, but there's nothing to forbid it.

Alternatively, you have a very loose definition of "data segment".

Regards,
Alister

shamrock · May 6, 2011, 4:19pm

Yes you are right the heap is capped by the break pointer...and mmap allocates memory in the hole between break and the upper bound of the stack segment.

All segments are contiguous virtually speaking but none are contiguous physically.

mmap is only used in openbsd for dynamic storage allocation and since my system isnt openbsd it still uses malloc.

That might be true as anything between the text and stack segment i'm calling collectively the data segment

alister · May 6, 2011, 5:35pm

I believe that the malloc statements in my original post in this thread are 100% correct. At the risk of appearing to be trying to win an internet argument, I am dissecting your responses only in the hope that the thread contains useful and accurate information for those who may read it in the future.

Well, for the purposes of this discussion, physical memory is irrelevant; it's all about process virtual memory maps. Still, the statement isn't quite accurate. It implies that each segment is guaranteed to be fragmented in physical memory. It very well could be, but it's not a certainty.

OpenBSD uses malloc(), as do all UNIX/POSIX systems (to my knowledge).

You seem to be confusing interfaces that operate at different levels. mmap() is a lower level system call whose implementation is part of the kernel. malloc() is a higher level function whose implementation is part of the userland standard C library. malloc() does not directly allocate any memory; it must invoke a system call to do the job. The question is, which system call is used. Some malloc() implementations use brk()/sbrk(), some use mmap(), some use both.

If you're using Linux, you're almost certainly using glibc, and glibc's malloc implementation uses sbrk() for small allocations and mmap() for large ones.

FreeBSD tries to use mmap() for all allocations but if mmap() fails it will retry with sbrk().

OpenBSD's malloc() uses mmap() exclusively.

I believe OSX's malloc() uses mmap(), but I am not sure.

I could be mistaken, but I suspect that if there are any malloc() implementations still in widespread use which depend exclusively on brk()/sbrk(), they belong to the more conservative, proprietary Unices out there.

You are of course free to choose your nomenclature as you wish, but in this instance your usage is not in agreement with standard practice.

Being a rebel only makes life harder. Submit. Be one with the collective. Resistance is futile.

Regards,
Alister

achenle · May 6, 2011, 9:04pm

man mapmalloc

There are a lot of ways to make a heap.

Sun once even released a library that when preloaded would put your heap into shared memory. But it may have only been for SPARC, and it was definitely only for 32-bit processes. It also failed miserably for multithreaded processes, IIRC.

So I wrote a better one for a client who wanted fast restarts for a process that used about a few-hundred GB heap. We'd create and zero-fill a single huge ISM shared memory segment upon system boot, when it would be the fastest since memory wasn't fragmented yet. Then the process would start up in about 10 seconds, instead of the 10+ minutes or longer it would take to create and zero-fill the giant heap.

And yes, you have to zero-fill such a heap upon creation, or whenever your application code that by requirement has to be really, really fast may very well hang up waiting for the OS to actually create the VM page the process merely reserved with its call to brk()/sbrk(). Like trying to do a few tens of GB of IO from a SAN at a few GB/sec into a newly-created chunk of memory that hasn't actually been allocated yet... That can generate some nasty problems since the data is coming in faster than the kernel's VM code can create application pages to put them into.

Corona688 · May 6, 2011, 9:56pm

Or if your system supports it, MAP_POPULATE.

hugepages may also be helpful here.