0xV3n0m

Announcement

Welcome To My Personal Blog

1.15 More about results returning

The author said that in x86, the result of function execution is usually returned in the EAX register. If the type is byte or char, the lower part of the EAX register which is AL is used. If the function returns a float number, the FPU register ST(0) is used. In ARM, the result is usually returned in the R0 register.

1.15.1 Attempt to use the result of a function returning void

Well, what happens if the return value of main() was void not int?

The so-called startup-code calls main() approximately like this:

1
push envp ; push the environment pointer onto the stack
2
push argv ; push the argument vector onto the stack
3
push argc ; push the argument count onto the stack
4
call main ; call the main function
5
push eax ; push the return value from main (in EAX) onto the stack
6
call exit ; call the exit function with the pushed value

In other words

1
exit(main(argc, argv, envp)); // call exit with the return value of main as argument

If you wrote void main() instead of int main(), what happens?

void main() means that no value is expected to be returned explicitly. But the EAX register may contain any meaningless value (leftover) from previous instructions.
When the startup code does push eax after call main, it will send the value in EAX to exit() — and therefore the exit code will be a random value or a value from the last executed function (like puts() or printf() if used).

We can illustrate this with code like this:

1
#include <stdio.h> // include standard I/O header
2
void main() // main function declared as void, no return value
3
{
4
    printf("Hello, world!\\n"); // print "Hello, world!" followed by newline
5
};

GCC here might replace printf with puts.

puts() returns the number of characters it printed in EAX. If main didn't return a value, EAX will retain this value.

1
.LC0: // label for the string
2
.string "Hello, world!" // define the string "Hello, world!"
3
main: // start of main function
4
      push ebp // save base pointer
5
      mov ebp, esp // set base pointer to stack pointer
6
      and esp, -16 // align stack to 16-byte boundary
7
      sub esp, 16 // allocate 16 bytes on stack
8
      mov DWORD PTR [esp], OFFSET FLAT:.LC0 // store string address on stack
9
      call puts // call puts to print the string
10
      leave // restore base and stack pointers
11
      ret // return from function

We write a bash script that displays the exit status:

Listing 1.101: tst.sh

1
#!/bin/sh // shebang for shell script
2
./hello_world // run the hello_world executable
3
echo $? // echo the exit status of the previous command

And we run it:

1
$ tst.sh
2
Hello, world!
3
14

14 is the number of characters that were printed.

The number of characters leaked from printf() (or puts) through EAX/RAX and entered as “exit code”.

By the way, when we decompile C++ with Hex-Rays, sometimes we encounter a function that ends with a class destructor:

1
...
2
call ??1CString@@QAE@XZ ; CString::CString(void) // call the CString destructor
3
mov ecx, [esp+30h+var_C] // move value from stack to ECX
4
pop edi // pop EDI from stack
5
pop ebx // pop EBX from stack
6
mov large fs:0, ecx // move ECX to FS:0 (thread information block)
7
add esp, 28h // add 28h to ESP (clean stack)
8
retn // return from function

According to the C++ standard, the destructor does not return anything, but when Hex-Rays does not know that, and thinks that the destructor and the function itself return int, we see something like this in the outputs:

1
...
2
return CString::~CString(&Str); // Hex-Rays mistakenly shows destructor as returning value
3
}

In a clearer sense, it is that when Hex-Rays saw retn, it said that surely this Function returns a Value even though in reality this is just a return to the Caller, nothing more.

1.15.3 Returning a structure

The author then explained and said the truth is that the return value is computed in the EAX register.

And without much chatter, the reason is that old C compilers could not make a function return something that does not fit in one register (usually int)

If one needs to return something bigger, he must return the data through pointers sent as arguments to the function.

So it is very normal that a function returns one value only, and the rest returns it through pointers.

Now we can return a full struct, but the subject is not famous.

If a function must return a large struct, the function that calls it (the caller) must allocate it and send a pointer to it as the first argument, and this happens hidden from the programmer.

Meaning it is the same idea as if you send a pointer in the first argument by hand, but the compiler hides this.

A small example:

1
struct s { // define structure s
2
    int a; // field a
3
    int b; // field b
4
    int c; // field c
5
};
6

7
struct s get_some_values(int a) // function that returns struct s
8
{
9
    struct s rt; // local struct rt
10
    rt.a = a+1; // set rt.a to a+1
11
    rt.b = a+2; // set rt.b to a+2
12
    rt.c = a+3; // set rt.c to a+3
13
    return rt; // return the struct
14
};

What we got (MSVC 2010 /Ox):

1
$T3853 = 8 ; size = 4 // temporary variable for struct pointer
2
_a$ = 12 ; size = 4 // parameter a
3
?get_some_values@@YA?AUs@@H@Z PROC ; get_some_values // start of function
4
mov ecx, DWORD PTR _a$[esp-4] // move a to ECX
5
mov eax, DWORD PTR $T3853[esp-4] // move struct pointer to EAX
6
lea edx, DWORD PTR [ecx+1] // load a+1 to EDX
7
mov DWORD PTR [eax], edx // store a+1 in struct.a
8
lea edx, DWORD PTR [ecx+2] // load a+2 to EDX
9
add ecx, 3 // add 3 to ECX (a+3)
10
mov DWORD PTR [eax+4], edx // store a+2 in struct.b
11
mov DWORD PTR [eax+8], ecx // store a+3 in struct.c
12
ret 0 // return
13
?get_some_values@@YA?AUs@@H@Z ENDP ; get_some_values // end of function

The micro that the compiler uses here to pass the pointer to the struct is named $T3853.

We can write the same example using C99:

1
struct s { // define structure s
2
    int a; // field a
3
    int b; // field b
4
    int c; // field c
5
};
6

7
struct s get_some_values(int a) // function that returns struct s
8
{
9
    return (struct s){.a=a+1, .b=a+2, .c=a+3}; // return initialized struct
10
};

GCC 4.8.1:

1
_get_some_values proc near // start of function
2
ptr_to_struct = dword ptr 4 // pointer to struct parameter
3
a = dword ptr 8 // parameter a
4
mov edx, [esp+a] // move a to EDX
5
mov eax, [esp+ptr_to_struct] // move struct pointer to EAX
6
lea ecx, [edx+1] // load a+1 to ECX
7
mov [eax], ecx // store a+1 in struct.a
8
lea ecx, [edx+2] // load a+2 to ECX
9
add edx, 3 // add 3 to EDX (a+3)
10
mov [eax+4], ecx // store a+2 in struct.b
11
mov [eax+8], edx // store a+3 in struct.c
12
retn // return
13
_get_some_values endp // end of function

As we see, the function fills the fields of the struct that was allocated before by the calling function, as if a pointer to the struct was sent as an argument.

So there is no loss in performance.

To make this part easier for you, I'll explain with a simple explanation that clarifies things a bit.

First, this is the big instruct will be in this shape for example

1
struct s { // define structure s
2
   int a; // field a
3
   int b; // field b
4
   int c; // field c
5
};

This will be its shape in memory

1
┌─────────┐
2
│   a     │
3
├─────────┤
4
│   b     │
5
├─────────┤
6
│   c     │
7
└─────────┘

The caller now before calling the function get_some_values(a)

He does this, allocates a place for the struct in memory like this

1
Caller Memory:
2
┌──────────────────────────┐
3
│  Empty space to save struct   │  ← It will be returned here
4
│ Address = 5000            │
5
└──────────────────────────┘

And after that sends the address of this place to the function as a hidden argument

1
Caller
2
   │
3
   │  sends pointer = 5000
4
   ▼
5
Callee (get_some_values)

At that time the function receives a pointer to an empty place and starts writing the values inside it

1
Address 5000:
2
┌─────────┐
3
│  a=a+1  │
4
├─────────┤
5
│  b=a+2  │
6
├─────────┤
7
│  c=a+3  │
8
└─────────┘

And this is the final shape

1
Caller memory:
2
┌────────────────────────────┐
3
│ struct at 5000:            │
4
│   a = a+1                  │
5
│   b = a+2                  │
6
│   c = a+3                  │
7
└────────────────────────────┘
8
              ↑
9
              │
10
   callee wrote the values here

After now the function finishes, the function does not return the struct directly, she returns the pointer that you originally sent (hidden)

So the caller sees the full struct appeared to him:

1
return value ← same address 5000
2

3
Caller now sees:
4
a = a+1
5
b = a+2
6
c = a+3

And this is a summary for all this talk

Share

If this article helped you, please share it with others!

CH1.14 - More About Results Returning

https://v3nn00m.github.io/posts/re4b/chapter115/

Author

0xV3n0m

Published at

2025-12-11

License

0xV3n0m's Personal Blog License

Some information may be outdated

CH1.15 - Pointers

CH1.13 - Global vs. Local Variables & Accessing Passed Arguments

0xV3n0m

1.15 More about results returning

1.15.1 Attempt to use the result of a function returning void

1.15.3 Returning a structure

Table of Contents