GDB stub and async shim rework #29

encounter · 2024-07-31T00:29:53Z

This implements a working GDB stub for x86-emu and x86-unicorn. One larger change is the new state machine in cli/main.rs. This code is now shared between x86-emu and x86-unicorn: it allows stopping, resuming and single-stepping the machine emulation, handling breakpoints and machine errors in a unified way.

Another larger change is reworking the async shim handling. There's no longer any backend-specific code in builtin.rs. (Take a look at the simplified implementation!) x86-unicorn was updated to use the x86-emu approach for future handling (using eip=MAGIC_ADDR to poll futures instead of executing the CPU), though I'd like to consider ideas to unify this code between the two as well, if possible.

Other changes:

x86-unicorn: Unmapped first 4k of memory to catch zero page accesses
Removed --trace-blocks because I didn't feel like reimplementing it (I could, though, if desired)
Removed x86-emu snapshot feature. I broke it while refactoring things to use Pin<>, and decided to rip it out and revisit later if it's still a desired feature. One big issue is that it's incompatible with any running futures.

TODO:

Fix x86-64 build
Fix web build
Multi-threading support
Restore --trace-points

evmar

I am sad to give up on snapshotting, but I agree it is impossible(?) to preserve in the face of the futures. It is a super bummer: earlier I had partway implemented a reverse debugger that used snapshots. Also, serde is a lot of goop that I would rather not deal with in general, so dropping it is nice I guess.

I am generally positive on this but it would be nice to preserve --trace-points as we discussed in chat.

web/glue/src/lib.rs

win32/derive/src/gen.rs

evmar · 2024-08-01T16:02:10Z

win32/src/machine_emu.rs

            }
-            _ => return false,
+            x86::CPUState::Error(message) => StopReason::Error {


BTW, one reason I had a weird API here that returns a bool is that this run function is the hottest code when emulating. When it returns a constructed StopReason enum, the caller is responsible for tearing that StopReason down when it drops, which means the drop impl needs to check whether it's a StopReason::Error and, if so, free the message. I cannot remember whether I measured this mattering or was just worrying about it in the abstract.

I'm not sure the drop check here will matter much; it won't have to do anything in most cases. Happy to rework it if you have suggestions, though.
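To make the drop-cost question concrete, here is a minimal sketch using `std::mem::needs_drop`. Both enums are hypothetical stand-ins, not the PR's actual `StopReason` definition: the first owns a `String` in its error variant, the second carries only plain integers. The compiler only needs to emit drop glue for the first, and that glue amounts to a discriminant check that frees the message in the `Error` case.

```rust
use std::mem::needs_drop;

/// Hypothetical stand-in for the PR's StopReason: one variant owns a heap String.
#[allow(dead_code)]
pub enum StopReason {
    None,
    Blocked,
    Breakpoint { eip: u32 },
    Error { message: String, eip: u32 },
}

/// Hypothetical alternative with no owned data, so dropping it does nothing.
#[allow(dead_code)]
#[derive(Clone, Copy)]
pub enum CheapStop {
    None,
    Blocked,
    Breakpoint { eip: u32 },
    Error { eip: u32 },
}

/// Reports whether each type requires drop glue: (StopReason, CheapStop).
pub fn drop_glue_report() -> (bool, bool) {
    (needs_drop::<StopReason>(), needs_drop::<CheapStop>())
}
```

Whether that extra branch is measurable in the hot loop would still need a benchmark; this only shows where the cost comes from.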

evmar · 2024-08-01T16:06:26Z

win32/src/machine.rs

+    /// The CPU hit a debug breakpoint.
+    Breakpoint { eip: u32 },
+    /// The CPU hit a shim call.
+    ShimCall(&'static Shim),


BTW, I have been (slowly) working on a change to how shims work that will require something very similar to this.

The summary is that kernel32.dll becomes a real dll that contains e.g.

_ReadFile:
   syscall
   ret 32

and then the cpu's syscall handler will trigger a StopReason like the above.

(The motivation for this change is (1) using a real dll makes it easier to handle exposing raw symbols from dlls as needed for msvcrt, and (2) a demoscene unpacker attempts to walk the dll headers in memory so I need actual dlls to exist in memory to appease them.)

Coincidentally another win32 emu I've been following just made a similar change
https://inuh.net/@[email protected]/112889440660822739

evmar · 2024-08-02T15:38:24Z

Thinking about this change, I would like to merge it in pieces to better understand the different parts. Do you mind if I pick up parts of it (e.g. "remove snapshotting") separately?

encounter · 2024-08-02T21:27:29Z

Do you mind if I pick up parts of it (e.g. "remove snapshotting") separately?

Go for it! I'll rebase accordingly once I get time to work on it again.

evmar · 2024-08-04T17:41:29Z

1129538 removes all the snapshotting, including the signal handler and web UI for it

evmar · 2024-08-04T22:58:07Z

I copied your change to add Handler::Async, and one thing I notice about it is we end up with two Boxed futures per async call. The first basically holds the decoded stack args, and then the second calls the first and updates state based on its result. The previous code had only one boxed future because it bundled those two things together. I'm not sure there's a good way to resolve this.

#31 -- my attempt

encounter · 2024-08-05T01:33:49Z

If handle_shim_call was always async, one Boxed future could be avoided. It’s only used to conditionally return a Future. Maybe it would work to make that function return Option<impl Future> instead?

edit: I realized that wouldn’t allow us to store it in a Vec, unless we converted it to a custom Future implementation to make it a concrete type.
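The "custom Future implementation" idea can be sketched as follows. Everything here is hypothetical (the `ShimFuture` type, its fields, and this simplified `handle_shim_call` signature are illustrative, not code from the PR): because the future is a named struct rather than an anonymous `impl Future`, it can sit directly in a `Vec` without a `Box`.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

/// Hypothetical concrete future: ready with `value` after `polls_left` more polls.
pub struct ShimFuture {
    polls_left: u32,
    value: u32,
}

impl Future for ShimFuture {
    type Output = u32;
    fn poll(mut self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<u32> {
        if self.polls_left == 0 {
            Poll::Ready(self.value)
        } else {
            self.polls_left -= 1;
            cx.waker().wake_by_ref();
            Poll::Pending
        }
    }
}

/// Waker that does nothing; enough for a loop that polls unconditionally.
struct NoopWake;
impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

/// A sync shim yields None; an async one yields a nameable future type.
pub fn handle_shim_call(is_async: bool) -> Option<ShimFuture> {
    is_async.then(|| ShimFuture { polls_left: 2, value: 7 })
}

/// Stores the future unboxed in a Vec and polls it until it completes.
/// Returns (result, number of polls).
pub fn run_to_completion() -> (u32, u32) {
    let mut pending: Vec<ShimFuture> = Vec::new();
    pending.extend(handle_shim_call(true));
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut polls = 0;
    loop {
        polls += 1;
        let fut = pending.last_mut().expect("one pending future");
        let poll = Pin::new(fut).poll(&mut cx);
        if let Poll::Ready(value) = poll {
            pending.pop();
            return (value, polls);
        }
    }
}
```

The catch is that a single concrete type would have to cover every async shim, which is why this only works if the state machine is written by hand or wrapped in an enum.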

evmar · 2024-08-05T05:19:42Z

Hm yeah, I had a similar thought. Unfortunately it's put in a Vec stored per-CPU, and the CPU layer doesn't know about shims, hrmm.

encounter · 2024-08-07T02:45:28Z

In 5c92c7d I was able to simplify the async shim handling. It pushes the responsibility for storing and polling the future up into the top-level event loop (cli or web, web updates in 6822158). machine.call_shim returns the BoxFuture<u32> from the shim directly, instead of wrapping it in another future for updating the registers. The event loop stores this future, polls it, and calls finish_shim_call when complete, which will then run the machine-specific register updates.

x86-emu and x86-unicorn only have to return StopReason::Blocked when their EIP=MAGIC_ADDR, which tells the outer event loop to perform future polling. This simplifies their implementations quite a bit.
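A rough, self-contained sketch of that division of labor, assuming a toy `Machine` in place of the real backends. The `MAGIC_ADDR` value, the local `BoxFuture` alias, the `StopReason` variants, and the fake `run` logic are all illustrative assumptions; only the names `call_shim`, `finish_shim_call`, and `StopReason::Blocked` come from the description above.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

/// Local alias for illustration; the real code may use a different definition.
type BoxFuture<T> = Pin<Box<dyn Future<Output = T>>>;

const MAGIC_ADDR: u32 = 0xFFFF_FFF0; // hypothetical value

pub enum StopReason {
    Blocked,
    ShimCall,
    Exit,
}

pub struct Machine {
    pub eip: u32,
    pub eax: u32,
    resume_eip: u32,
}

impl Machine {
    pub fn new() -> Self {
        Machine { eip: 0x1000, eax: 0, resume_eip: 0 }
    }

    /// Toy "CPU": 0x1000 is a shim call site, MAGIC_ADDR means blocked.
    pub fn run(&mut self) -> StopReason {
        match self.eip {
            MAGIC_ADDR => StopReason::Blocked,
            0x1000 => StopReason::ShimCall,
            _ => StopReason::Exit,
        }
    }

    /// Starts the shim and parks the CPU at MAGIC_ADDR until it completes.
    pub fn call_shim(&mut self) -> BoxFuture<u32> {
        self.resume_eip = self.eip + 4;
        self.eip = MAGIC_ADDR;
        Box::pin(async { 42u32 })
    }

    /// Machine-specific register updates once the future has resolved.
    pub fn finish_shim_call(&mut self, ret: u32) {
        self.eax = ret;
        self.eip = self.resume_eip;
    }
}

struct NoopWake;
impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

/// Top-level loop: owns the futures, polls on Blocked, returns final eax.
pub fn event_loop(machine: &mut Machine) -> u32 {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut shim_calls: Vec<BoxFuture<u32>> = Vec::new();
    loop {
        match machine.run() {
            StopReason::ShimCall => shim_calls.push(machine.call_shim()),
            StopReason::Blocked => {
                let fut = shim_calls.last_mut().expect("blocked without a future");
                let poll = fut.as_mut().poll(&mut cx);
                if let Poll::Ready(ret) = poll {
                    shim_calls.pop();
                    machine.finish_shim_call(ret);
                }
            }
            StopReason::Exit => return machine.eax,
        }
    }
}
```

In this arrangement the machine never touches a `Future` after handing it out, so the backend only needs the one `eip == MAGIC_ADDR` check.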

evmar · 2024-08-07T06:23:46Z

I haven't had a chance to look at this yet, but I wanted to note my pending builtins-as-dlls work lets us eliminate some of the post-shim code: 60b256e

basically the new flow is that the original binary does some call [SomeFn] which shows up at

SomeFn:
   syscall  ; causes Rust impl to be invoked
   ret 12   ; the number here is stack_consumed

The only thing the shim implementation code needs to manage is taking the return value of the Rust fn and putting it into eax.

I haven't quite figured it out yet but I am pretty sure that async fns will benefit similarly.

Now that I've typed this out, maybe the eax handling means this doesn't win anything, hrm.

evmar · 2024-08-07T17:43:08Z

New idea: make the futures Vec a Vec of Future<Option<u32>>, where a present value means "put this in eax when done". Coupled with my other change that removes the other code that needs to run after an async block I think that is enough?
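One way to read this idea, as a hedged sketch: each stored future resolves to `Option<u32>`, and the poller writes `eax` only when the value is present. The `ShimFuture` alias, the `drain` helper, and the `demo` values are hypothetical names for illustration, not code from either branch.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

/// Hypothetical alias: Some(v) means "put v in eax when done".
type ShimFuture = Pin<Box<dyn Future<Output = Option<u32>>>>;

struct NoopWake;
impl Wake for NoopWake {
    fn wake(self: Arc<Self>) {}
}

/// Polls the most recent future until the Vec is empty.
/// Returns the final eax and how many times eax was written.
pub fn drain(mut futures: Vec<ShimFuture>, mut eax: u32) -> (u32, usize) {
    let waker = Waker::from(Arc::new(NoopWake));
    let mut cx = Context::from_waker(&waker);
    let mut writes = 0;
    while let Some(fut) = futures.last_mut() {
        let poll = fut.as_mut().poll(&mut cx);
        if let Poll::Ready(result) = poll {
            futures.pop();
            // Only a present value touches eax; None leaves it alone.
            if let Some(value) = result {
                eax = value;
                writes += 1;
            }
        }
    }
    (eax, writes)
}

/// Two futures: one returns a value for eax, one has nothing to store.
pub fn demo() -> (u32, usize) {
    let mut futures: Vec<ShimFuture> = Vec::new();
    futures.push(Box::pin(async { Some(5u32) }));
    futures.push(Box::pin(async { None::<u32> }));
    drain(futures, 0xDEAD)
}
```

With this shape the poller needs no reference to the shim itself, only to the register it may have to update.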

encounter · 2024-08-07T17:54:20Z

Sounds good to me. I think that's similar to the solution I cooked up in the above commit:

struct AsyncShimCall {
  shim: &'static win32::shims::Shim,
  future: BoxFuture<u32>,
}
let mut shim_calls = Vec::<AsyncShimCall>::new();

// ...

match stop_reason {
  win32::StopReason::Blocked => {
    // Poll the last future.
    let shim_call = shim_calls.last_mut().unwrap();
    match shim_call.future.as_mut().poll(&mut ctx) {
      Poll::Ready(ret) => {
        target.machine.finish_shim_call(shim_call.shim, ret);

Except now finish_shim_call is only responsible for setting eax, instead of doing the eip and stack manipulation as well.

evmar · 2024-09-12T17:36:30Z

I am so so sorry this has taken me so long! I have merged pieces of it and I am working on the main debugger part, but it also managed to collide horribly with this dll change I've also been working on for a long time, so it's been kind of a three-way train wreck between your change, my change, and also my personal life stuff.

evmar · 2024-09-12T17:40:00Z

cli/src/debugger.rs

+    Ok(stream)
+}
+
+pub type StateMachine<'a> = GdbStubStateMachine<'a, MachineTarget, std::net::TcpStream>;


I am trying to follow why you went with implementing this StateMachine type rather than using the simpler event loop from the gdbstub examples. Can you explain a bit?

I would have expected something more like this
https://github.com/daniel5151/gdbstub/blob/6e72c26211515bf15b8a462d195f6ccf418f4c92/examples/armv4t/main.rs#L95

evmar reviewed Aug 1, 2024


encounter force-pushed the gdb-stub branch from e1e1c5e to 6822158 Compare August 7, 2024 02:37

encounter force-pushed the gdb-stub branch 2 times, most recently from 7265fd6 to 25adc6f Compare August 13, 2024 01:58

encounter changed the title ~~WIP GDB stub~~ GDB stub and async shim rework Aug 13, 2024

encounter marked this pull request as ready for review August 13, 2024 01:59

encounter added 7 commits August 12, 2024 20:12

GDB stub

9413eec

Fix web build & minor cleanup

419158a

web: Fix bug causing infinite CPU loop

244a272

Simplify and unify async shim handling

beed984

web: Updates for futures rework

7bf71cf

x86-64: Updates for futures rework

b87da6d

Restore --trace-points

e0f2f65

encounter force-pushed the gdb-stub branch from 25adc6f to e0f2f65 Compare August 13, 2024 02:13

evmar reviewed Sep 12, 2024


evmar force-pushed the main branch from 5d8b6fb to d08edb3 Compare October 6, 2024 20:23
