home

Runtime Duck Typing in C++

C⁠+⁠+ is generally considered a statically typed language. Virtual functions and templates blur the lines a bit, but you generally still need to know how all your types are laid out at compile time. So what if you didn’t?

In the previous post, I went over how Unreal Engine’s core class’ layouts have changed over time, and how the Borderlands series’ modding sdk has historically handled these changes. In the sdk code, we don’t actually particularly care about the exact object layout, all we care about is that a field of the given name and type exists. We can consider the different versions as different types, and we want to swap between them using some form of duck typing.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
namespace bl2 {

class UClass : public UStruct { 
    uint8_t UnknownData00[0xCC];
    UObject* ClassDefaultObject;
    uint8_t UnknownData01[0x48];
    TArray<FImplementedInterface> Interfaces;
};

}

namespace tps {

class UClass : public UStruct { 
    uint8_t UnknownData00[0xCC];
    UObject* ClassDefaultObject;
    uint8_t UnknownData01[0x14];  // <-- different size
    TArray<FImplementedInterface> Interfaces;
};

}

bool UClass::implements(const UClass* interface) const {
    return std::ranges::any_of(this->Interfaces, // <-- pick the right one
                               [&](auto x) { return x.Class == interface; });
}

Now before continuing let’s quickly answer why the obvious approaches won’t work.

What worked

So to start with, let’s go over what aspects of the historical solutions discussed in the previous post worked well, and should be incorporated into the new one.

Defining types using classes and inheritance

It feels a bit stupid pointing this one out when it’s so obvious, but I have to. If I want to define a class, it’s nice defining it… as a class. I don’t want to have to fill any big tables of magic numbers for each field.

As a specific example, in the previous post we went over UProperty::read_field, which took member pointers instead of relying on magic numbers - it even worked if you inherited through multiple types.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
class UObjectProperty : public UProperty {
   private:
    UClass* PropertyClass;

   public:
    UClass* get_property_class(void) const {
        return this->read_field(&UObjectProperty::PropertyClass);
    }
};

class UClassProperty : public UObjectProperty {
   private:
    UClass* MetaClass;

   public:
    UClass* get_meta_class(void) const {
        return this->read_field(&UClassProperty::MetaClass);
    }
};

Returning references to members

There were a small handful of places where the sdk needed to set a field whose offset changed dynamically. Turning these into functions returning a reference lead to really nice semantics for calling code - it’s essentially the closest C⁠+⁠+ can get to a property.

1
this->func->FunctionFlags() |= UFunction::FUNC_NATIVE;

While it’s a lesser consideration, this also makes porting code quite easy, just add a pair of brackets. Clang in fact detects this exact error case.

1
2
3
4
 /workspaces/unrealsdk/src/unrealsdk/game/bl3/console.cpp:166:19: error: reference to non-static member function must be called; did you mean to call it with no arguments?
  166 |         viewport->Class->find_prop_and_validate<UObjectProperty>(L"ViewportConsole"_fn);
      |         ~~~~~~~~~~^~~~~
      |                        ()

This project was always going to require a major version bump, a breaking change is ok, it’s just nice it’s one relatively easy to fix.

Wrapper types

For the cases where the internal unreal types where completely different, we used wrapper types, so that all calling code used a common interface.

1
2
3
4
5
6
7
8
9
10
11
12
13
class GObjects {
   public:
    struct Iterator;

    [[nodiscard]] size_t size(void) const;
    [[nodiscard]] UObject* obj_at(size_t idx) const;
    
    [[nodiscard]] Iterator begin(void) const;
    [[nodiscard]] static Iterator end(void);

    [[nodiscard]] UObject* get_weak_object(const FWeakObjectPtr* ptr) const;
    void set_weak_object(FWeakObjectPtr* ptr, const UObject* obj) const;
};

Since these are our own types, which we’re never going to hand back to the engine, we can actually get away with using virtual functions here, if required.

Putting it together

Basically all the historical ways of dealing with these different object layouts boiled down to working out what offset the field is supposed to be at, and then manually doing the pointer maths. If we want to extend this system to cover every field, the obvious first idea is to just store every field’s offset.

After a bunch of experimenting, I came up with the following.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
using offset_type = uint16_t;

template <typename From, typename To>
using copy_cv = std::conditional_t<
    std::is_const_v<From>,
    std::add_const_t<std::conditional_t<std::is_volatile_v<From>, std::add_volatile_t<To>, To>>,
    /* align      */ std::conditional_t<std::is_volatile_v<From>, std::add_volatile_t<To>, To>>;

class UObject {
   public:
    struct Offsets {
        offset_type Class;
        offset_type Name;
        
        template <typename T>
        static constexpr Offsets from() {
            static_assert(offsetof(T, Class) < std::numeric_limits<offset_type>::max());
            static_assert(offsetof(T, Name) < std::numeric_limits<offset_type>::max());
            
            return {
                offsetof(T, Class),
                offsetof(T, Name),
            };
        }

        static offset_type get(offset_type Offsets::* field);
    };

    template <typename T>
    [[nodiscard]] copy_cv<T, UClass*>& Class(this T& self) {
        return *reinterpret_cast<copy_cv<T, UClass*>*>(reinterpret_cast<uintptr_t>(&self)
                                                       + Offsets::get(&Offsets::Class));
    }
    template <typename T>
    [[nodiscard]] copy_cv<T, FName>& Name(this T& self) {
        return *reinterpret_cast<copy_cv<T, FName>*>(reinterpret_cast<uintptr_t>(&self)
                                                     + Offsets::get(&Offsets::Name));
    }
};

struct OffsetList {
    UObject::Offsets UObject;

    template <typename T>
    static constexpr OffsetList from(void) {
        return {
            UObject::Offsets::from<typename T::UObject>(),
        };
    }
};

In each type, we add an Offsets struct, holding each field we’re interested in. We also add a helper function to generate the struct based off of a type. We then also add a reference getter for each field, which gets the relevant offset and adds it to its pointer. These getters use C⁠+⁠+⁠23’s deducing this to automatically return a const reference on const pointers. We then create an OffsetList type holding all sets of offsets, along with a templated helper again.

Now in practice, to avoid copy paste errors, this is actually implemented using a number of X macros - reflection isn’t quite here yet, it would probably be cleaner. The actual code in each type’s header is a lot simpler.

1
2
3
4
5
6
7
8
#define UNREALSDK_UOBJECT_FIELDS(X)   \
    X(object_flags_type, ObjectFlags) \
    X(int32_t, InternalIndex)         \
    X(UClass*, Class)                 \
    X(FName, Name)                    \
    X(UObject*, Outer)

    UNREALSDK_DEFINE_FIELDS_HEADER(UObject, UNREALSDK_UOBJECT_FIELDS);

So the next question is how do we actually swap the offsets out? The sdk already used an AbstractHook type to swap handling of the engine functions it hooked. We can simply add on returning the current offset list to its responsibilities.

1
2
3
4
struct AbstractHook {
    // ...
    [[nodiscard]] virtual const OffsetList& get_offsets(void) const = 0;
};

Then each game can implement this by defining all its own types, and calling our templated helpers.

1
2
3
4
5
6
7
8
9
10
11
12
13
namespace {

struct OffsetClasses {
    using UObject = bl2::UObject;
};

const auto OFFSETS = OffsetList::from<OffsetClasses>();

}  // namespace

[[nodiscard]] const OffsetList& BL2Hook::get_offsets(void) const {
    return OFFSETS;
}

And finally, we can implement Offsets::get with the following. This is it’s own function, implemented in the source file, mostly just to avoid recursive include issues.

1
2
3
offset_type UObject::Offsets::get(offset_type UObject::Offsets::* field) { 
    return hook_instance->get_offsets().UObject.*field;
}

Handling subclasses

So one annoyance with the above approach is to do with handling subclasses. Some subclasses are relatively trivial, and have remained identical across all games.

1
2
3
4
class UField : public UObject {
   public:
    UField* Next;
};

However, if the parent class has changed, that means even though the child class’s definition hasn’t changed, the offsets of its fields certainly have. A parent class changing forces you to redefine all its children. And UObject is the parent class, all other unreal object types inherit from it. As of writing this the sdk currently includes 39 of its subclasses. We’d prefer not to need to redeclare every other class if they’re unchanged.

Instead, we can define a templated generic class, which inherits from the game-specific class.

1
2
3
4
5
6
7
8
9
10
11
12
13
14
namespace generic {

template <typename T>
class UField : public T {
   public:
    UField* Next;
};

}  // namespace generic

struct OffsetClasses {
    using UObject = bl2::UObject;
    using UField = generic::UField<bl2::UObject>;
};

Downsides

Now, while they sounded good in theory, as I was implementing the previous techniques, I came across a few small downsides.

The first, and most obvious one: changing all members to functions is a breaking change, that requires going through the entire codebase to fix every reference. I knew this, I pointed it out earlier, but it was still quite annoying.

A more “real” problem is that turning the members into functions means you don’t get to see them while debugging anymore. If you need to see a specific field, you can manually call it’s function, it doesn’t have side effects, but you just don’t get to see a listing of all an object’s members. You can also manually cast to the game-specific version of the object, though that’s normally more typing.

But my biggest concern was performance. Obviously, adding any sort of dynamic typing will always have worse performance than being able to read from a static offset. And the sdk already uses slow runtime lookups for most unreal properties - but those lookups were exactly why I was worried, I didn’t want to make them too much worse. They work by iterating through several linked lists, and comparing each object’s name. With these changes, at minimum each iteration involves two virtual function calls (next pointer + name), and a bit of pointer maths - and that’s assuming perfect link time optimization.

To test this, I ported UObject, UField, UStruct, and UProperty to the new system, then upgraded pyunrealsdk to support it. This is the main way the sdk is actually used, from it’s Python bindings, so better to benchmark there. I then manually ran the following console commands from the main menu of BL3:

1
2
3
4
5
py import timeit
py from mods_base import get_pc
py pc = get_pc()
py print(timeit.timeit(lambda: pc.OakCharacter, number=1000000))
py print(timeit.timeit(lambda: pc.ExecuteUbergraph, number=1000000))

pc was a BP_MenuPlayerController_C. Finding ExecuteUbergraph on this class iterates though 7 classes and 948 total fields. This is pretty much the worst case I could find. OakCharacter on the other hand is the best case, it’s the very first field.

  Static typed Duck typed % slower
OakCharacter 1.507s 1.582s 5.0%
ExecuteUbergraph 4.897s 5.048s 3.1%

Seems acceptable enough, guess it’s not a problem.

An alternative approach

I did come up with a few concepts for an alternative approach while I was working on this. Since the existing one is good enough, I haven’t tried implementing it, there are still a number of open questions, and I imagine it would probably be a lot more complex. But I figure it’s worth putting the idea out there.

Essentially, I imagine the “core” unreal types wouldn’t have any members, and you’d instead be forced to convert them to a “concrete”, game specific, type before you can access them. A templated lambda could be used to automatically handle the different object layouts - the implementation would call it with a different templated type for each set of layouts.

1
2
3
unrealsdk::to_concrete([]<typename T>(T::UObject* obj) {
    std::print("Object Name: {}", obj->Name);
}, obj);

This is making use of how templates are (compile-time) ducked typed to begin with, and then just adding a runtime switch at the front.

Looking back at the downsides of the current approach, while it doesn’t fully solve any of them, this approach would help with all. You wouldn’t need to update every single member access, but you would still need to wrap larger code blocks. When debugging, the core types would still be unusable, but in cases you want to see the members, it’s likely you already converted your objects to a concrete type, and within the lambda you could see all their members. And for performance, while we’ll still need to check which objects layouts to use once, at the start, it would only be that once, the code within each lambda should compile similarly to before.

Now I like I said, there’s still a number of open questions to this I haven’t investigated.

© 2025    •  Theme  Moonwalk