MABlockClosure 源码导览

文章發布時間 2011年5月6日

作者 TommyWu

標籤

译文 · 原文： Friday Q&A 2011-05-06: A Tour of MABlockClosure · 作者 Mike Ash

原文：https://www.mikeash.com/pyblog/friday-qa-2011-05-06-a-tour-of-mablockclosure.html 发布：2011-05-06　作者：Mike Ash 译者：MiMo（mimo-v2.5-pro）；代码块保留英文原样

迟到了一周，但最新一期的 Friday Q & A 终于来了。大约一年前，我曾撰写过关于在运行时构建代码来将 block（代码块）转换为函数指针的文章。那是一次有趣的尝试，但由于各种限制最终不太实用。在此期间，我编写了 MABlockClosure，这是一种更健壮、更实用的实现方式，但我从未发表过相关文章。Landon Fuller 建议我讨论一下它的工作原理，因此这就是我今天要讲述的内容。

回顾

Block 是一种极其有用的语言特性，原因有二：它们允许在其它代码中内联编写匿名函数，并且可以通过引用其所在作用域中的局部变量来捕获该作用域的上下文。除此之外，这使得回调模式（callback patterns）变得简单得多。

1
    struct CallbackContext
2
    {
3
        NSString *title;
4
        int value;
5
    };
6

7
    static void MyCallback(id result, void *contextVoid)
8
    {
9
        struct CallbackContext *context = contextVoid;
10
        // use result, context->title, and context->value
11
    }
12

13
    struct CallbackContext ctx;
14
    ctx.title = [self title];
15
    ctx.value = [self value];
16
    CallAPIWithCallback(workToDo, MyCallback, &ctx;);

1
    CallAPIWithCallbackBlock(workToDo, ^(id result) {
2
        // use result, [self title], [self value]
3
    });

问题在于，并非所有基于回调的 API 都有接受 block（块）的版本。MABlockClosure 和我早期实验性的 trampoline 代码所实现的功能，是将一个 block 转换为可传递给这类 API 的函数指针。例如，如果不存在CallAPIWithCallbackBlock，MABlockClosure 允许你编写出几乎同样优雅的代码：

1
    CallAPIWithCallback(workToDo, BlockFptrAuto(^(id result) {
2
        // use result, [self title], [self value]
3
    }));

Blocks 会编译成一个函数和几个结构体。函数包含代码，结构体则保存块的信息，包括捕获的上下文（captured context）。该函数包含一个隐式参数（implicit argument），与 Objective-C 方法的 self 参数类似，指向块结构体。上面的块会被翻译成类似这样的形式：

1
    void BlockImpl(struct BlockStruct *block, id info)
2
    {
3
        // code goes here
4
    }

我最初尝试用一小段汇编代码实现跳板函数（trampoline）。这段代码试图以通用方式移动参数，然后在开头插入指针。遗憾的是，对于所有情况根本无法用同一段代码实现，最终导致了许多恼人的限制。

在当时，这已经是能实现的最佳方案了。幸运的是，苹果后来为块（block）添加了类型元数据（type metadata）。只要使用的编译器足够新以生成此元数据（任何较新的 clang 都能满足），就能用它来生成能进行适当参数操作的智能跳板代码。

libffi（外部函数接口）尽管块类型元数据提供了执行必要参数转换所需的全部信息，但这仍然是一项极其复杂的任务。具体需要执行的操作很大程度上取决于代码运行所在特定架构的函数调用 ABI（应用程序二进制接口），以及涉及的具体参数类型。

如果这一切都需要我亲自来实现，那么我将永远无法投入如此巨大的精力。好消息是，已经有一个现成的库知道如何为大量不同的架构处理所有这些：libffi。

libffi 提供了两大主要功能。它最著名的能力是能够以任意参数调用任意函数，而这些参数的类型直到运行时（runtime）才为人所知。一个较少被提及的功能则提供了本质上相反的作用：它允许创建「闭包（closure）」，即运行时生成的函数，这些函数能捕获任意参数，而参数类型同样是直到运行时才确定。

后者正是我们生成 block 的蹦床函数（trampoline function）所需要的。它以一种能够被 C 代码操纵的形式来捕获参数。然后，该代码可以根据需要操纵这些参数，并使用前一个功能来调用 block 的实现指针。

Support Structures（支持结构体）

block 结构体的布局并未在任何公开的头文件中披露。然而，由于这些结构体在编译时会被固化到可执行文件中，我们可以安全地从规范中提取它们，并依赖于与之匹配的实现。

以下是相关结构体：

1
    struct BlockDescriptor
2
    {
3
        unsigned long reserved;
4
        unsigned long size;
5
        void *rest[1];
6
    };
7

8
    struct Block
9
    {
10
        void *isa;
11
        int flags;
12
        int reserved;
13
        void *invoke;
14
        struct BlockDescriptor *descriptor;
15
    };

1
    static void *BlockImpl(id block)
2
    {
3
        return ((struct Block *)block)->invoke;
4
    }

1
    static const char *BlockSig(id blockObj)
2
    {
3
        struct Block *block = (void *)blockObj;
4
        struct BlockDescriptor *descriptor = block->descriptor;
5

6
        int copyDisposeFlag = 1 << 25;
7
        int signatureFlag = 1 << 30;
8

9
        assert(block->flags & signatureFlag);
10

11
        int index = 0;
12
        if(block->flags & copyDisposeFlag)
13
            index += 2;
14

15
        return descriptor->rest[index];
16
    }

很多必要的 libffi 数据结构需要根据类型签名动态创建。手动管理这些内存会很繁琐。既然这些数据结构的生命周期与闭包对象本身绑定，最简单的处理方式就是在对象内部追踪分配记录。为此我使用了一个 NSMutableArray。当需要分配内存时，我会创建合适大小的 NSMutableData，将其加入这个数组，然后返回它的 mutableBytes 指针。这个数组是该类的第一个实例变量：

1
    @interface MABlockClosure : NSObject
2
    {
3
        NSMutableArray *_allocations;

1
        ffi_cif _closureCIF;
2
        ffi_cif _innerCIF;
3
        int _closureArgCount;

1
        ffi_closure *_closure;
2
        void *_closureFptr;
3
        id _block;
4
    }

1
    - (id)initWithBlock: (id)block;
2

3
    - (void *)fptr;
4

5
    @end

1
    - (void *)fptr
2
    {
3
        return _closureFptr;
4
    }

1
    - (id)initWithBlock: (id)block
2
    {
3
        if((self = [self init]))
4
        {
5
            _allocations = [[NSMutableArray alloc] init];
6
            _block = block;
7
            _closure = AllocateClosure(&_closureFptr);
8
            [self _prepClosureCIF];
9
            [self _prepInnerCIF];
10
            [self _prepClosure];
11
        }
12
        return self;
13
    }

新版本的 libffi 将所有这些操作封装在 allocate（分配）、prepare（准备）和 deallocate（释放）闭包的调用中。如果你从源码构建 libffi，你会获得这种封装方式，在 iOS 上也是如此。MABlockClosure 被设计为能同时处理这两种方式。

AllocateClosure 函数使用条件编译来决定采用哪种技术。如果设置了 USE_LIBFFI_CLOSURE_ALLOC，它就直接调用 libffi 的相应函数。否则，它使用 mmap 来分配内存，这可以确保内存对齐正确，并且之后可以被标记为可执行。该函数如下所示：

1
    static void *AllocateClosure(void **codePtr)
2
    {
3
    #if USE_LIBFFI_CLOSURE_ALLOC
4
        return ffi_closure_alloc(sizeof(ffi_closure), codePtr);
5
    #else
6
        ffi_closure *closure = mmap(NULL, sizeof(ffi_closure), PROT_READ | PROT_WRITE, MAP_ANON | MAP_PRIVATE, -1, 0);
7
        if(closure == (void *)-1)
8
        {
9
            perror("mmap");
10
            return NULL;
11
        }
12
        *codePtr = closure;
13
        return closure;
14
    #endif
15
    }

1
    static void DeallocateClosure(void *closure)
2
    {
3
    #if USE_LIBFFI_CLOSURE_ALLOC
4
        ffi_closure_free(closure);
5
    #else
6
        munmap(closure, sizeof(ffi_closure));
7
    #endif
8
    }

由 -initWithBlock: 调用的两个预备方法（prep methods）只是用略有不同的参数来调用同一个公共方法（common method）：

1
    - (void)_prepClosureCIF
2
    {
3
        _closureArgCount = [self _prepCIF: &_closureCIF withEncodeString: BlockSig(_block) skipArg: YES];
4
    }
5

6
    - (void)_prepInnerCIF
7
    {
8
        [self _prepCIF: &_innerCIF withEncodeString: BlockSig(_block) skipArg: NO];
9
    }

-_prepCIF:withEncodeString:skipArg: 方法会进而调用另一个方法，由该方法真正完成将 @encode 字符串转换为 ffi_type 数组的工作。随后，该方法会根据需要跳过第一个参数，并调用 ffi_prep_cif 来填充 ffi_cif 结构体：

1
    - (int)_prepCIF: (ffi_cif *)cif withEncodeString: (const char *)str skipArg: (BOOL)skip
2
    {
3
        int argCount;
4
        ffi_type **argTypes = [self _argsWithEncodeString: str getCount: &argCount;];
5

6
        if(skip)
7
        {
8
            argTypes++;
9
            argCount--;
10
        }
11

12
        ffi_status status = ffi_prep_cif(cif, FFI_DEFAULT_ABI, argCount, [self _ffiArgForEncode: str], argTypes);
13
        if(status != FFI_OK)
14
        {
15
            NSLog(@"Got result %ld from ffi_prep_cif", (long)status);
16
            abort();
17
        }
18

19
        return argCount;
20
    }

Foundation 提供了一个便捷的函数 NSGetSizeAndAlignment，在解析这类字符串时非常有用。当传入一个 @encode 字符串时，它会返回该字符串中第一个类型的大小和对齐方式，并返回一个指向下一个类型的指针。理论上，我们只需在循环中调用此函数，就能遍历一个 block 签名中的所有类型。

实际上，这里有个复杂情况。出于我从未查明的原因，方法签名（method signatures，也因此包括 block 签名）中，各个类型编码之间会夹杂一些数字。NSGetSizeAndAlignment 对这些数字一无所知，因此需要适当调整才能正确解析这类字符串。我编写了一个辅助函数，它调用 NSGetSizeAndAlignment 后，会跳过在类型字符串之后找到的任何数字：

1
    static const char *SizeAndAlignment(const char *str, NSUInteger *sizep, NSUInteger *alignp, int *len)
2
    {
3
        const char *out = NSGetSizeAndAlignment(str, sizep, alignp);
4
        if(len)
5
            *len = out - str;
6
        while(isdigit(*out))
7
            out++;
8
        return out;
9
    }

1
    static int ArgCount(const char *str)
2
    {
3
        int argcount = -1; // return type is the first one
4
        while(str && *str)
5
        {
6
            str = SizeAndAlignment(str, NULL, NULL, NULL);
7
            argcount++;
8
        }
9
        return argcount;
10
    }

1
    - (ffi_type **)_argsWithEncodeString: (const char *)str getCount: (int *)outCount
2
    {
3
        int argCount = ArgCount(str);
4
        ffi_type **argTypes = [self _allocate: argCount * sizeof(*argTypes)];

1
        int i = -1;
2
        while(str && *str)
3
        {
4
            const char *next = SizeAndAlignment(str, NULL, NULL, NULL);
5
            if(i >= 0)
6
                argTypes[i] = [self _ffiArgForEncode: str];
7
            i++;
8
            str = next;
9
        }

1
        *outCount = argCount;
2

3
        return argTypes;
4
    }

1
    - (ffi_type *)_ffiArgForEncode: (const char *)str
2
    {

libffi（外部函数接口库）按大小区分整数类型，并没有与 int 或 long 直接对应的类型。为帮助我在这两者之间进行转换，我构建了一些宏。（事实证明 libffi 也为此内置了一些宏，比如 ffi_type_sint 这样的 #define 会映射到正确的基础 ffi_type。我在编写代码时并不知道这些，因此我的方法比必要情况稍显迂回。）

如前所述，基本类型在 @encode 中用单个字符表示。为避免硬编码任何字符值，我使用类似 @encode(type)[0] 的表达式来获取该单个字符。如果它等于 str[0]，则说明字符串编码的就是该基本类型。

我的有符号整数宏首先执行此检查以判断类型是否匹配。若匹配，它会使用 sizeof(type) 来确定所讨论的整数类型的大小，并返回与之匹配的相应 ffi_type *（指向类型描述结构的指针）。宏的示例如下：

1
        #define SINT(type) do { \
2
            if(str[0] == @encode(type)[0]) \
3
            { \
4
               if(sizeof(type) == 1) \
5
                   return &ffi;_type_sint8; \
6
               else if(sizeof(type) == 2) \
7
                   return &ffi;_type_sint16; \
8
               else if(sizeof(type) == 4) \
9
                   return &ffi;_type_sint32; \
10
               else if(sizeof(type) == 8) \
11
                   return &ffi;_type_sint64; \
12
               else \
13
               { \
14
                   NSLog(@"Unknown size for type %s", #type); \
15
                   abort(); \
16
               } \
17
            } \
18
        } while(0)

1
        #define UINT(type) do { \
2
            if(str[0] == @encode(type)[0]) \
3
            { \
4
               if(sizeof(type) == 1) \
5
                   return &ffi;_type_uint8; \
6
               else if(sizeof(type) == 2) \
7
                   return &ffi;_type_uint16; \
8
               else if(sizeof(type) == 4) \
9
                   return &ffi;_type_uint32; \
10
               else if(sizeof(type) == 8) \
11
                   return &ffi;_type_uint64; \
12
               else \
13
               { \
14
                   NSLog(@"Unknown size for type %s", #type); \
15
                   abort(); \
16
               } \
17
            } \
18
        } while(0)

作为对整数宏的补充，这里有一个快捷方式，它接受一个整数类型，然后生成检查该类型有符号与无符号变体的代码：

1
        #define INT(type) do { \
2
            SINT(type); \
3
            UINT(unsigned type); \
4
        } while(0)

1
        #define COND(type, name) do { \
2
            if(str[0] == @encode(type)[0]) \
3
                return &ffi_type_ ## name; \
4
        } while(0)

1
        #define PTR(type) COND(type, pointer)

理论上，可以通过解析 @encode 字符串中的结构体定义并构建匹配的 ffi_type（libffi 类型描述符）来支持任意结构体。但在实践中，这既困难又容易出错 ——@encode 格式本身并不友好。对于处理大多数情况而言，只需翻译一小部分结构体。这些结构体可通过简单的字符串比较来识别，无需解析 @encode 字符串，随后只需为 libffi 提供一个硬编码的类型列表即可。虽然此方法无法覆盖所有场景，但若遇到未知结构体可立即中止并提示，且便于添加新类型，这使得开发者能快速修复可能遇到的缺陷。

最后一个宏用于处理结构体。它接收一个结构体类型及对应的 ffi_types 列表。若 @encode 匹配，则为该结构体创建一个 ffi_type，根据传入的参数填充其元素字段，并返回该类型：

1
        #define STRUCT(structType, ...) do { \
2
            if(strncmp(str, @encode(structType), strlen(@encode(structType))) == 0) \
3
            { \
4
               ffi_type *elementsLocal[] = { __VA_ARGS__, NULL }; \
5
               ffi_type **elements = [self _allocate: sizeof(elementsLocal)]; \
6
               memcpy(elements, elementsLocal, sizeof(elementsLocal)); \
7
                \
8
               ffi_type *structType = [self _allocate: sizeof(*structType)]; \
9
               structType->type = FFI_TYPE_STRUCT; \
10
               structType->elements = elements; \
11
               return structType; \
12
            } \
13
        } while(0)

1
        SINT(_Bool);
2
        SINT(signed char);
3
        UINT(unsigned char);
4
        INT(short);
5
        INT(int);
6
        INT(long);
7
        INT(long long);

1
        PTR(id);
2
        PTR(Class);
3
        PTR(SEL);
4
        PTR(void *);
5
        PTR(char *);
6
        PTR(void (*)(void));

1
        COND(float, float);
2
        COND(double, double);
3

4
        COND(void, void);

现在处理基本类型的问题解决了，接下来是结构体。我只处理 CGRect、CGPoint、CGSize 及其对应的 NS 等价类型。如果需要的话，其他结构体也可以很容易地添加进来。

这些结构体的元素类型都是 CGFloat。CGFloat 的类型可以是 float 或 double，具体取决于平台。因此，首先要确定它属于哪一种，然后获取相应的 ffi_type：

1
        ffi_type *CGFloatFFI = sizeof(CGFloat) == sizeof(float) ? &ffi;_type_float : &ffi;_type_double;

1
        STRUCT(CGRect, CGFloatFFI, CGFloatFFI, CGFloatFFI, CGFloatFFI);
2
        STRUCT(CGPoint, CGFloatFFI, CGFloatFFI);
3
        STRUCT(CGSize, CGFloatFFI, CGFloatFFI);

1
    #if !TARGET_OS_IPHONE
2
        STRUCT(NSRect, CGFloatFFI, CGFloatFFI, CGFloatFFI, CGFloatFFI);
3
        STRUCT(NSPoint, CGFloatFFI, CGFloatFFI);
4
        STRUCT(NSSize, CGFloatFFI, CGFloatFFI);
5
    #endif

1
        NSLog(@"Unknown encode string %s", str);
2
        abort();
3
    }

当闭包（closure）被准备时，它需要三个关键的数据。其一是之前代码费力构建的类型信息（type information）。其二是以 libffi 格式接收参数的 C 函数。其三是传递给该 C 函数的上下文指针（context pointer）。正是这个上下文指针让所有魔法得以发生 —— 它使函数能够判断当前调用关联的是 MABlockClosure 的哪个实例，并最终调用到关联的代码块（block）。

与闭包分配和释放类似，闭包的准备方式取决于 libffi 正在运行的模式。如果 libffi 自行管理闭包内存分配，那么只需单次调用即可完成闭包准备。否则，需要先通过一次调用进行设置，再通过一次 mprotect 调用将内存标记为可执行。以下是 -_prepClosure 方法的实现：

1
    - (void)_prepClosure
2
    {
3
    #if USE_LIBFFI_CLOSURE_ALLOC
4
        ffi_status status = ffi_prep_closure_loc(_closure, &_closureCIF, BlockClosure, self, _closureFptr);
5
        if(status != FFI_OK)
6
        {
7
            NSLog(@"ffi_prep_closure returned %d", (int)status);
8
            abort();
9
        }
10
    #else
11
        ffi_status status = ffi_prep_closure(_closure, &_closureCIF, BlockClosure, self);
12
        if(status != FFI_OK)
13
        {
14
            NSLog(@"ffi_prep_closure returned %d", (int)status);
15
            abort();
16
        }
17

18
        if(mprotect(_closure, sizeof(_closure), PROT_READ | PROT_EXEC) == -1)
19
        {
20
            perror("mprotect");
21
            abort();
22
        }
23
    #endif
24
    }

1
    static void BlockClosure(ffi_cif *cif, void *ret, void **args, void *userdata)
2
    {
3
        MABlockClosure *self = userdata;

1
        int count = self->_closureArgCount;
2
        void **innerArgs = malloc((count + 1) * sizeof(*innerArgs));
3
        innerArgs[0] = &self-;>_block;
4
        memcpy(innerArgs + 1, args, count * sizeof(*args));

1
        ffi_call(&self-;>_innerCIF, BlockImpl(self->_block), ret, innerArgs);

1
        free(innerArgs);
2
    }

直接使用 MABlockClosure 稍显不便。我编写了两个便捷函数来简化这一操作。BlockFptr 函数会在代码块（block）本身上创建一个 MABlockClosure 实例作为关联对象（associated object）。这确保了只要代码块有效，函数指针就保持有效：

1
    void *BlockFptr(id block)
2
    {
3
        @synchronized(block)
4
        {
5
            MABlockClosure *closure = objc_getAssociatedObject(block, BlockFptr);
6
            if(!closure)
7
            {
8
                closure = [[MABlockClosure alloc] initWithBlock: block];
9
                objc_setAssociatedObject(block, BlockFptr, closure, OBJC_ASSOCIATION_RETAIN);
10
                [closure release]; // retained by the associated object assignment
11
            }
12
            return [closure fptr];
13
        }
14
    }

1
    void *BlockFptrAuto(id block)
2
    {
3
        return BlockFptr([[block copy] autorelease]);
4
    }

1
    int x = 42;
2
    void (*fptr)(void) = BlockFptrAuto(^{ NSLog(@"%d", x); });
3
    fptr(); // prints 42!

以上就是本期（延迟的）周五问答的全部内容。下期再见，届时我会继续分享更多探讨主题。一如既往，欢迎大家随时将想探讨的话题建议发送给我。

#Original (English)

Source: https://www.mikeash.com/pyblog/friday-qa-2011-05-06-a-tour-of-mablockclosure.html

It’s a week late, but it’s finally time for the latest edition of Friday Q&A. About a year ago, I wrote about converting blocks into function pointers by building code at runtime. This was an interesting exercise, but ultimately impractical due to various limitations. In the meantime, I wrote MABlockClosure, a more robust and usable way of doing the same thing, but I never posted about it. Landon Fuller suggest I discuss how it works, and so that is what I will talk about today.

Recap Blocks are an extremely useful language feature for two reasons: they allow writing anonymous functions inlined in other code, and they can capture context from the enclosing scope by referring to local variables from that scope. Among other things, this makes callback patterns much simpler. Instead of this:

1
    struct CallbackContext
2
    {
3
        NSString *title;
4
        int value;
5
    };
6

7
    static void MyCallback(id result, void *contextVoid)
8
    {
9
        struct CallbackContext *context = contextVoid;
10
        // use result, context->title, and context->value
11
    }
12

13
    struct CallbackContext ctx;
14
    ctx.title = [self title];
15
    ctx.value = [self value];
16
    CallAPIWithCallback(workToDo, MyCallback, &ctx;);

1
    CallAPIWithCallbackBlock(workToDo, ^(id result) {
2
        // use result, [self title], [self value]
3
    });

The problem is that not all callbacks-based APIs have versions that take blocks. What MABlockClosure and my older experimental trampoline code allow is converting a block to a function pointer that can be passed to one of these APIs. For example, if CallAPIWithCallbackBlock didn’t exist, MABlockClosure allows writing code that’s nearly as nice:

1
    CallAPIWithCallback(workToDo, BlockFptrAuto(^(id result) {
2
        // use result, [self title], [self value]
3
    }));

Blocks ABI Blocks compile down to a function and a couple of structs. The function holds the code, and the structs hold information about the block, including the captured context. The function contains an implicit argument, much like the self argument to Objective-C methods, which points to the block structure. The block above translates to something like this:

1
    void BlockImpl(struct BlockStruct *block, id info)
2
    {
3
        // code goes here
4
    }

My original attempt used a small bit of assembly code for the trampoline. This code tried to shift the arguments in a general fashion, and then insert the pointer at the front. Unfortunately, this really can’t be done by the same code for all cases, so it ended up with a lot of irritating restrictions.

At the time, this was about the best that could be done. Fortunately, Apple later added type metadata to blocks. As long as you’re using a compiler that’s recent enough to generate this metadata (any recent clang will do), this can be used to generate intelligent trampolines which do the appropriate argument manipulation.

libffi Although the block type metadata provides all of the necessary information needed to perform the necessary argument transformation, it’s still an extremely complicated undertaking. The exact nature of what needs to be done depends heavily on the function call ABI of the particular architecture the code is running on, and the particular argument types present.

If I had to do all of this myself, I never would have been able to put in the enormous effort required. The good news is that there is a library already built which knows how to handle all of this for a whole bunch of different architectures: libffi.

libffi provides two major facilities. It’s best known for the ability to call into an arbitrary function with arbitrary arguments whose types aren’t known until runtime. A lesser-known facility provides what is essentially the opposite: it allows creating “closures” which are runtime-generated functions which capture arbitrary arguments whose types aren’t known until runtime.

The latter is what we need to generate the trampoline function for the block. This captures the arguments in a form that can be manipulated from C code. That code can then manipulate the arguments as needed and use the former facility to call the block’s implementation pointer.

Support Structures The layout of a block structure is not in any published header. However, since these structures are baked into executables when they’re compiled, we can safely extract them from the specification and rely on that to match.

These are the structures in question:

1
    struct BlockDescriptor
2
    {
3
        unsigned long reserved;
4
        unsigned long size;
5
        void *rest[1];
6
    };
7

8
    struct Block
9
    {
10
        void *isa;
11
        int flags;
12
        int reserved;
13
        void *invoke;
14
        struct BlockDescriptor *descriptor;
15
    };

1
    static void *BlockImpl(id block)
2
    {
3
        return ((struct Block *)block)->invoke;
4
    }

1
    static const char *BlockSig(id blockObj)
2
    {
3
        struct Block *block = (void *)blockObj;
4
        struct BlockDescriptor *descriptor = block->descriptor;
5

6
        int copyDisposeFlag = 1 << 25;
7
        int signatureFlag = 1 << 30;
8

9
        assert(block->flags & signatureFlag);
10

11
        int index = 0;
12
        if(block->flags & copyDisposeFlag)
13
            index += 2;
14

15
        return descriptor->rest[index];
16
    }

A lot of the necessary libffi data structures have to be created dynamically depending on the type signature. Manually managing that memory gets irritating. Since their lifetime is tied to the life of the closure object itself, the simplest way to deal with this is to track allocations in the object. To do this, I have an NSMutableArray. When I need to allocate memory, I create an NSMutableData of the appropriate size, add it to this array, and then return its mutableBytes pointer. This array is the class’s first instance variable:

1
    @interface MABlockClosure : NSObject
2
    {
3
        NSMutableArray *_allocations;

1
        ffi_cif _closureCIF;
2
        ffi_cif _innerCIF;
3
        int _closureArgCount;

1
        ffi_closure *_closure;
2
        void *_closureFptr;
3
        id _block;
4
    }

1
    - (id)initWithBlock: (id)block;
2

3
    - (void *)fptr;
4

5
    @end

1
    - (void *)fptr
2
    {
3
        return _closureFptr;
4
    }

1
    - (id)initWithBlock: (id)block
2
    {
3
        if((self = [self init]))
4
        {
5
            _allocations = [[NSMutableArray alloc] init];
6
            _block = block;
7
            _closure = AllocateClosure(&_closureFptr);
8
            [self _prepClosureCIF];
9
            [self _prepInnerCIF];
10
            [self _prepClosure];
11
        }
12
        return self;
13
    }

Newer versions of libffi encapsulate all of this in calls to allocate, prepare, and deallocate closures. This is what you’ll get if you build libffi from source, and it’s what you can get on iOS. MABlockClosure is built to handle both ways.

The AllocateClosure function uses conditional compilation to decide which technique to use. If USE_LIBFFI_CLOSURE_ALLOC is set, it just calls through to libffi. Otherwise, it allocates the memory using mmap, which ensures that the memory is properly aligned and can later be marked executable. Here’s what that function looks like:

1
    static void *AllocateClosure(void **codePtr)
2
    {
3
    #if USE_LIBFFI_CLOSURE_ALLOC
4
        return ffi_closure_alloc(sizeof(ffi_closure), codePtr);
5
    #else
6
        ffi_closure *closure = mmap(NULL, sizeof(ffi_closure), PROT_READ | PROT_WRITE, MAP_ANON | MAP_PRIVATE, -1, 0);
7
        if(closure == (void *)-1)
8
        {
9
            perror("mmap");
10
            return NULL;
11
        }
12
        *codePtr = closure;
13
        return closure;
14
    #endif
15
    }

1
    static void DeallocateClosure(void *closure)
2
    {
3
    #if USE_LIBFFI_CLOSURE_ALLOC
4
        ffi_closure_free(closure);
5
    #else
6
        munmap(closure, sizeof(ffi_closure));
7
    #endif
8
    }

The two prep methods called by -initWithBlock: just call through to a single common method with slightly different arguments:

1
    - (void)_prepClosureCIF
2
    {
3
        _closureArgCount = [self _prepCIF: &_closureCIF withEncodeString: BlockSig(_block) skipArg: YES];
4
    }
5

6
    - (void)_prepInnerCIF
7
    {
8
        [self _prepCIF: &_innerCIF withEncodeString: BlockSig(_block) skipArg: NO];
9
    }

The -_prepCIF:withEncodeString:skipArg: method in turn calls through to another method which does the real work of the conversion of the @encode string to an array of ffi_type. It then skips over the first argument if needed, and calls ffi_prep_cif to fill out the ffi_cif struct:

1
    - (int)_prepCIF: (ffi_cif *)cif withEncodeString: (const char *)str skipArg: (BOOL)skip
2
    {
3
        int argCount;
4
        ffi_type **argTypes = [self _argsWithEncodeString: str getCount: &argCount;];
5

6
        if(skip)
7
        {
8
            argTypes++;
9
            argCount--;
10
        }
11

12
        ffi_status status = ffi_prep_cif(cif, FFI_DEFAULT_ABI, argCount, [self _ffiArgForEncode: str], argTypes);
13
        if(status != FFI_OK)
14
        {
15
            NSLog(@"Got result %ld from ffi_prep_cif", (long)status);
16
            abort();
17
        }
18

19
        return argCount;
20
    }

Foundation provides a handy function called NSGetSizeAndAlignment which helps a great deal when parsing these strings. When passed an @encode string, it returns the size and alignment of the first type in the string, and returns a pointer to the next type. In theory, we can iterate through the types in a block signature by just calling this function in a loop.

In practice, there’s a complication. For reasons I have never discovered, method signatures (and thus block signatures) have numbers in between the individual type encodings. NSGetSizeAndAlignment is clueless about these, so it needs a bit of help to correctly parse one of these strings. I wrote a small helper function which calls NSGetSizeAndAlignment and then skips over any digits it finds after the type string:

1
    static const char *SizeAndAlignment(const char *str, NSUInteger *sizep, NSUInteger *alignp, int *len)
2
    {
3
        const char *out = NSGetSizeAndAlignment(str, sizep, alignp);
4
        if(len)
5
            *len = out - str;
6
        while(isdigit(*out))
7
            out++;
8
        return out;
9
    }

1
    static int ArgCount(const char *str)
2
    {
3
        int argcount = -1; // return type is the first one
4
        while(str && *str)
5
        {
6
            str = SizeAndAlignment(str, NULL, NULL, NULL);
7
            argcount++;
8
        }
9
        return argcount;
10
    }

1
    - (ffi_type **)_argsWithEncodeString: (const char *)str getCount: (int *)outCount
2
    {
3
        int argCount = ArgCount(str);
4
        ffi_type **argTypes = [self _allocate: argCount * sizeof(*argTypes)];

1
        int i = -1;
2
        while(str && *str)
3
        {
4
            const char *next = SizeAndAlignment(str, NULL, NULL, NULL);
5
            if(i >= 0)
6
                argTypes[i] = [self _ffiArgForEncode: str];
7
            i++;
8
            str = next;
9
        }

1
        *outCount = argCount;
2

3
        return argTypes;
4
    }

1
    - (ffi_type *)_ffiArgForEncode: (const char *)str
2
    {

libffi differentiates integer types by size, and has no direct equivalent to int or long. To help me convert between the two, I built some macros. (It turns out that libffi built some macros for this as well. There are #defines like ffi_type_sint which map to the correct base ffi_type. I didn’t know about these when I wrote the code, so my method is slightly more roundabout than it needs to be.)

As I mentioned earlier, primitives are represented as single characters in an @encode. To avoid hardcoding any of those character values, I use an expression like @encode(type)[0] to get that single character. If this equals str[0], then that’s the primitive type encoded by the string.

My macro for signed integers first performs this check to see if the types match. If they do, it then uses sizeof(type) to figure out how big the integer type in question is and return the appropriate ffi_type * to match. Here’s what the macro looks like:

1
        #define SINT(type) do { \
2
            if(str[0] == @encode(type)[0]) \
3
            { \
4
               if(sizeof(type) == 1) \
5
                   return &ffi;_type_sint8; \
6
               else if(sizeof(type) == 2) \
7
                   return &ffi;_type_sint16; \
8
               else if(sizeof(type) == 4) \
9
                   return &ffi;_type_sint32; \
10
               else if(sizeof(type) == 8) \
11
                   return &ffi;_type_sint64; \
12
               else \
13
               { \
14
                   NSLog(@"Unknown size for type %s", #type); \
15
                   abort(); \
16
               } \
17
            } \
18
        } while(0)

1
        #define UINT(type) do { \
2
            if(str[0] == @encode(type)[0]) \
3
            { \
4
               if(sizeof(type) == 1) \
5
                   return &ffi;_type_uint8; \
6
               else if(sizeof(type) == 2) \
7
                   return &ffi;_type_uint16; \
8
               else if(sizeof(type) == 4) \
9
                   return &ffi;_type_uint32; \
10
               else if(sizeof(type) == 8) \
11
                   return &ffi;_type_uint64; \
12
               else \
13
               { \
14
                   NSLog(@"Unknown size for type %s", #type); \
15
                   abort(); \
16
               } \
17
            } \
18
        } while(0)

To round out the integer macros, I have a quick one which takes an integer type and then generates code to check for both signed and unsigned variants:

1
        #define INT(type) do { \
2
            SINT(type); \
3
            UINT(unsigned type); \
4
        } while(0)

1
        #define COND(type, name) do { \
2
            if(str[0] == @encode(type)[0]) \
3
                return &ffi_type_ ## name; \
4
        } while(0)

1
        #define PTR(type) COND(type, pointer)

In theory, it would be possible to support arbitrary structs by parsing the struct in the @encode string and building up the appropriate ffi_type to match. In practice, this is difficult and error-prone. The @encode format is not very friendly at all. To handle most cases, there are only a small number of structs that need to be translated. These structs can be detected with a simple string compare without parsing the @encode string, and then a simple hardcoded list of types provided to libffi. While this won’t handle all cases, by bailing out early if an unknown struct is discovered and making it easy to add new ones, this enables the programmer to quickly fix any deficiences which may be encountered.

One last macro handles structs. It takes a struct type and a list of corresponding ffi_types. If the @encode matches, it creates an ffi_type for the struct, fills out the elements from the arguments given, and returns it:

1
        #define STRUCT(structType, ...) do { \
2
            if(strncmp(str, @encode(structType), strlen(@encode(structType))) == 0) \
3
            { \
4
               ffi_type *elementsLocal[] = { __VA_ARGS__, NULL }; \
5
               ffi_type **elements = [self _allocate: sizeof(elementsLocal)]; \
6
               memcpy(elements, elementsLocal, sizeof(elementsLocal)); \
7
                \
8
               ffi_type *structType = [self _allocate: sizeof(*structType)]; \
9
               structType->type = FFI_TYPE_STRUCT; \
10
               structType->elements = elements; \
11
               return structType; \
12
            } \
13
        } while(0)

1
        SINT(_Bool);
2
        SINT(signed char);
3
        UINT(unsigned char);
4
        INT(short);
5
        INT(int);
6
        INT(long);
7
        INT(long long);

1
        PTR(id);
2
        PTR(Class);
3
        PTR(SEL);
4
        PTR(void *);
5
        PTR(char *);
6
        PTR(void (*)(void));

1
        COND(float, float);
2
        COND(double, double);
3

4
        COND(void, void);

That takes care of primitives. Now it’s time for structs. I only handle CGRect, CGPoint, CGSize, and their NS equivalents. Others could easily be added if necessary.

These structs all have elements of type CGFloat. The type of CGFloat can either be float or double depending on the platform. The first thing to do, then, is to figure out which one it is, and grab the corresponding ffi_type:

1
        ffi_type *CGFloatFFI = sizeof(CGFloat) == sizeof(float) ? &ffi;_type_float : &ffi;_type_double;

1
        STRUCT(CGRect, CGFloatFFI, CGFloatFFI, CGFloatFFI, CGFloatFFI);
2
        STRUCT(CGPoint, CGFloatFFI, CGFloatFFI);
3
        STRUCT(CGSize, CGFloatFFI, CGFloatFFI);

1
    #if !TARGET_OS_IPHONE
2
        STRUCT(NSRect, CGFloatFFI, CGFloatFFI, CGFloatFFI, CGFloatFFI);
3
        STRUCT(NSPoint, CGFloatFFI, CGFloatFFI);
4
        STRUCT(NSSize, CGFloatFFI, CGFloatFFI);
5
    #endif

1
        NSLog(@"Unknown encode string %s", str);
2
        abort();
3
    }

When a closure is prepared, it takes three important pieces of data. One is the type information that all of the previous code worked so hard to build. One is a C function which receives the arguments in libffi format. The last one is a context pointer which is passed into that C function. This context pointer is what allows all of the magic to happen. It allows the function to determine which instance of MABlockClosure the call is associated with, and call through to the associated block.

Like with closure allocation and deallocation, how the closure is prepared depends on which mode libffi is operating in. If libffi is managing its own closure allocation, then it’s just a single call to prepare the closure. Otherwise, there’s a different call to set it up, and then a call to mprotect is required to mark the memory as executable. Here’s what the -_prepClosure method looks like:

1
    - (void)_prepClosure
2
    {
3
    #if USE_LIBFFI_CLOSURE_ALLOC
4
        ffi_status status = ffi_prep_closure_loc(_closure, &_closureCIF, BlockClosure, self, _closureFptr);
5
        if(status != FFI_OK)
6
        {
7
            NSLog(@"ffi_prep_closure returned %d", (int)status);
8
            abort();
9
        }
10
    #else
11
        ffi_status status = ffi_prep_closure(_closure, &_closureCIF, BlockClosure, self);
12
        if(status != FFI_OK)
13
        {
14
            NSLog(@"ffi_prep_closure returned %d", (int)status);
15
            abort();
16
        }
17

18
        if(mprotect(_closure, sizeof(_closure), PROT_READ | PROT_EXEC) == -1)
19
        {
20
            perror("mprotect");
21
            abort();
22
        }
23
    #endif
24
    }

1
    static void BlockClosure(ffi_cif *cif, void *ret, void **args, void *userdata)
2
    {
3
        MABlockClosure *self = userdata;

1
        int count = self->_closureArgCount;
2
        void **innerArgs = malloc((count + 1) * sizeof(*innerArgs));
3
        innerArgs[0] = &self-;>_block;
4
        memcpy(innerArgs + 1, args, count * sizeof(*args));

1
        ffi_call(&self-;>_innerCIF, BlockImpl(self->_block), ret, innerArgs);

1
        free(innerArgs);
2
    }

Convenience Functions Using MABlockClosure directly is slightly inconvenient. I built two convenience functions to make this a bit easier. The BlockFptr function creates an MABlockClosure instance as an associated object on the block itself. This ensures that the function pointer remains valid for as long as the block is valid:

1
    void *BlockFptr(id block)
2
    {
3
        @synchronized(block)
4
        {
5
            MABlockClosure *closure = objc_getAssociatedObject(block, BlockFptr);
6
            if(!closure)
7
            {
8
                closure = [[MABlockClosure alloc] initWithBlock: block];
9
                objc_setAssociatedObject(block, BlockFptr, closure, OBJC_ASSOCIATION_RETAIN);
10
                [closure release]; // retained by the associated object assignment
11
            }
12
            return [closure fptr];
13
        }
14
    }

1
    void *BlockFptrAuto(id block)
2
    {
3
        return BlockFptr([[block copy] autorelease]);
4
    }

1
    int x = 42;
2
    void (*fptr)(void) = BlockFptrAuto(^{ NSLog(@"%d", x); });
3
    fptr(); // prints 42!

That wraps up this week’s (late) Friday Q&A. Come back in two weeks for the next installment. Until then, as always, keep sending me your ideas for topics to cover here.