Fast Thread safe BLI memory routines?

I have a use for something like the routines in BLI_memarena.h and BLI_mempool.h, but that would be thread safe without the need to lock around every allocation. Something like the lockfree implementations of malloc that use atomic operations. Does anyone else feel the need for these too?