[swift-dev] statically initialized arrays

Joe Groff jgroff at apple.com
Mon Jun 19 18:10:33 CDT 2017


> On Jun 19, 2017, at 1:45 PM, Erik Eckstein <eeckstein at apple.com> wrote:
> 
>> 
>> On Jun 19, 2017, at 8:53 AM, Joe Groff <jgroff at apple.com> wrote:
>> 
>>> 
>>> On Jun 14, 2017, at 11:24 AM, Erik Eckstein via swift-dev <swift-dev at swift.org> wrote:
>>> 
>>> Hi,
>>> 
>>> I’m about implementing statically initialized arrays. It’s about allocating storage for arrays in the data section rather than on the heap.
>>> 
>>> Info: the array storage is a heap object. So in the following I’m using the general term “object” but the optimization will (probably) only handle array buffers.
>>> 
>>> This optimization can be done for array literals containing only other literals as elements.
>>> Example:
>>> 
>>> func createArray() -> [Int] {
>>>  return [1, 2, 3]
>>> }
>>> 
>>> The compiler can allocate the whole array buffer as a statically initialized global llvm-variable with a reference count of 2 to make it immortal.
>>> It avoids heap allocations for array literals in cases stack-promotion can’t kick in. It also saves code size.
>>> 
>>> What’s needed for this optimization?
>>> 
>>> 1) An optimization pass (GlobalOpt) which detects such array literal initialization patterns and “outlines” those into a statically initialized global variable
>>> 2) A representation of statically initialized global variables in SIL
>>> 3) IRGen to create statically initialized objects as global llvm-variables
>>> 
>>> ad 2) Changes in SIL:
>>> 
>>> Currently a static initialized sil_global is represented by having a reference to a globalinit function which has to match a very specific pattern (e.g. must contain a single store to the global).
>>> This is somehow quirky and would get even more complicated for statically initialized objects.
>>> 
>>> I’d like to change that so that the sil_global itself contains the initialization value.
>>> This part is not yet related to statically initialized objects. It just improves the representation of statically initialized global in general.
>>> 
>>> @@ -1210,7 +1210,9 @@ Global Variables
>>> ::
>>> 
>>>   decl ::= sil-global-variable
>>> +  static-initializer ::= '{' sil-instruction-def* '}'
>>>   sil-global-variable ::= 'sil_global' sil-linkage identifier ':' sil-type
>>> +                             (static-initializer)?
>>> 
>>> SIL representation of a global variable.
>>> 
>>> @@ -1221,6 +1223,19 @@ SIL instructions. Prior to performing any access on the global, the
>>> Once a global's storage has been initialized, ``global_addr`` is used to
>>> project the value.
>>> 
>>> +A global can also have a static initializer if it's initial value can be
>>> +composed of literals. The static initializer is represented as a list of
>>> +literal and aggregate instructions where the last instruction is the top-level
>>> +value of the static initializer::
>>> +
>>> +  sil_global hidden @_T04test3varSiv : $Int {
>>> +    %0 = integer_literal $Builtin.Int64, 27
>>> +    %1 = struct $Int (%0 : $Builtin.Int64)
>>> +  }
>>> +
>>> +In case a global has a static initializer, no ``alloc_global`` is needed before
>>> +it can be accessed.
>>> +
>>> 
>>> Now to represent a statically initialized object, we need a new instruction. Note that this “instruction" can only appear in the initializer of a sil_global.
>>> 
>>> +object
>>> +``````
>>> +::
>>> +
>>> +  sil-instruction ::= 'object' sil-type '(' (sil-operand (',' sil-operand)*)? ')'
>>> +
>>> +  object $T (%a : $A, %b : $B, ...)
>>> +  // $T must be a non-generic or bound generic reference type
>>> +  // The first operands must match the stored properties of T
>>> +  // Optionally there may be more elements, which are tail-allocated to T
>>> +
>>> +Constructs a statically initialized object. This instruction can only appear
>>> +as final instruction in a global variable static initializer list.
>>> 
>>> Finally we need an instruction to use such a statically initialized global object.
>>> 
>>> +global_object
>>> +`````````````
>>> +::
>>> +
>>> +  sil-instruction ::= 'global_object' sil-global-name ':' sil-type
>>> +
>>> +  %1 = global_object @v : $T
>>> +  // @v must be a global variable with a static initialized object
>>> +  // $T must be a reference type
>>> +
>>> +Creates a reference to the address of a global variable which has a static
>>> +initializer which is an object, i.e. the last instruction of the global's
>>> +static initializer list is an ``object`` instruction.
>>> 
>>> 
>>> ad 3) IRGen support
>>> 
>>> Generating statically initialized globals is already done today for structs and tuples.
>>> What’s needed is the handling of objects.
>>> In addition to creating the global itself, we also need a runtime call to initialize the object header. In other words: the object is statically initialized, except the header.
>>> 
>>> HeapObject *swift::swift_initImmortalObject(HeapMetadata const *metadata, HeapObject *object)
>>> 
>>> There are 2 reasons for that: first, the object header format is not part of the ABI. And second, in case of a bound generic type (e.g. array buffers) the metadata is not statically available.
>> 
>> I did some work along these lines already so that KeyPaths could be immortal heap objects. I added an entry point _swift_instantiateInertHeapObject that does exactly this. We could un-underscore it and promote it to a SWIFT_RUNTIME_EXPORT.
> 
> sounds good. If I understood it correctly, _swift_instantiateInertHeapObject currently initializes the object with a reference count of 1. For array buffers we would need 2. But I think for KeyPaths you are fine with 2 as well.

Yeah, 2 is probably a more semantically-correct value for global objects anyway to ward off uniqueness checks, at least till we get a real "inert" bit pattern.

-Joe


More information about the swift-dev mailing list