[swift-evolution] Normalize Enum Case Representation (rev. 2)

Joe Groff jgroff at apple.com
Thu Mar 9 15:49:34 CST 2017


> On Mar 9, 2017, at 11:48 AM, Daniel Duan <daniel at duan.org> wrote:
> 
>> 
>> On Mar 9, 2017, at 12:31 AM, Xiaodi Wu <xiaodi.wu at gmail.com <mailto:xiaodi.wu at gmail.com>> wrote:
>> 
>> On Thu, Mar 9, 2017 at 1:07 AM, Daniel Duan <daniel at duan.org <mailto:daniel at duan.org>> wrote:
>> Thanks for the thoughtful feed Xiaodi! Replies are inline. I'm going to incorporate some of the responses into the proposal.
>> 
>> On Mar 8, 2017, at 9:56 PM, Xiaodi Wu <xiaodi.wu at gmail.com <mailto:xiaodi.wu at gmail.com>> wrote:
>> 
>>> The rendered version differs from the text appended to your message. I'll assume the more fully fleshed out version is what you intend to submit. Three comments/questions:
>>> 
>>> Enum Case "Overloading"
>>> 
>>> An enum may contain cases with the same full name but with associated values of different types. For example:
>>> 
>>> enum Expr {
>>>     case literal(Bool)
>>>     case literal(Int)
>>> }
>>> The above cases have overloaded constructors, which follow the same rules as functions at call site for disambiguation:
>>> 
>>> // It's clear which case is being constructed in the following.
>>> let aBool: Expr = .literal(false)
>>> let anInt: Expr = .literal(42)
>>> User must specify an as expression in sub-patterns in pattern matching, in order to match with such cases:
>>> 
>>> case .literal(let value) // this is ambiguous
>>> case .literal(let value as Bool) // matches `case literal(Bool)`
>>> 
>>> Comment/question 1: Here, why aren't you proposing to allow `case .literal(let value: Bool)`? For one, it would seem to be more consistent.
>> 
>> The example in proposal doesn't include any labels. Are you suggesting two colons for sub-patterns with labels? Like `case .literal(value: let value: Bool)`?  This looks jarring. But I'm definitely open to other suggestions.
>> 
>> That does look jarring. But hmm.
>>  
>>> Second, since we still have some use cases where there's Obj-C bridging magic with `as`, using `as` in this way may run into ambiguity issues if (for example) you have two cases, one with associated value of type `String` and the other of type `NSString`.
>> 
>> Either this should be rejected at declaration, or we need a way to accept a "pre-magic" resolution at pattern matching, when this scenarios is at hand.
>> 
>> Or we align pattern matching to function syntax and have such cases disambiguated in that way (see below).
>>  
>> I'm on the phone so I can't verify. Wouldn't function overloading face a similar problem?
>> 
>>> Also, since enum cases are to be like functions, I assume that the more verbose `as` version would work for free: `case .literal(let value) as (Bool) -> Expr`?
>>> 
>> 
>> This is not being proposed. When a user sees/authors a case, their expectation for the declared case constructor should resemble that of a function. Pattern matching was considered separately since it's not relatable syntactically.
>> 
>> This requires justification. If enum cases are to be like functions, then the logical expectation is that pattern matching should work in that way too. I see no rationale to undergird your claim that pattern matching is "not relatable syntactically." Allowing `case .literal(let value) as (Bool) -> Expr` would solve the issue above, as well as provide more flexibility with the issues below.
> 
> I have concerns about the verbosity this syntax introduces. Example:
> 
> enum A { case v(Int) }
> enum B { case v(A); case v(Int) }
> 
> To disambiguate a value of type B, it would be
> 
> case .v(A.v(let xValue)) as ((Int -> A) -> B)
> 
> This scales poorly for cases with deeper recursions and/or more associated values.
> 
> Disambiguate at the sub-pattern level doesn’t have this scalability problem.

This also is not the right meaning of `as`. `as` coerces the subpattern type, and a `.v(...)` pattern has type B. This coercion makes no sense.

Personally I think that overloading is unnecessary scope creep and should be left out of the proposal.

>> 
> We’ve encountered a bigger question that it initial seems. Let’s zoom out.
> 
> There are 2 popular kinds of patterns for value deconstruction in PLs: patterns for trees and sequences. The former deconstructs value who’s prominently recursive: enum, struct, tuple; the latter deals with list-like (grows in 1 direction indefinitely) things. We are now investigating the syntax that can potentially be used for all tree patterns. Whereas the “shape” alone isn’t enough information, user must use the type to supplement the pattern for a successful match. If we introduce patterns for structs in the future, whatever we came up here for type disambiguation should work there.
>>>  <https://github.com/dduan/swift-evolution/blob/SE0155-rev2/proposals/0155-normalize-enum-case-representation.md#alternative-payload-less-case-declaration>Alternative Payload-less Case Declaration
>>> 
>>> In Swift 3, the following syntax is valid:
>>> 
>>> enum Tree {
>>>     case leaf() // the type of this constructor is confusing!
>>> }
>>> Tree.leaf has a very unexpected type to most Swift users: (()) -> Tree
>>> 
>>> We propose this syntax declare the "bare" case instead. So it's going to be the equivalent of
>>> 
>>> enum Tree {
>>>     case leaf // `()` is optional and does the same thing.
>>> }
>>> 
>>> 
>>> Comment/question 2: First, if associated values are not to be modeled as tuples, for backwards compatibility the rare uses of `case leaf()` should be migrated to `case leaf(())`.
>>> 
>> 
>> Yes,
>> 
>> Cool.
>>  
>> and when user uses a arbitrary name when they should have used a label, or when labels are misspelled, the compiler should suggest the correct labels.
>> 
>> As below, I disagree with this restriction very strongly.
>>  
>> I wasn't sure how much of migrator related thing should go into a proposal. Perhaps there should be more.
>>> Second, to be clear, you are _not_ proposing additional sugar so that a case without an associated value be equivalent to a case that has an associated value of type `Void`, correct? You are saying that, with your proposal, both `case leaf()` and `case leaf` would be regarded as being of type `() -> Tree` instead of the current `(()) -> Tree`?
>>> 
>> 
>> Correct. I'm _not_ proposing implicit `Void`.
>>> [The latter (i.e. `() -> Tree`) seems entirely fine. The former (i.e. additional sugar for `(()) -> Tree`) seems mostly fine, except that it would introduce an inconsistency with raw values that IMO is awkward. That is, if I have `enum Foo { case bar }`, it would make case `bar` have implied associated type `Void`; but, if I have `enum Foo: Int { case bar }`, would case `bar` have raw value `0` of type `Int` as well as associated value `()` of type `Void`?]
>>> 
>>> 
>>>  <https://github.com/dduan/swift-evolution/blob/SE0155-rev2/proposals/0155-normalize-enum-case-representation.md#pattern-consistency>Pattern Consistency
>>> 
>>> (The following enum will be used throughout code snippets in this section).
>>> 
>>> indirect enum Expr {
>>>     case variable(name: String)
>>>     case lambda(parameters: [String], body: Expr)
>>> }
>>> Compared to patterns in Swift 3, matching against enum cases will follow stricter rules. This is a consequence of no longer relying on tuple patterns.
>>> 
>>> When an associated value has a label, the sub-pattern must include the label exactly as declared. There are two variants that should look familiar to Swift 3 users. Variant 1 allows user to bind the associated value to arbitrary name in the pattern by requiring the label:
>>> 
>>> case .variable(name: let x) // okay
>>> case .variable(x: let x) // compile error; there's no label `x`
>>> case .lambda(parameters: let params, body: let body) // Okay
>>> case .lambda(params: let params, body: let body) // error: 1st label mismatches
>>> User may choose not to use binding names that differ from labels. In this variant, the corresponding value will bind to the label, resulting in this shorter form:
>>> 
>>> case .variable(let name) // okay, because the name is the same as the label
>>> case .lambda(let parameters, let body) // this is okay too, same reason.
>>> case .variable(let x) // compiler error. label must appear one way or another.
>>> case .lambda(let params, let body) // compiler error, same reason as above.
>>> Comment/question 3: Being a source-breaking change, that requires extreme justification, and I just don't think there is one for this rule. The perceived problem being addressed (that one might try to bind `parameters` to `body` and `body` to `parameters`) is unchanged whether enum cases are modeled as tuples or functions, so aligning enum cases to functions is not in and of itself justification to revisit the issue of whether to try to prohibit this. 
>>> 
>> 
>> To reiterate, here patterns are changed not for any kind of "alignment" with function syntax. It changed because we dropped the tuple pattern (which remains available for matching with tuple values, btw), therefore we need to consider what a first-class syntax for enum case would look like.
>> 
>> Since the rationale for this proposal is to "normalize" enum cases by making them more function-like, again you will need to justify why pattern matching should break from that overarching goal.
>> 
>> This is a source-breaking change, so it's not enough that a "first-class syntax" from the ground up would be different from the status quo (which was the Swift 3 evolution standard--if we were to do it again from scratch, would we still do it this way?). The Swift 4 evolution expectation is that a source-breaking change should require "extreme" justification.
> 
> Fair enough. I think the Swift 3 criteria is met. As for Swift 4, I used the word “deprecated” in the source compatibility section. I imagine this means that only deprecation warnings and fix-its are issued in Swift 4 and the warning becomes an error in Swift 5. Obviously, that’s not a justification…
> 
> What do you think Joe? 

Unifying the declaration model is worth a minor source break, IMO.

>> 
>> The justification for this breaking change is this: with tuples, labels in pattern is not well enforced. User can skip them, bind value to totally arbitrary names, etc. I personally think emulating such rule prevents us from making pattern matching easier to read for experienced devs and easier to learn for new comers. 
>> 
>> Perhaps, but this is an argument that tuple pattern binding is inferior. It has nothing to do with enum cases in particular. In fact, several threads have touched on this topic holistically. The conclusions there have been that allowing (a: Int, b: Int) to bind (Int, Int) or vice versa is healthy and useful, but allowing (a: Int, b: Int) to bind (b: Int, c: Int) is not so good, and (a: Int, b: Int) binding (b: Int, a: Int) is counterintuitive and should be removed.
> 
> Fantastic! Really appreciate this summary.

Tuples are at the type system level convertible from labeled and unlabeled and in label-changing ways. We should arguably clamp down on that, and that might affect how pattern matching works with them. It isn't necessarily analogous to enum cases, since with enum cases we're working with a named declaration where the labels are needed for name lookup.

-Joe
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.swift.org/pipermail/swift-evolution/attachments/20170309/e80da02f/attachment.html>


More information about the swift-evolution mailing list