[swift-evolution] A path forward on rationalizing unicode identifiers and operators

David Sweeris davesweeris at mac.com
Tue Oct 3 18:05:50 CDT 2017


> On Oct 2, 2017, at 10:06 PM, Chris Lattner <clattner at nondot.org> wrote:
> 
> On Oct 2, 2017, at 9:12 PM, David Sweeris via swift-evolution <swift-evolution at swift.org <mailto:swift-evolution at swift.org>> wrote:
>>> Keep in mind that Swift already goes far above and beyond in terms of operators
>> Yep, that's is a large part of why I'm such a Swift fan :-D
> 
> Fortunately, no one is seriously proposing a major curtailing of the capabilities here, we’re just trying to rationalize the operator set, which is a bit of a mess at present.
I guess I don't really understand why it's currently "a bit of a mess". Maybe I should go take a look at the relevant compiler code to try to get a better understanding of what you're talking about. Or is this purely a matter of finding places where we disagree with Unicode's character classification? If so, I think I'm back at not quite getting it... In any case, I've apparently misunderstood something because I was under the impression that this would lead to a "major curtailing".

>>> in that: (a) it allows overloading of almost all standard operators; (b) it permits the definition of effectively an infinite number of custom operators using characters found in standard operators; (c) it permits the definition of custom precedences for custom operators; and (d) it additionally permits the use of a wide number of Unicode characters for custom operators. Most systems programming languages don't even allow (a), let alone (b) or (c). Even dramatically curtailing (d) leaves Swift with an unusually expansive support for custom operators.
> 
>> Yes, but many of those custom operators won't have a clear meaning because operators are rarely limited to pre-existing symbols like "++++++++" (which doesn't mean anything at all AFAIK), so operators that are widely known within some field probably won't be widely known to the general public, which, IIUC, seems to be your standard for inclusion(?). Please let me know if that's not your position... I hate being misunderstood probably more than the next person, and I wouldn't want to be guilty of that myself.
> 
> The approach to operator handling in Swift is very intentional.  IMO, it is well known that:
> 
> 1) Operators can make code significantly easier to understand by reducing noise from complex expressions: writing x.matmul(y) is insane <https://www.python.org/dev/peps/pep-0465/> if you’re doing a lot of matrix multiplies.
> 2) Operators can be completely opaque to someone who doesn’t know them, and sometimes named functions are more clear.
> 3) Named functions can also sometimes be completely opaque if you don't know them, e.g. "let x = cholesky(y)"
> 4) Languages with fixed operator sets that also allow overloading (e.g. C++) end up with those operators being abused.
> 5) Some code can only be written and maintained by domain experts, and those experts often know the operators.
> 
> Swift’s approach is basically to say to users: “ok we allow overloaded operators, but at least if you encounter some operation that you don’t know… you know that you don’t know it”.  If you encounter "if ¬x {“  or “a ∩ b” in some source code, at least you can command click, jump to the definition and read what it does: you aren’t misled into thinking that the expression is some familiar thing, but find out later it was overloaded to do something crazy (bitshifts for i/o?  really??? :).
> 
> Set algebra is an illustrative example, because it is both used by people who are experts and people who are not.  As far as policies go, I think it makes sense for Swift libraries to define operator-like things as named functions (e.g. “intersection") and also define operators (“∩”) which can optionally be used in source bases that want them for convenience.  The compiler and language cannot know whether a code base is written and maintained by experts who know the symbols and who value their clarity (over the difficulty typing and recognizing them), and this approach allows maintainers of the codebase to pick their own policies.
Oh, yeah, I can't imagine a situation in which I'd think it'd be a good idea to not define a named function to go along with a unicode operator. I'm mainly concerned that we not limit the people in 5) unless we need to. And to be clear, if we actually need to, then I'm fine with doing that... It's just that -- like I said earlier in this message -- I don't clearly understand why this is a problem. That said, there are multiple people more knowledgable than I am on the topic who are telling me I'm wrong about something that I'm kinda surprised there was disagreement about in the first place... I think it's probably time for me to stop pressing an issue that I apparently don't fully understand and wait to see what's in the actual proposal.



> I do think that Ethan’s suggestion upthread interesting, which suggest considering something like:
>    import matrixlib (operators: [ᵀ,·,⊗])
> 
> Three concerns I see:
>  - Requiring them today would be a source incompatibility with Swift 4
If we leave the current "import everything" behavior as the default, why would it be a source-breaking change? We could also flip the syntax around and write something like "import matrixlib (operators: -[ᵀ,·,⊗])", where we're saying which operators we want to not import, because clearly (heh, famous last words in this thread) the default value would the empty list, and an empty list of what not to import implies we should import them all. (I'm not arguing for or against either, just exploring the issue & syntax)

>  - Multiple modules can define operators, unclear whether this refers to the operator decl or implementations of operators.
If it weren't for precedences, I'd guess it should probably only refer to the implementations, since there's no point (that I can see) of wanting to not import an operator unless you want to use it for something else and the two functions' signatures create ambiguities. Since an operator's precedence is attached to the operator itself rather than the function, it'd probably be better to just pretend it doesn't exist at all.

>  - Imports are per-module, not per-source-file, so this couldn’t be used to “user-partition” the identifier and operator space.  It could be a way to make it clear that the user is opting into these explicitly.

I haven't followed them as closely as I probably should've, but do the recent threads on submodules change anything WRT this?

- Dave Sweeris
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <https://lists.swift.org/pipermail/swift-evolution/attachments/20171003/8d7a203a/attachment.html>


More information about the swift-evolution mailing list