"The other feature you quickly find yourself needing is the ability to use arbit...

ErikCorry · on July 1, 2021

It looks like your "pipeEncode" is generating a string that reflects your desired key equality function. I mention this approach in the blog post, but I consider it a poor substitute for being able to use arbitrary objects and specify the equality function on the map or set.

The issue of mutable keys is a slightly different one. If you mutate any of the properties of your object that are used by the map you are going to have a bad time, so don't do that. And I guess if your maps are sufficiently simple (eg JS's object-identity maps) then the user can't make that mistake, but at what cost?

If they are generating strings as keys and they mutate the object after creating the string then this will also break so they haven't even really avoided the problem.

jerf · on July 1, 2021

"I consider it a poor substitute for being able to use arbitrary objects and specify the equality function on the map or set"

My point is that it's an even worse substitute than most programmers realize, because to use it properly you have to understand how to encode parameters. The thing that people usually use, string concatenation with some delimiter, is fundamentally flawed.

(My favorite... and, alas, I've inherited a system that uses this, though fortunately it hasn't surprised me yet... is using "underscore" as a delimiter, for values whose names routinely include underscores! Fortunately, nothing ever tries to extract the original values from the key programmatically, and it's not really in a place hackers are going to attack. But still... yeesh.)

The one exception I've sometimes made is that if you happen to be in an environment where you know a certain value will never be used, you can use that as the delimiter; I've used ASCII NUL for that a few times. But you have to be sure that it's not just a "weird" value that "nobody would ever use", but something truly excluded by the context, something that regardless of what is input by someone somewhere is completely impossible to ever get to your code. Generally, the characters you can type on a keyboard are not a good idea.

nemo1618 · on July 1, 2021

Go does not support using some types as map keys, including slices, channels, functions, and other maps. Slices are the most annoying, and I have seen plenty of code that uses fmt.Sprint(s) as a workaround. Fortunately the compiler now recognizes when you convert a []byte to a string for use as a map key, and will not allocate a new string.

arp242 · on July 1, 2021

In Go it depends if the type is "comparable"; i.e. if "==" works.

You can use arrays in Go; e.g. map[[2]int]string, and you can also use channels; although I'm not sure what the rules for comparing channels are exactly off-hand (I'm struggling to come up with a scenario when this would be useful off-hand actually).

The big problem with slices and maps is that they can be modified. That is, what happens if you modify a slice after you used it as a map key? In slices this is worse than with maps because the backing array can change if you run out of cap space. And also, do you compare by value or identity? And again, what happens if either changes?

I'm not sure if it's possible to come up with a set of rules that wouldn't take people by surprise in at least some cases.

jerf · on July 1, 2021

Python either. I correct my post above to switch to indexing by a tuple, because what I originally had is wrong:

    >>> d = {}
    >>> d[[1,2]] = 10
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
    TypeError: unhashable type: 'list'

This is why I qualified my statement with "Each has their own take on the problem of being unable to use mutable keys,".

Go can be consistent in a simple way because of its type system, it can see if any part of a key has something in it that can't be hashed: https://play.golang.org/p/rf8IqPb76Em

Python, as befits Python, has default behavior for instances that I believe is "is" equivalency, but you can override that with various double-underscore methods to do whatever.

vharuck · on July 1, 2021

>Python, as befits Python, has default behavior for instances that I believe is "is" equivalency

Python dicts consider two keys the same if they have the same hash value and are "=="-equal. So __eq__ and __hash__ are the dunder methods to finagle. Python's sets are the same way.

A useful example is with the pathlib library.

    from pathlib import Path
    p = Path('a.txt')
    q = Path('a.txt').absolute()
    p is q # False
    {p, q} # Only one element

"q" carries some different info than "p", but it refers to the same file location. So not considering them distinct values is a good decision for this package.

joshuamorton · on July 1, 2021

Note that this is an oversimplification, as

    p = Path('a.txt')
    q = Path('a.txt')

    p is q  # False

since p and q are different objects and "is" equality checks if the objects are the same (this can be interpreted approximately as "have the same memory address"), so it'll almost never be true. And in the cases where it is ( 2 is 2), you shouldn't rely on it, as most of them are optimizations.

nirs · on July 1, 2021

p and q are the same file now, but after you change current directory their are not. They should never be equal.