Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Deep-equality keys #1884

Closed
michaelhkay opened this issue Mar 17, 2025 · 2 comments
Closed

Deep-equality keys #1884

michaelhkay opened this issue Mar 17, 2025 · 2 comments
Labels
Discussion A discussion on a general topic.

Comments

@michaelhkay
Copy link
Contributor

michaelhkay commented Mar 17, 2025

Issue #119 proposes extending maps to allow arbitrary values as keys. This is very difficult to achieve, (a) because the fact that keys are atomic items is deeply embedded in the design of a number of functions and operations on maps, and (b) because it's very hard to define an equality function that suits everyone.

The way we tacked variable equality semantics for strings was via the collation-key() function, which takes a string and a collation as input and produces an opaque key value, which can be used as a key in maps, and which reflects the desired equality semantics.

We could extend the same idea to values other than strings. In particular, we could define a deep-equality-key() that can be calculated for any sequence, and that takes all the matching options of the deep-equal() function as a parameter. (We could then redefine deep-equal(a, b, options) to mean deep-equality-key(a, options) eq deep-equality-key(b, options)).

The main drawback is that the deep-equality-keys for large node trees or maps would be rather long strings. People might use the functionality without realising the expense.

Another problem is that one of our options in deep-equals() is a callback function for item equality, and we couldn't replicate this when computing a key. But this callback is the only way we have, for example, to compare nodes by identity rather than by content.

Note that an internal deep-equality-key concept (or at least a deep-equality hashcode) is needed anyway for efficient implementation of deep-equals where order is deemed irrelevant. Without it, the function becomes O(n^2). Quite independently of this proposal, we should perhaps have an explicit option on deep-equals() to compare nodes by identity.

@michaelhkay michaelhkay added the Discussion A discussion on a general topic. label Mar 17, 2025
@MarkNicholls
Copy link

you don't seem to have any takers, unfortunately I'm too ignorant to understand the question, if you need someone to "talk at", and ask dumb questions, feel free to explain more. My presumption is that a collation key is some sort of partial order? that indicates what are equal/greater/less than (or some sort of mapping into a total order). I'm not sure how that extends to sequences(?) as keys.

@michaelhkay
Copy link
Contributor Author

Dropping this for lack of interest.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Discussion A discussion on a general topic.
Projects
None yet
Development

No branches or pull requests

2 participants