-
Notifications
You must be signed in to change notification settings - Fork 18
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Need more discussion and explanation of deep-lookup operator #854
Comments
I would like to explore whether we can follow the example of JSONPath by making the lookup operators return a more complex object that is then "flattened" (in some sense) when the context demands it. This is analagous to the way the result of a path expression on a node tree is atomised when the context demands atomic values rather than nodes. Let's follow JSONPath by saying that the lookup operators return a sequence of "member locators", one for each member of an array or map that is selected. Except where otherwise specified, the effect of supplying a sequence of member locators to any function or operator is to flatten it to a sequence of items (thus preserving compatibility). But some operations are available that treat it differently: for example
This then provides the necessary infrastructure to support PR #832 - so long as the downwards selection is done using lookup operators, (a) we can update the located members as a unit (rather than operating on individual items), and (b) we can propagate updates upwards to return a new version of the containing tree. Finding syntax, as always, is difficult. Perhaps we could write array:members() would then likewise deliver a sequence of "member locators". Functions would available on member locators to obtain the value of the member (a sequence), its key/index, its containing map/array, etc. |
Before discussing syntactical questions, it would be helpful to head for some common terminology. Right now, it seems hard (at least to me) to follow what we believe is, or could be, a member, an entry, a value, etc. Here is what I think is the status quo:
With the proposal above, would you suggest sticking with this terminology? In #826 (comment), I suggested defining a function for both arrays and maps with the same name, which returns a singleton map for each unit. I proposed [ (), 1] => array:entries() (: map { 1: () }, map { 2: 1 } :)
map { 'a': (), 'b': 1 } => map:entries() (: map { 'a': () }, map { 'b': 1 } :)
…but following this thread I assume the functions (provided we’d want them) would rather be called |
Outline, sketchy thoughts:
|
If lookups return In any case, I think it should be possible to be able to write simple constructs like And I wonder whether there are additional use cases apart from updates to justify this fairly fundamental enhancement? The title of this issue is about the deep-lookup operator. What would be the connection? Sorry for all the questions, which aren't meant to be heretic. Probably I can't see yet how a majority of users could benefit from the changes. |
Dimitre raised the concern about the fact that the deep-lookup operator ( With traditional node selection, a path like |
Perhaps we can make it work like this:
|
I definitely like the last approach. It reminds me of a project we did some time ago to be able to traverse in maps & arrays with the help of function items (only now I notice it’s very similar). Here’s some basic runnable code that presents the basic idea: (: creates a function item encapsulating a value and its optional parent :)
declare function Q{Path}new($value, $parent := ()) {
function($op) {
if(string($op) = '_value') then (
(: return encapsulated value :)
$value
) else if(string($op) = '_parent') then (
(: create function item pointing to the parent :)
Q{Path}new($parent, $parent ! .('_parent'))
) else (
(: create function item pointing to a child :)
Q{Path}new($value?($op), $value)
)
}
};
let $value := [ map {}, map { 'xyz': 'text' } ]
let $path := Q{Path}new($value)
return $path(2)('xyz')('_parent')('_value') If we allowed lookups on function items (see #51), the last line could be written as follows: return $path?2?xyz?_parent?_value |
Indeed, that's very similar in concept. There's a problem in both solutions that they require reserved names for special properties (such as Can one get by without introducing a new data type, and just do everything by layering on top of the types we already have? I think that's probably not going to be too usable: at some level you want to be able to say "is this item a JNode?". But it might be that the most economical way of achieving that is with annotations, so a JNode is just a map annotated with |
Yes. As long as the lookup operator only accepts atomic values, we could pass on sequences with 0 or more than 2 items or function items. Next, the proposed approach does not support wildcard or descendant traversals. One solution could be to define the actual operation in the supplied function item: declare function Q{Locator}new($context, $ancestors := []) {
function($arg) {
let $path := array:join(($ancestors, [ $context ]))
return if($arg instance of function(*)) then (
(: compute result :) $arg($path)
) else (
(: create child locator :) Q{Locator}new($context?$arg, $path)
)
}
};
declare function Q{Locator}parent($path) {
let $ancestors := $path => array:trunk()
return Q{Locator}new(array:foot($ancestors), array:trunk($ancestors))
};
declare function Q{Locator}value($path) {
$path => array:foot()
};
let $value := [ map {}, map { 'xyz': 'text' } ]
let $locator := Q{Locator}new($value)
return $locator(2)('xyz')(Q{Locator}parent#1)(Q{Locator}value#1)
One proof-of-concept solution for
Following these steps, it should be fairly easy to write such a
…I definitely agree. Error messages for the selector function would be close to undecipherable. A Said this, I still have some sympathy for an even simpler solution (although it’s far from visionary): fn:deep-delete($map-or-array, [ 2, 'xyz' ]) |
I think the problem with the tumbler approach is that practical use cases will invitably involve predicates. |
I am sorry I missed this interesting discussion. What is wrong with returning a sequence of arrays, each containing a single member? I like the idea of JNodes, but don't like the name JNode. Maybe use a name like "structure" or "composite", or "packaged", or "wrapped"? In .NET the term used is "boxed" and "unboxed".
In case when a function accepts an argument of type some-type but is passed a |
Addressed by PR #832 |
Although PR #832 was abandoned, the work was progressed and the current design using modifiers after the |
During discussion of the
??
operator it was pointed out that we need more examples and explanation, especially of how to handle cases where the "flattening" behaviour of the operator is inconvenient. This applies equally to paths using the existing?
operator -$x?y?z
and$x??z
both have this problem. For example, there is no way of filtering the result of$x?y?z
or$x??z
to select only members of size 3.The text was updated successfully, but these errors were encountered: