[xquery-talk] variable references in location steps

Fri Feb 25 09:45:36 PST 2011

Let's simplify this down to essentials: what does //a/$e return when $e 
is a sequence of nodes? (We'll assume $e contains no duplicates and is 
in document order.) The answer is, it returns $e. Technically, it 
evaluates $e once for each node in //a, combines the results and sorts 
them into document order and eliminates duplicates; and the result of 
doing that is the original value of $e.

You seem to be working on the assumption that $e is not a set of nodes, 
but some kind of function or expression which only reduces to a set of 
nodes when it is called, and that it can return different sets of nodes 
depending on the context when it is called. That's not the case. 
Evaluation might well be deferred until first use, but the context for 
evaluation is the context at the point where the variable is 
initialized, not the context where it is used.

Looks like a serious bug in eXist to me.

Now, what are you trying to achieve? If it's code reuse, you need 
functions. If it's performance, just try to avoid that double "//" in a 
single path expression. I know evaluation strategies for path 
expressions vary between one engine and another, and a leading "//" 
sometimes makes things slower and sometimes faster, but I would imagine 
that repeated use of "//" in a single path will always cause trouble.

Michael Kay
Saxonica

On 25/02/2011 01:17, Ron Van den Branden wrote:
> Hi,
>
> I'm trying to optimize a complex XQuery expression by defining 
> variables for reused context nodes. Until recently, I assumed that 
> those variable references could be used in location steps as 
> substitution for the nodes they hold, but apparently it doesn't work 
> exactly like that. I'll try to clarify with following XQuery (note: I 
> realize this question is XPath/XSLT related, but since the practical 
> angle is XQuery-oriented, I'll post it here.):
>
> let $test :=
> <test>
> <a><el1/></a>
> <b><el2/></b>
> </test>
> let $el1 := $test//el1
> let $el2 := $test//el2
> return
> <result>
> <step>
> <el1.a>{$test//a//$el1}</el1.a>
> <el2.a>{$test//a//$el2}</el2.a>
> <el1.b>{$test//b//$el1}</el1.b>
> <el2.b>{$test//b//$el2}</el2.b>
> </step>
> <predicate>
> <a.el1>{$test//a[.//$el1]}</a.el1>
> <a.el2>{$test//a[.//$el2]}</a.el2>
> <b.el1>{$test//b[.//$el1]}</b.el1>
> <b.el2>{$test//b[.//$el2]}</b.el2>
> </predicate>
> </result>
>
> Developing against the eXist XML database, I assumed that the queries 
> in <el1.a> and <el2.b> would only return the <el1> and <el2> elements 
> in their respective contexts, while <el2.a> and <el1.b> would return 
> nothing. Instead, they all return the respective <el1> and <el2> 
> elements, even for context nodes in which they don't occur (see 
> <el2.a> and <el1.b>). Later I found out that the variable used in this 
> last location step doesn't resolve to the nodes it contains, but 
> instead is interpreted as a filter, which consequently returns the 
> entire nodeset held in the $el1 and $el2 variables (see Jeni 
> Tennison's clarification at 
> <http://markmail.org/message/hss4auv23t6y2ep2>). Note: my assumption 
> came from my experience with the eXist XML database to filter down 
> ancestor nodes containing other nodes by means of a variable reference 
> in a predicate: see the queries in the <predicate> section. In eXist, 
> only <a.el1> and <b.el2> return the respective <a> and <b> elements, 
> while <a.el2> and <b.el1> are empty, which suggests that in this case 
> the variable reference is resolved to the nodes it contains. However, 
> this behaviour apparently differs from Saxon's which also returns the 
> <a> and <b> nodes in <a.el2> and <b.el1>.
>
> Assuming Saxon is right, this makes me wonder: is my thinking based on 
> an incorrect implementation, or *is* there a way to make use of 
> variable references for filtering down nodesets in location steps?
>
> Any help much appreciated!
>
> Kind regards,
>
> Ron Van den Branden
>