Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Question about differences between localIds in ELM and annotations #1417

Open
nmorasb opened this issue Sep 18, 2024 · 5 comments
Open

Question about differences between localIds in ELM and annotations #1417

nmorasb opened this issue Sep 18, 2024 · 5 comments

Comments

@nmorasb
Copy link

nmorasb commented Sep 18, 2024

Hi! We're hoping to get a little clarification on why not all localIds appear in the annotations.
In MADiE, we rely on the annotations to re-construct CQL for our execution highlighting feature (for QDM). We finally got around to incorporating a newer version of cqframework and cql-to-elm, and noticed that there are now many more localIds in the ELM than previously. However, only a portion of those appear in the annotations.

For example, for this group:
 

{
  "signature": [],
  "resultTypeName": "{urn:hl7-org:elm-types:r1}Boolean",
  "type": "Not",
  "localId": "398",
  "locator": "30:13-30:45",
  "operand": {
    "signature": [],
    "resultTypeName": "{urn:hl7-org:elm-types:r1}Boolean",
    "type": "IsNull",
    "localId": "397",
    "locator": "30:13-30:45",
    "operand": {
      "path": "result",
      "resultTypeSpecifier": {
        "type": "ChoiceTypeSpecifier",
        "choice": [...],
        "localId": "389"
      },
      "scope": "GestationalAge",
      "type": "Property",
      "localId": "380",
      "locator": "30:13-30:33"
    }
  }
}

MADiE looks at the associated annotations:

{
  "r": "398",
  "s": [
    {
      "r": "380",
      "s": [
        {
          "r": "379",
          "s": [
            {
              "value": [
                "GestationalAge"
              ]
            }
          ]
        },
        {
          "value": [
            "."
          ]
        },
        {
          "r": "380",
          "s": [
            {
              "value": [
                "result"
              ]
            }
          ]
        }
      ]
    },
    {
      "value": [
        " is not null"
      ]
    }
  ]
}

Because localId 397 does not appear in the annotations, the statement associated with it is identified by the parent localId. When generating our coverage highlighting and comparing against the covered/executed clauses (identified by localId) from the execution output, we're ending up with a handful of uncovered clauses that can't be identified within our highlighting.

So, we'd like to request an enhancement (if feasible) to have the annotations updated so clauses are broken down to the same granular level to include all localIds, or we'd appreciate some feedback on why that may not be possible.
Thanks!

@nmorasb nmorasb changed the title Question about differences between localIds in Question about differences between localIds in ELM and annotations Sep 18, 2024
@brynrhodes
Copy link
Member

There isn't a 1:1 correspondence between ELM nodes and annotation nodes, the chunks are collapsed. So although not every ELM node will have an entry in the annotation, it should be the case that the annotation nodes cover all the ELM nodes in the chunks. The parent localId 398 in this case is the chunk, and so "includes" the operand localId 397. Does that work to indicate coverage?

@jkotanchik-SB
Copy link

Thanks for taking a look @brynrhodes! Sounds like the Annotations are functioning as intended and adjustments should be made in cqm-execution for overly funky cases like this one. We will move forward with making adjustments to cqm-execution.

Here's some additional background on the specific scenario that resulted in this Issue:

The parent localId 398 in this case is the chunk, and so "includes" the operand localId 397. Does that work to indicate coverage?

Not completely. Cqm-execution considers most clauses with a localId property to be a unique clause. Because 398 ("Not") and 397 ("IsNull") both have local Ids, cqm-execution considers both independently when calculating coverage. Hence, Test Case data that results in the target value being not null will count as 398 being covered, but not 397.

The Annotations have "IsNull" (397) collapsed under "Not" (398), so when the "Not" logic is covered, 397 and 398 are highlighted. There is no clear indication in our highlighting that "IsNull" (397) has not been covered except for the <100% coverage value. Adding another test case will complete the coverage, but doing so feels awkward, especially since highlighting will be unchanged.

@brynrhodes
Copy link
Member

I see, that makes sense, and yes, that's isn't ideal. Is there a listing of which clauses Cqm-execution does consider unique clauses? Or a principle that we could use to base that decision on?

@hossenlopp
Copy link
Contributor

We have been looking at this same issue from the fqm-execution side of things. Our current plan to resolve this coverage issue is to calculate coverage by only using localIds from the statement expression tree that are also found in the annotations. There has always been some disparity between the annotations and the expression tree. Only using the annotation localIds completely resolves the issue of having invisible coverage deficiencies. Our rationale is that only the visible CQL is able to be acted upon by the tester, and they shouldn’t be docked for something they cannot see.

@JPercival
Copy link
Contributor

Our rationale is that only the visible CQL is able to be acted upon by the tester, and they shouldn’t be docked for something they cannot see.

Yes, and part of what the CQL compiler does is rewrites of the CQL into a more efficient ELM (from a computation perspective) version. This is the same as other compilers, like C or Java or so on. There's not necessarily a 1:1 correspondence between the input CQL and the output ELM. The emitted ELM just needs to be semantically equivalent to the input CQL. Historically, this behavior has been pretty stable but more recent versions of the compiler are getting more intelligent about the types of rewrites they support. Two other related behaves are:

  1. The emission of As ELM nodes in queries/retrieves. The As effectively functions as a runtime check that the data returned in the query is of the correct type. The is similar to the behavior in Java if you do a cast: var x = (SomeType)someValue. Java inserts a runtime check that the value can be cast to the correct type.

  2. The targetMapping for QICore and related data models. The compiler emits some additional code under the covers to make the mapping to FHIR transparent to a CQL author.

All that said, we don't have good documentation about how the localIds correspond between the CQL and ELM, and we need to provide that.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

5 participants