chore(pelias_parser): add logging for chosen parser solution scores #1381

missinglink · 2019-11-08T10:58:36Z

While writing up pelias/parser#76 today I noticed that the 'scoring' in the Pelias parser is pretty good: it's fairly indicative of a low-quality parse, and you'll see scores < 0.5 next to a lot of incorrect parses 🎉

So this PR is pretty simple: it logs the solution scores so that we can graph them and get a better idea of how the parser is doing in an unpredictable production environment.

example log line:

info: [api] pelias_parser_solution score=0.44

note: if there is a parsing failure the score will be 0.00

Joxit · 2019-11-08T11:23:33Z

sanitizer/_text_pelias_parser.js

@@ -95,6 +95,11 @@ function parse (clean) {
  // '    VVVV NN SSSSSSS AAAAAA PPPPP      '
  let mask = solution.mask(t);

+  // log information about the selected solution
+  logger.info('pelias_parser_solution', {
+    score: solution.score.toFixed(2)


It would be interesting to have the input here too.

missinglink · 2019-11-08

The input is already being logged at api/sanitizer/_text_pelias_parser.js, line 66 in ef2969b:

params: clean,

I decided to split up those log sections so the pelias_parser log entry provides an overview of the parse and then we can have one or more pelias_parser_solution lines which log info about specific solutions.

At this stage we only pick the first solution, but we could expand this later. If we end up with multiple pelias_parser_solution log lines, it'll be cleaner to log the input only once, since it doesn't change.
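To make the split concrete, here is a rough sketch of one overview entry followed by one entry per solution. The in-memory `logger`, the `logParse` helper, the sample input and the second score are all hypothetical stand-ins for illustration, not the actual API code:

```javascript
// Hypothetical stand-in for the API logger: it only collects entries in memory.
const lines = [];
const logger = {
  info: (message, fields) => lines.push({ message, fields })
};

// Hypothetical helper: log the input once in an overview entry,
// then emit one pelias_parser_solution entry per solution.
function logParse(clean, solutions) {
  logger.info('pelias_parser', { params: clean });
  solutions.forEach((solution) => {
    logger.info('pelias_parser_solution', {
      score: solution.score.toFixed(2)
    });
  });
}

// Made-up input and scores, purely for illustration.
logParse({ text: 'example input' }, [{ score: 0.44 }, { score: 0.31 }]);
```

With this shape the input appears in exactly one entry, however many solution entries follow it.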

Joxit · 2019-11-08

This was to have a simple way to analyze the logs. For example, we might want to quickly see which queries have a low score, together with the associated input, all at once (using Kibana or whatever). Unless it is possible to link log lines? 🤔

orangejulius · 2019-11-08T14:25:14Z

It sounds like the purpose of this code is for manual investigation into a single query at a time, rather than providing data that we could analyze across many requests.

In that case, it might be better as part of the debug output, since it would then be alongside all the other context from a request.

If we are trying to log something for later analysis, we'll need to do as @Joxit suggests and associate the solution score with a bunch of other data like the input text, or it won't be very useful.
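As a minimal sketch of that second option, assuming we wanted each entry to be analyzable on its own, the score could travel together with the input text. The `solutionLogFields` and `lowScoring` helpers, the field names and the sample inputs below are hypothetical, not part of this PR:

```javascript
// Hypothetical record builder: the score and the input text share one entry.
function solutionLogFields(text, solution) {
  // No solution means the parse failed; the PR description says the score is 0.00 then.
  const score = solution ? solution.score : 0;
  return { text, score: score.toFixed(2) };
}

// Hypothetical analysis step: keep only entries whose score is below a threshold.
function lowScoring(entries, threshold = 0.5) {
  return entries.filter((entry) => parseFloat(entry.score) < threshold);
}

// Made-up inputs and scores, purely for illustration.
const entries = [
  solutionLogFields('example input a', { score: 0.44 }),
  solutionLogFields('example input b', { score: 0.92 }),
  solutionLogFields('example input c', null)
];

const suspicious = lowScoring(entries);
```

Because every entry carries its own input, a low-score filter needs no join against another log line.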

chore(pelias_parser): add logging for chosen parser solution scores

ef2969b

Joxit reviewed Nov 8, 2019
