fix(lexer): handle regex and escaped delimiters #605
Conversation
Gentle bump: Are there any blockers for this bugfix (something I could/should address)?
// readRegex reads a regex pattern from the input, handling escaped delimiters.
// /foo\/bar/ => Token(foo\/bar).
func (l *Lexer) readRegex(endChar byte) string {
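The function body is elided in this excerpt. Below is a minimal sketch of the approach the doc comment describes (an escaped delimiter is kept verbatim in the returned pattern), assuming a hand-rolled lexer with `ch`/`peekChar`/`readChar` helpers; the scaffold and these names are stand-ins, not the project's verified API, and the merged code may differ:

```go
package lexer

import "strings"

// Stand-in Lexer scaffold for illustration only.
type Lexer struct {
	input string
	pos   int
	ch    byte
}

func newLexer(input string) *Lexer {
	l := &Lexer{input: input}
	l.readChar()
	return l
}

func (l *Lexer) readChar() {
	if l.pos >= len(l.input) {
		l.ch = 0
	} else {
		l.ch = l.input[l.pos]
	}
	l.pos++
}

func (l *Lexer) peekChar() byte {
	if l.pos >= len(l.input) {
		return 0
	}
	return l.input[l.pos]
}

// readRegex reads until an unescaped endChar, keeping "\<endChar>" pairs
// verbatim: /foo\/bar/ => "foo\/bar".
func (l *Lexer) readRegex(endChar byte) string {
	var sb strings.Builder
	for l.ch != endChar && l.ch != 0 {
		if l.ch == '\\' && l.peekChar() == endChar {
			sb.WriteByte(l.ch) // keep the backslash
			l.readChar()       // step onto the escaped delimiter
		}
		sb.WriteByte(l.ch)
		l.readChar()
	}
	return sb.String()
}
```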
hey @desertwitch thanks for the PR!!
I think we may need to improve this logic a bit. This function handles \/ escapes, but regexes can have other escapes like \\, \n, \d, etc. The current implementation would incorrectly consume \\, so \\/ becomes \/ in the literal.
The pattern /foo\\/ (ending with a literal backslash) may not parse correctly, since \\ followed by / would be consumed as an escaped delimiter.
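One way to address this (a hypothetical variant, not necessarily what was merged in #678) is to copy any backslash escape through as a two-character unit, so a trailing \\ is never mistaken for an escaped delimiter. Reusing the stand-in Lexer from the sketch above:

```go
// Hypothetical variant: every "\x" pair passes through as a unit, so
// /foo\\/ yields "foo\\" and the closing '/' still ends the pattern,
// while /foo\/bar/ still yields "foo\/bar".
func (l *Lexer) readRegex(endChar byte) string {
	var sb strings.Builder
	for l.ch != endChar && l.ch != 0 {
		if l.ch == '\\' && l.peekChar() != 0 {
			sb.WriteByte(l.ch) // keep the backslash...
			l.readChar()       // ...and whatever it escapes (\\, \d, \/, ...)
		}
		sb.WriteByte(l.ch)
		l.readChar()
	}
	return sb.String()
}
```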
Seems logical; not sure why I didn't consider this at the time. I'm unfortunately rather limited in bandwidth these days, but feel free to shuffle things around or close/rework elsewhere as needed.
Edit: Now I remember. The lexer's job, as I saw it, is just to extract the literal regex pattern between the delimiters, and therefore to handle only the escapes relevant to the lexer's own syntax (escaped delimiters), not regex-specific escapes (those are handled downstream by the actual regex parsing function, which relies on the inner escapes still being there).
{token.AT, "@"},
{token.NUMBER, "1"},
{token.MINUTES, "m"},
{token.REGEX, "foo\\/bar"},
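For context, a table-driven test around these expectations might look like the sketch below. The input string, New, NextToken, and the field names are reconstructions from this excerpt rather than the project's actual test harness, and the REGEX literal reflects the PR's keep-the-backslash behavior that is questioned just below:

```go
func TestLexRegexEscapedDelimiter(t *testing.T) {
	input := `@1m /foo\/bar/` // input reconstructed from the expected tokens
	tests := []struct {
		wantType    token.Type
		wantLiteral string
	}{
		{token.AT, "@"},
		{token.NUMBER, "1"},
		{token.MINUTES, "m"},
		{token.REGEX, `foo\/bar`}, // backslash preserved, per the PR
	}
	l := New(input)
	for i, tt := range tests {
		tok := l.NextToken()
		if tok.Type != tt.wantType || tok.Literal != tt.wantLiteral {
			t.Fatalf("test %d: got (%v, %q), want (%v, %q)",
				i, tok.Type, tok.Literal, tt.wantType, tt.wantLiteral)
		}
	}
}
```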
This test expects "foo\/bar" but the actual string stored should be "foo/bar" after consuming the escape.
Are you sure about this? The lexer's job, as I saw it, is just to extract the literal regex pattern between the delimiters, handling only the escapes relevant to the lexer's own syntax (escaped delimiters), not regex-specific escapes. That, I think, is also why I only handled escaped delimiters: other escapes end up in the pattern string unchanged, to be handled as required during actual regex parsing.
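To illustrate the two-stage view described here: the lexer hands over the raw pattern, backslash included, and the downstream regex machinery interprets the inner escapes. Using the standard library regexp package purely as a stand-in for the project's own regex handling, a leftover `\/` is still a valid escape for a literal `/`:

```go
package main

import (
	"fmt"
	"regexp"
)

func main() {
	// Raw pattern as the lexer would emit it for /foo\/bar/ under the
	// keep-the-backslash reading.
	raw := `foo\/bar`
	re := regexp.MustCompile(raw)          // stand-in for downstream regex parsing
	fmt.Println(re.MatchString("foo/bar")) // true: `\/` matches a literal '/'
}
```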
Merged changes in #678. Thank you @desertwitch 🙏
fixes #592
- the regex was parsed by a string method that did not consider escaped delimiters
- a regex sequence with an escaped delimiter would be cut short (see the issue and the sketch below)
- the lexing logic for parsing a regex was moved into a separate method
- a regex sequence with an escaped delimiter is now handled properly
- test cases were added to make this behavior visible and testable in the future
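As a concrete illustration of the pre-fix behavior described above (the exact original code path is an assumption reconstructed from "parsed by a string method", not copied from the issue):

```go
package main

import (
	"fmt"
	"strings"
)

func main() {
	// A naive scan to the next '/' truncates the pattern at the escaped
	// delimiter; the dedicated readRegex method keeps reading past it.
	body := `foo\/bar/ remaining input`
	naive := body[:strings.Index(body, "/")]
	fmt.Println(naive) // foo\  -- cut short, instead of foo\/bar
}
```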