Doc comments #2 by resolritter · Pull Request #128 · tree-sitter/tree-sitter-rust

resolritter · 2022-01-10T22:37:37Z

I suspect the main problem with #99 is what is pointed out in #99 (comment). Here I tried to fix that problem and also add some recursion guard tests just in case. EDIT: Some other cases were not being handled properly in #99 so I've included them and also added more tests.

This is being submitted as Draft so that @maxbrunsfeld can run the "randomized tests". I don't know how to run them.

#99 (comment) might be relevant and invalidate this approach altogether.

resolritter · 2022-01-10T22:49:22Z

+      // Might have already processed a doc comment in the loop above
+      if (lexer->result_symbol != DOC_COMMENT) {
+        lexer->result_symbol = LINE_COMMENT;
+        while (lexer->lookahead != '\n' && lexer->lookahead != 0) {


Maybe worth mentioning that Rust requires lines to end with \n according to https://doc.rust-lang.org/reference/tokens.html#string-literals, thus \r doesn't need to be checked for those loops.

That seems to be only for string literals though? I've certainly had .rs files with CRLF endings before

https://doc.rust-lang.org/reference/comments.html#doc-comments

Isolated CRs (\r), i.e. not followed by LF (\n), are not allowed in doc comments.

So it will never be standalone, but can be a part of a line ending.

I've certainly had .rs files with CRLF endings before

What I am trying to imply by that explanation is that even in the case of CRLF the line will still end with \r\n, not \r alone, therefore checking only for \n should detect all line endings.

mkrasnitski · 2022-08-18T01:08:24Z

@resolritter any updates on this? This (and also code highlighting in docstring examples) are places where rust.vim currently has better (or rather, any) highlighting.

cormacrelf · 2022-12-04T04:10:38Z

I'm having a look at this in the hope that we can inject a markdown parser in doc comments. This is obviously a great start, but from what I can tell it's not quite optimal for that scenario. I think we want:

Multiple successive doc comments contained, as they are here, in one doc_comment node, so editors can deal with successive doc comments as a group and for faster highlighting.
However even when coalesced, each /// line or /** */ block goes into a child node of doc_comment.
Each child node has the leading \s*///\s* or \s*//!\s* or \s*\*\s* skipped as if it's whitespace using lexer->advance(true);.
The */ line uses lexer->mark_end() to strip that as well.

That way, when you pass the contents of doc_comment to a injected markdown parser, it doesn't have all the comment symbol noise and mess up the markdown parser. It would be much nicer if the lexer could skip the start of each line using advance(true), but it appears that this skips all the previous lines whenever you call it, i.e. the TS lever API assumes whitespace is only skippable at the start and end of tokens.

mkrasnitski · 2022-12-04T05:29:51Z

Bumping, as it would be great to finally get some more activity on this. How would this idea interact with code examples? It'd be great if writing examples could be as seemless as writing normal rust code, where each subsequent line gets a /// auto-inserted. Having plugins like vim-closer/auto-pairs work in that context would also be amazing.

archseer · 2022-12-04T07:19:23Z

@mkrasnitski Maybe you could pick up the PR and finish the implementation?

@cormacrelf We've solved this for other grammars by using combined injections, @the-mikedavis explains it here helix-editor/helix#4870 (comment)

mkrasnitski · 2022-12-04T07:23:16Z

I'm just a casual observer in this case - I don't yet feel qualified enough to pick this up myself, as I have really no familiarity with tree sitter internals. This is just one place where I feel tree sitter is not at feature parity with the official rust.vim plugin, and so any forward progress gets me excited.

cormacrelf · 2022-12-04T08:48:12Z

The combined injections trick works. Thanks for the tip @archseer! Will PR separately, based on this PR (to handle the more-than-three-/ case, which Erlang doesn't have to worry about).

The only thing I can't do is get Neovim to use Rust highlights for fenced code blocks without an explicit language tag. Tangentially I think you could do that by adding a feature to nvim-treesitter where you can create an alias for the markdown parser, but with separate query files (that will most likely use ; inherits: markdown). It can currently do aliases but they don't get their own query files. This thin filetype would be called rustdoc, used just for injections.

resolritter · 2022-12-10T22:15:28Z

I will not go forward with this PR. Feel to continue the work in a new PR.

resolritter commented Jan 10, 2022

View reviewed changes

resolritter force-pushed the doc_comments branch from 72001be to 4762c74 Compare January 10, 2022 23:13

resolritter mentioned this pull request Jan 10, 2022

Designate doc comments #99

Merged

resolritter added 2 commits January 10, 2022 20:46

designate doc comments

4e012c3

regenerate parser

61190d9

resolritter force-pushed the doc_comments branch from 4762c74 to 61190d9 Compare January 10, 2022 23:47

archseer mentioned this pull request May 6, 2022

Update to newest tree-sitter-rust helix-editor/helix#1473

Closed

archseer mentioned this pull request Sep 7, 2022

LSP syntax highlighting helix-editor/helix#814

Closed

cormacrelf mentioned this pull request Dec 4, 2022

Separate regular comments from Doc Strings #147

Closed

the-mikedavis mentioned this pull request Dec 4, 2022

Parsing Documentation Strings Line By Line elixir-lang/tree-sitter-elixir#46

Open

resolritter closed this Dec 10, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Doc comments #2#128

Doc comments #2#128
resolritter wants to merge 2 commits intotree-sitter:masterfrom
resolritter:doc_comments

resolritter commented Jan 10, 2022 •

edited

Loading

Uh oh!

resolritter Jan 10, 2022

Uh oh!

archseer Jan 22, 2022

Uh oh!

archseer Jan 22, 2022

Uh oh!

resolritter Jan 24, 2022

Uh oh!

mkrasnitski commented Aug 18, 2022

Uh oh!

cormacrelf commented Dec 4, 2022 •

edited

Loading

Uh oh!

mkrasnitski commented Dec 4, 2022

Uh oh!

archseer commented Dec 4, 2022 •

edited

Loading

Uh oh!

mkrasnitski commented Dec 4, 2022

Uh oh!

cormacrelf commented Dec 4, 2022

Uh oh!

resolritter commented Dec 10, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

Conversation

resolritter commented Jan 10, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

resolritter Jan 10, 2022

Choose a reason for hiding this comment

Uh oh!

archseer Jan 22, 2022

Choose a reason for hiding this comment

Uh oh!

archseer Jan 22, 2022

Choose a reason for hiding this comment

Uh oh!

resolritter Jan 24, 2022

Choose a reason for hiding this comment

Uh oh!

mkrasnitski commented Aug 18, 2022

Uh oh!

cormacrelf commented Dec 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mkrasnitski commented Dec 4, 2022

Uh oh!

archseer commented Dec 4, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

mkrasnitski commented Dec 4, 2022

Uh oh!

cormacrelf commented Dec 4, 2022

Uh oh!

resolritter commented Dec 10, 2022

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

resolritter commented Jan 10, 2022 •

edited

Loading

cormacrelf commented Dec 4, 2022 •

edited

Loading

archseer commented Dec 4, 2022 •

edited

Loading