feat apilinks.json generator #153

flakey5 · 2024-11-29T07:59:02Z

Closes #152

~~Opening this as a draft currently to get feedback on the approach since it's a bit non-trivial relative to the other generators.~~ The apilinks.json file maps each module's exports to their source locations on GitHub.

Example:

{
  "SomeClass.prototype.var": "github.com/nodejs/node/tree/<hash>/lib/file.js#L100"
}

This means we need to parse the module's JavaScript source in addition to its Markdown.
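
To make the target shape concrete, here is a minimal sketch of how one entry of the map could be assembled once an export's file and line are known. The helper names (`buildSourceLink`, `buildApiLinks`), the record shape, and the `abc123` hash are hypothetical and only illustrate the example above; they are not taken from the PR.

```javascript
// Hypothetical helper: builds the value for one apilinks.json entry.
// `hash`, `file`, and `line` are illustrative inputs, not names from the PR.
function buildSourceLink(hash, file, line) {
  return `github.com/nodejs/node/tree/${hash}/${file}#L${line}`;
}

// Collects { exportedName: link } pairs from a list of export records.
function buildApiLinks(hash, exportRecords) {
  const links = {};
  for (const { name, file, line } of exportRecords) {
    links[name] = buildSourceLink(hash, file, line);
  }
  return links;
}

// Placeholder hash and a single made-up export record.
const apiLinks = buildApiLinks('abc123', [
  { name: 'SomeClass.prototype.var', file: 'lib/file.js', line: 100 },
]);

console.log(JSON.stringify(apiLinks, null, 2));
```

The line number is the part that requires parsing the JavaScript source, since the Markdown docs do not carry it.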

What the current approach does:

- Adds another loading & parsing step for the JS source files
  - acorn is used for parsing the source files
  - This takes .js files as input, so to run it you need to pass in node/lib/*.js
  - ~~This is dependent on the Markdown source parsing since it uses the source_link metadata in the docs~~
- Exposes the parsed JS AST to other generators by adding the ast-js generator
- The api-links generator is based on the ast-js result

src/utils/git.mjs

src/parser.mjs

ovflowd · 2024-12-19T22:53:11Z

@flakey5 do you need review here? I think I forgot to review this PR!

flakey5 · 2024-12-20T06:35:01Z

> do you need review here

As far as the approach goes, yes please.

ovflowd · 2024-12-21T23:20:43Z

bin/cli.mjs

+    .filter(path => path !== undefined && path.endsWith('.js'))
+);
+
+const parsedJsFiles = await parseJsSources(sourceFiles);


Does this process need to be blocking? Do we need to parse all these sources beforehand rather than on demand?

We don't need to; this is just consistent with how we parse the Markdown sources. We could definitely just store the paths of the sources and parse them in the generator instead. That would lower the amount of memory that stays allocated for the lifetime of the program.
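
A rough sketch of the "store the paths, parse in the generator" idea, assuming a generator function is an acceptable way to hand files over one at a time. `parseOnDemand` and `fakeParse` are hypothetical names; the parser is injected so the sketch has no dependencies, and in the PR it would presumably be the acorn-based step.

```javascript
// Hypothetical sketch: keep only the paths up front and parse each file
// when the consumer asks for it, so at most one AST needs to be live at a time.
function* parseOnDemand(filePaths, parse) {
  for (const path of filePaths) {
    yield { path, ast: parse(path) };
  }
}

// Stand-in parser that only counts calls; a real one would read and parse the file.
let parseCalls = 0;
const fakeParse = path => {
  parseCalls += 1;
  return { type: 'Program', path };
};

const sources = parseOnDemand(['lib/a.js', 'lib/b.js'], fakeParse);

// Nothing has been parsed yet; the first next() parses exactly one file.
const first = sources.next().value;
console.log(first.path, parseCalls); // prints "lib/a.js 1"
```

The trade-off is that every generator needing the JS AST would trigger its own parse unless results are cached, which brings the memory cost back.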

bin/cli.mjs

src/generators.mjs

src/generators/api-links/index.mjs

ovflowd · 2024-12-21T23:28:47Z

src/generators/api-links/utils/extractExports.mjs

@@ -0,0 +1,215 @@
+// @ts-check


What is this comment?

Temporary comment just enabling type checking in the file

src/generators/api-links/utils/extractExports.mjs

ovflowd · 2024-12-21T23:34:14Z

src/generators/api-links/utils/extractExports.mjs

I wonder how expensive these functions are, as it seems that this can quickly get bloated depending on the size of source files.

BTW, could you use https://unifiedjs.com/explore/package/estree-util-visit/ to walk/visit the nodes? This is what I was referring to, instead of plain forEaches and while loops.

Also, it might be worth moving the "estree" logic to a dedicated "estree parser", IDK.

Since we have a parser for the Markdown files, having one just for Estree makes sense I guess.

How about having a createMarkdownLoader and createJsLoader? Then the same with createMarkdownParser and createJsParser
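
One possible shape for that split, as a sketch only: a shared factory that differs by file extension. `createLoader` and the injected `read` function are assumptions made for illustration; the actual loader in this PR reads files with `readFile` and wraps them in `VFile`, which is left out here to keep the example dependency-free.

```javascript
// Hypothetical shared factory: each loader only accepts its own file extension.
// `read` is injected (assumption) so this sketch does not touch the file system.
const createLoader = extension => read => ({
  load: filePaths =>
    filePaths
      .filter(path => path.endsWith(extension))
      .map(path => ({ path, value: read(path) })),
});

const createMarkdownLoader = createLoader('.md');
const createJsLoader = createLoader('.js');

// Stand-in reader returning fake contents.
const fakeRead = path => `contents of ${path}`;

const jsFiles = createJsLoader(fakeRead).load(['lib/fs.js', 'doc/api/fs.md']);
const mdFiles = createMarkdownLoader(fakeRead).load(['lib/fs.js', 'doc/api/fs.md']);

console.log(jsFiles.map(file => file.path), mdFiles.map(file => file.path));
```

The same pattern would apply to createMarkdownParser and createJsParser, with the parse function swapped per source type.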

src/loader.mjs

ovflowd · 2024-12-21T23:36:40Z

src/loader.mjs

+    return filePaths.map(async filePath => {
+      const fileContents = await readFile(filePath, 'utf-8');
+
+      return new VFile({ path: filePath, value: fileContents });


I am a bit concerned about the memory allocation this will cause; some of the source files can be humongous. Have you compared heap sizes with and without this change?
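
For a quick comparison, something along these lines could be used. It is a sketch built on Node's `process.memoryUsage()`; `measureHeapDelta` is a hypothetical helper and the workload is a stand-in for loading the source files. The numbers are noisy because garbage collection can run between the two samples, so treat them as rough indications only.

```javascript
// Hypothetical helper: samples heapUsed before and after running `work`.
// The delta is approximate, since GC may run at any point in between.
function measureHeapDelta(work) {
  const before = process.memoryUsage().heapUsed;
  const result = work();
  const after = process.memoryUsage().heapUsed;
  return { result, deltaBytes: after - before };
}

// Stand-in workload: keep 1000 strings of roughly 1 KB each in memory.
const { result, deltaBytes } = measureHeapDelta(() =>
  Array.from({ length: 1000 }, (_, i) => 'x'.repeat(1000) + i)
);

console.log(
  `retained ${result.length} strings, heap delta ~${(deltaBytes / 1024).toFixed(1)} KiB`
);
```

Running once with the eager loader and once with on-demand parsing, against the same node/lib/*.js input, would give the comparison asked for here.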

src/parser.mjs

src/utils/git.mjs

ovflowd

The approach in general sounds good. However, some of these utilities are far too complex and large, and the git manipulation scripts are possibly dangerous, or at least an easy source of vulnerabilities.

ovflowd · 2025-01-06T16:31:01Z

Also, @flakey5, https://github.com/syntax-tree/estree-util-visit is what I was suggesting you use to walk through the nodes of the source code, since acorn generates an AST that is compatible with estree.
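
To illustrate the visitor style being suggested, here is a dependency-free sketch. The `walk` function is a hand-rolled stand-in, not the estree-util-visit API, and the AST is written by hand in ESTree shape rather than produced by acorn; both are assumptions made so the example runs on its own.

```javascript
// Minimal depth-first walker over ESTree-shaped nodes (objects with a string `type`).
// Hand-rolled stand-in for a real visitor utility.
function walk(node, visitor) {
  if (!node || typeof node.type !== 'string') return;
  visitor(node);
  for (const value of Object.values(node)) {
    if (Array.isArray(value)) {
      value.forEach(child => walk(child, visitor));
    } else if (value && typeof value === 'object') {
      walk(value, visitor);
    }
  }
}

// Hand-written AST for:  function foo() {}  /  class Bar {}
const program = {
  type: 'Program',
  body: [
    {
      type: 'FunctionDeclaration',
      id: { type: 'Identifier', name: 'foo' },
      params: [],
      body: { type: 'BlockStatement', body: [] },
      loc: { start: { line: 1 } },
    },
    {
      type: 'ClassDeclaration',
      id: { type: 'Identifier', name: 'Bar' },
      superClass: null,
      body: { type: 'ClassBody', body: [] },
      loc: { start: { line: 2 } },
    },
  ],
};

// Record the line on which each function or class is declared.
const declared = {};
walk(program, node => {
  if (node.type === 'FunctionDeclaration' || node.type === 'ClassDeclaration') {
    declared[node.id.name] = node.loc.start.line;
  }
});

console.log(declared); // { foo: 1, Bar: 2 }
```

A single visitor callback keyed on node type replaces the nested forEach and while loops, which is the readability gain being asked for.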

Closes #152 Signed-off-by: flakey5 <[email protected]>

Co-authored-by: Claudio W <[email protected]>

Signed-off-by: flakey5 <[email protected]>

flakey5 requested a review from a team as a code owner November 29, 2024 07:59

flakey5 marked this pull request as draft November 29, 2024 07:59

github-advanced-security bot found potential problems Nov 29, 2024

src/utils/git.mjs (four findings, all marked fixed)

AugustinMauroy reviewed Nov 29, 2024

src/utils/git.mjs (outdated, resolved)

AugustinMauroy reviewed Nov 29, 2024

src/parser.mjs (outdated, resolved)