grass.tools: Add API and CLI to access tools without a session #5843

wenzeslaus · 2025-06-05T13:22:32Z

Building on top of #2923 (not merged), this adds functionality that allows "packed" native GRASS rasters to be used as tool parameters on the command line:

grass run r.slope.aspect elevation=~/data/elevation.pack slope=~/data/slope.pack
grass run r.univar map=~/data/slope.pack

The above syntax is not actually implemented, but the code below works:

export PYTHONPATH=$(grass --config python_path)
python -m grass.app run r.slope.aspect elevation=.../elevation.pack slope=.../slope.pack
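How a run subcommand might split its arguments into a tool name and parameters can be sketched as follows. This is only an illustration, not the code in grass.app: the function name parse_run_args is hypothetical, and it assumes that every parameter is given as key=value and that anything starting with a dash is a flag.

```python
def parse_run_args(args):
    """Split a list such as ['r.univar', 'map=slope.pack'] into its parts.

    Hypothetical helper for illustration; it is not part of GRASS.
    """
    if not args:
        raise ValueError("A tool name is required")
    name, *rest = args
    params = {}
    flags = []
    for item in rest:
        if item.startswith("-"):
            flags.append(item)
        elif "=" in item:
            # Split on the first '=' only, so values may contain '='.
            key, value = item.split("=", 1)
            params[key] = value
        else:
            raise ValueError(f"Expected key=value or a flag, got: {item}")
    return name, params, flags


name, params, flags = parse_run_args(
    ["r.slope.aspect", "elevation=elevation.pack", "slope=slope.pack"]
)
print(name, params, flags)
```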

The same functionality is also available from Python where it copies the syntax of plain Tools from #2923:

from grass.tools import StandaloneTools

tools = StandaloneTools()
tools.r_slope_aspect(elevation="elevation.pack", slope="slope.pack", aspect="aspect.pack")
print(f"Mean slope: {tools.r_univar(map='slope.pack')['mean']}")

The above syntax does not fully work, but the following one does:

tools.run("r.slope.aspect", elevation="elevation.pack", slope="slope.pack")
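The relation between the method-style call and the run-style call can be sketched with a toy class. This is not the StandaloneTools implementation: ToyTools is a hypothetical name, it only builds the command as a list of strings and never runs GRASS, and it assumes that replacing underscores with dots is enough to get the tool name.

```python
class ToyTools:
    """Toy stand-in that only builds the command; it never runs GRASS."""

    def run(self, name, /, **kwargs):
        # Build the list form of the command, e.g. ['r.univar', 'map=slope.pack'].
        # The tool name is positional-only so that a tool parameter called
        # 'name' can still be passed as a keyword argument.
        return [name] + [f"{key}={value}" for key, value in kwargs.items()]

    def __getattr__(self, name):
        # Called only when normal attribute lookup fails, so run() is unaffected.
        if name.startswith("_"):
            raise AttributeError(name)
        tool_name = name.replace("_", ".")

        def wrapper(**kwargs):
            return self.run(tool_name, **kwargs)

        return wrapper


tools = ToyTools()
command = tools.r_slope_aspect(elevation="elevation.pack", slope="slope.pack")
print(command)
```

Both call styles end up in the same run method here, which is one way to keep the behavior of the two consistent.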

This PR is not meant for merging as is, but currently represents a final combination of all different features proposed. See discussion #5830 for details.

This adds a Tools class which allows GRASS tools (modules) to be accessed using methods. Once an instance is created, calling a tool is calling a function (method), similarly to grass.jupyter.Map. Unlike grass.script, this does not require a generic function name, and unlike the grass.pygrass module shortcuts, this does not require special objects to mimic the module families. Outputs are handled through a returned object, which is the result of automatic capture of outputs and can do conversions from known formats using properties. A usage example is in the _test() function in the file. The code is included under the new grass.experimental package, which allows merging the code even when further breaking changes are anticipated.
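The idea of a returned object that captures the output and converts it through properties can be sketched as follows. This is a minimal illustration under assumptions: ToyResult, keyval, and _convert are hypothetical names, and the sample text is made up rather than real tool output.

```python
def _convert(value):
    """Convert a string to int or float when possible, else return it as is."""
    for cast in (int, float):
        try:
            return cast(value)
        except ValueError:
            continue
    return value


class ToyResult:
    """Toy result object holding the captured text output of a tool."""

    def __init__(self, stdout):
        self.stdout = stdout

    @property
    def keyval(self):
        """Parse key=value lines into a dictionary with converted values."""
        result = {}
        for line in self.stdout.splitlines():
            if "=" not in line:
                continue
            key, value = line.split("=", 1)
            result[key.strip()] = _convert(value.strip())
        return result

    def __getitem__(self, key):
        # Allows result['mean'] as a shortcut for result.keyval['mean'].
        return self.keyval[key]


result = ToyResult("n=6\nmean=2.5\nname=slope\n")
print(result["mean"])
```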


wenzeslaus · 2025-06-11T20:27:33Z

Use of NumPy array IO with the standalone tools API

The combination of NumPy array IO (from #5878) with the standalone tools API (from #5843, this PR) allows using tools with NumPy arrays without a project:

import numpy as np

from grass.experimental.standalone_tools import StandaloneTools

tools = StandaloneTools()
slope = tools.r_slope_aspect(elevation=np.ones((2, 3)), slope=np.ndarray)

Complications with computational region

With how the StandaloneTools are implemented now, the following will fail because the initially set region will be incompatible with the array size in the second call (see option 1 on the region comment above):

import numpy as np

from grass.experimental.standalone_tools import StandaloneTools

tools = StandaloneTools()
slope1 = tools.r_slope_aspect(elevation=np.ones((2, 3)), slope=np.ndarray)
slope2 = tools.r_slope_aspect(elevation=np.ones((5, 5)), slope=np.ndarray)

One way to avoid this is to provide a parameter to StandaloneTools, such as StandaloneTools(refresh_region=True). Another way is to use multiple instances:

import numpy as np

from grass.experimental.standalone_tools import StandaloneTools

slope1 = StandaloneTools().r_slope_aspect(elevation=np.ones((2, 3)), slope=np.ndarray)
slope2 = StandaloneTools().r_slope_aspect(elevation=np.ones((5, 5)), slope=np.ndarray)

Making multiple calls, each on an instance that is immediately discarded, does not look great.
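The difference between a region fixed by the first call and a region refreshed on every call can be modeled without GRASS. The sketch below is a toy: ToyRegionTools and its process method are hypothetical, array shapes are plain (rows, cols) tuples, and the only behavior taken from the discussion above is the refresh_region idea.

```python
class ToyRegionTools:
    """Toy model of the region behavior; it does not call GRASS."""

    def __init__(self, refresh_region=False):
        self.refresh_region = refresh_region
        self.region_shape = None  # (rows, cols) once set

    def process(self, shape):
        """Pretend to run a tool on an array with the given (rows, cols) shape."""
        if self.region_shape is None or self.refresh_region:
            # Set (or reset) the region to match the incoming array.
            self.region_shape = shape
        elif shape != self.region_shape:
            raise ValueError(
                f"Array shape {shape} does not match region {self.region_shape}"
            )
        return shape


fixed = ToyRegionTools()
fixed.process((2, 3))
try:
    fixed.process((5, 5))
except ValueError as error:
    print(f"Fails: {error}")

refreshing = ToyRegionTools(refresh_region=True)
refreshing.process((2, 3))
print(refreshing.process((5, 5)))
```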

Evaluating length of user code

One could also argue that, in the case of NumPy arrays, truly plain functions are preferable to calling a tool as a method of an object, because even a single call still requires creating an object beforehand or in the same statement, as in these two examples:

import numpy as np

from grass.experimental.standalone_tools import StandaloneTools

tools = StandaloneTools()
slope = tools.r_slope_aspect(elevation=np.ones((2, 3)), slope=np.ndarray)

import numpy as np

from grass.experimental.standalone_tools import StandaloneTools

slope = StandaloneTools().r_slope_aspect(elevation=np.ones((5, 5)), slope=np.ndarray)

Shortcut object in the library

We could create a StandaloneTools object on the Python module level, so that users can import it. This would be similar to grass.pygrass.modules.shortcuts (hence calling it shortcut here). In the library, we would have:

# grass/experimental/standalone_tools.py

tools = StandaloneTools(refresh_region=True, keep_data=False, use_one_project=False)

And then the user code would be:

# myscript.py

import numpy as np

from grass.experimental.standalone_tools import tools

slope = tools.r_slope_aspect(elevation=np.ones((5, 5)), slope=np.ndarray)

This would exist alongside the option to create one or more StandaloneTools objects, and it would likely have a different configuration (independent region, no data preserved, for truly standalone runs). The result would be possible confusion due to yet another option and some inconsistency, but it might be the best way to provide such an API because it creates the simplest user code.
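The configuration difference between a shared shortcut object and an explicitly created instance can be shown with a toy class. ToyStandaloneTools is a hypothetical stand-in, and its default values are assumptions made only for this sketch; the keyword names are the ones proposed above.

```python
class ToyStandaloneTools:
    """Toy stand-in which only stores the configuration."""

    def __init__(self, refresh_region=False, keep_data=True, use_one_project=True):
        # The defaults are assumptions for this sketch, not the actual API.
        self.refresh_region = refresh_region
        self.keep_data = keep_data
        self.use_one_project = use_one_project


# Library side: one shared instance configured for truly standalone runs.
tools = ToyStandaloneTools(refresh_region=True, keep_data=False, use_one_project=False)

# User side: explicitly created instances remain possible and independent.
my_tools = ToyStandaloneTools()

print(tools.refresh_region, my_tools.refresh_region)
```

The two objects behave differently for the same call, which is the inconsistency the shortcut would introduce in exchange for shorter user code.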


This adds r.pack files (i.e., native GRASS raster files) as input and output to tools when called through the Tools object. Tool calls such as r_grow can take r.pack files as input or output. The format is distinguished by the file extension.

Notably, tool calls such as r_mapcalc don't pass input or output data as separate parameters (they use expressions or base names), so they can be used this way only when a wrapper exists (r_mapcalc_simple) or, in the future, when more information is included in the interface or passed between the tool and the Tools class Python code. Similarly, tools with multiple inputs or outputs in a single parameter are currently not supported.

The code uses --json with the tool to get the information on what is input and what is output, because all are files which may or may not exist (this is different from NumPy arrays, where the user-provided parameters clearly say what is input (object) and what is output (class)). Consequently, the whole import-export machinery is only started when there are files in the parameters as identified by the parameter converter class.

Currently, the in-project raster names are driven by the file names. This will break for parallel usage and will not work for vector as is. While it is good for guessing the right (and nice) name, e.g., for an r.mapcalc expression, ultimately, unique names retrieved with an API function are likely the way to go.

When caching is enabled (either through use of a context manager or explicitly), import of inputs is skipped when they were already imported or when they are known outputs. Without the cache, data is deleted after every tool (function) call. Caching keeps the in-project data in the project (as opposed to using a hidden cache or deleting the data). The parameter to explicitly drive this is called use_cache (originally keep_data). The objects track what is imported and also track import and cleaning tasks at the function call versus object level.
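Detection by file extension, names driven by file names, and skipping repeated imports can be sketched together. This is a simplified illustration, not the grass.tools code: find_pack_files and ToyImportCache are hypothetical names, nothing is actually imported, and the sketch does not distinguish inputs from outputs (the description above relies on --json for that).

```python
from pathlib import Path


def find_pack_files(params):
    """Return {parameter: Path} for values whose file extension is .pack."""
    found = {}
    for key, value in params.items():
        if isinstance(value, (str, Path)) and Path(value).suffix == ".pack":
            found[key] = Path(value)
    return found


class ToyImportCache:
    """Tracks which pack files already have an in-project raster."""

    def __init__(self):
        self.imported = {}  # Path -> raster name

    def raster_name(self, path):
        """Return the raster name for a file and whether an import is needed."""
        if path in self.imported:
            return self.imported[path], False
        # The raster name is driven by the file name.
        name = path.stem
        self.imported[path] = name
        return name, True


cache = ToyImportCache()
files = find_pack_files({"map": "data/slope.pack", "percentile": 90})
first = cache.raster_name(files["map"])
second = cache.raster_name(files["map"])
print(first, second)
```

Two files named slope.pack in different directories would get the same raster name here, which is the kind of collision that motivates unique names.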
The data is cleaned even in case of exceptions. The interface was clarified by creating a private/protected version of run_cmd which has the internal-only parameters. This function uses a single try-finally block to trigger the cleaning in case of exceptions.

While the code generally supports paths as both strings and Path objects, the actual decisions about import are made from the list-of-strings form of the command. From the caller's perspective, overwrite is supported in the same way as for in-project GRASS rasters.

The tests use module scope to reduce fixture setup by a couple of seconds. Changes include a minor cleanup of comments in tests related to testing results without format=json and with, e.g., the --json option.

The class documentation discusses overhead and parallelization because the calls are more costly and the object now carries significant state, with the cache and the rasters created in the background. This includes a discussion of the NumPy arrays, too, and slightly improves the wording in the part discussing arrays.

This builds on top of #2923 (Tools API), and it is parallel with #5878 (NumPy array IO), although it runs at a different stage than NumPy array conversions and uses a cache for the imported data (it may be connected more with the arrays in the future). This can be used efficiently in Python with Tools (caching, assuming a project) and in a limited way also with the experimental run subcommand in the CLI (no caching, still needs an explicit project). There is more potential use of this with the standalone tools concept (#5843). The big picture is also discussed in #5830.
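The cleanup in case of exceptions mentioned above follows the general try-finally pattern, which can be sketched as follows. The function run_with_cleanup is hypothetical and is not the private run_cmd variant from the PR; the list of names stands in for rasters imported for a single call.

```python
def run_with_cleanup(run_tool, imported, use_cache=False):
    """Run a callable and remove temporary rasters afterwards unless cached.

    `imported` is a list of raster names created for this call.
    """
    try:
        return run_tool()
    finally:
        # Runs on success and on exceptions alike.
        if not use_cache:
            imported.clear()


def failing_tool():
    raise RuntimeError("tool failed")


rasters = ["elevation", "slope"]
try:
    run_with_cleanup(failing_tool, rasters)
except RuntimeError as error:
    print(f"{error}; rasters left: {rasters}")
```

The exception still reaches the caller; the finally block only guarantees that the cleanup happens first.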

echoix · 2025-10-07T02:45:50Z

Dependent #2923 was merged

wenzeslaus and others added 30 commits June 3, 2023 23:57

Support verbosity, overwrite and region freezing

aaef183

Raise exception instead of calling handle_errors

54db575

Allow to specify stdin and use a new instance of Tools itself to execute with that stdin

82f5894

Add ignore errors, r_mapcalc example, draft tests

0f1e210

Add test for exceptions

f4e3fed

Add tests and Makefile

04087e8

Convert values to ints and floats in keyval

6ab8e40

Do not overwrite by default to follow default behavior in GRASS GIS

744cfac

Add doc, remove old code and todos

24c27e6

Add to top Makefile

ff187a6

Add docs for tests

22773c8

Allow test to fail because of the missing seed parameter (so results are different now)

2911065

Merge branch 'main' into add-session-tools-object

3ac46c3

Lock with both timeout and max number of tries

bd3667b

Timeout in CLI, sleep times as increased initial sleep time, removing the max tries, focusing on timeout

2a7439f

Actually measure the elapsed time in addition to counting. While counting clearly ends the loop, the elapsed time may be significantly higher for some timeouts given that the lock process execution takes a second to execute.

f754f59

Add CLI for direct lock-unlock procedure

7b0f1d5

Doc timeout

a1203fa

Create a function to unlock a mapset

cb02a58

Add opt-in lock-unlock to the init function in Python

3e748e4

Add env to locking functions

fd1e285

Add force unlock to Python API

34ee30b

Add tests for Python API

35c1ea2

Messages which work in CLI and Python with focus on the mapset and good solutions.

c542bf4

Add Python doc

d1ccdcb

Doc for CLI

782f7d0

Disable lock test on Windows if they require actual lock to succeed.

34b127b

Use minimal __main__.py without the __name__ check according to Python doc

0a2887f

Add Tools from add-session-tools-object

bf761ca

wenzeslaus added 3 commits June 11, 2025 08:35

Make the special features standalone objects used by composition

4a1e374

Merge remote-tracking branch 'upstream/main' into add-session-tools-object

651df11

Remove NumPy

ce7c53e

wenzeslaus mentioned this pull request Jun 11, 2025

grass.tools: Add raster pack files IO to Tools #5877

Merged

wenzeslaus mentioned this pull request Jun 12, 2025

grass.tools: Add API to access tools as functions #2923

Merged

More robust version of getting the parsed CLI from --json

bd12384

wenzeslaus changed the title ~~grass.experimental: Add API and CLI to access tools without a session~~ grass.tools: Add API and CLI to access tools without a session Jul 30, 2025

wenzeslaus added 19 commits September 2, 2025 16:11

Merge code-wise with main

4958982

Make the code functional again and align with the grass.tools code. Clean up tests and doc-strings.

52b29d8

Remove rasters which are copies of pack files

0b3f412

Clean up the tmp files, further sync with grass.tools

bd5c6f6

Merge remote-tracking branch 'upstream/main' into add-pack-files-io-to-tools

4d14a47

Use g.list with JSON in tests

faf96a1

Merge remote-tracking branch 'upstream/main' into add-pack-files-io-to-tools

1c0bc05

Start the import-export machinery only when there are files in the parameters

fc48944

Integrate pack code into grass.tools

998255f

Move code out of the Tools class, clean up code around ImporterExporter class

353152d

Merge remote-tracking branch 'upstream/main' into add-pack-files-io-to-tools

fa1c2a3

Doc for tests

fcbf52f

Use tmp_path for files in tests. Test supported path representations.

2eaba0e

Remove last piece of grass.experimental.tools from CLI

b162e39

Merge main branch code-wise

821a1b9

Update the names (but code does not run)

a148d08

Integrate pack IO code

adf447f

Add full env and session handling

11006e8

Merge remote-tracking branch 'upstream/main' into cli-with-pack

cb801c2

echoix added the conflicts/needs rebase Rebase to or merge with the latest base branch is needed label Oct 7, 2025
