Add new path API for `os2` #4954

Feoramund · 2025-03-21T23:36:17Z

In order for `os2` to replace `os`, `filepath` has to be decoupled from the package. This PR takes care of that by rewriting most of the main `core:path/filepath` API directly into `os2`. (The glob-related API will come in a future PR that I want to spend more time testing.)

- I completely rewrote how `clean` works so that it no longer depends on the old `Lazy_Buffer` structure and procs.
- All of the procs now use system-specific internal procedures where applicable instead of `when` statements, to keep things cleaner and more modular.
- Documentation has been added.
- The splitting API has been simplified.
  - Instead of `split`, `base`, `short_stem`, `ext`, `long_ext`, and `dir`, we now have `split_path -> (dir, filename)` and `split_filename -> (base, ext)`. There is also `split_filename_all` to treat the first `.` extension separator as the splitting point.
  - The splitting API will no longer include a path separator in the directory when a filename is present. This means `/usr/share` will split into `/usr` and `share`, instead of `/usr/` and `share` as was done with the `filepath` API.
- `rel` has also been rewritten. The existing `filepath.rel` is invalid on case-sensitive filesystems (POSIX, for example), because it uses `strings.equal_fold`. The rewritten version no longer automatically runs `clean` on the arguments for the sake of performance, and this is documented as an argument requirement.
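To make the directory/filename rule above concrete, here is a minimal sketch of the splitting behavior. It is not the `os2` implementation: `split_path_sketch` is a hypothetical name, it assumes `/` is the only separator and that the path is already cleaned, and the handling of a root-only directory is an assumption rather than something stated in this PR.

```odin
package split_path_sketch

import "core:fmt"
import "core:strings"

// Hypothetical illustration of the splitting rule only; not os2's code.
// Assumes '/' is the sole separator and `path` has already been cleaned.
split_path_sketch :: proc(path: string) -> (dir, filename: string) {
	i := strings.last_index_byte(path, '/')
	if i < 0 {
		// No separator at all: treat the whole path as a filename.
		return "", path
	}
	if i == 0 {
		// Assumption: keep the root separator so the directory is not empty.
		return path[:1], path[1:]
	}
	// The separator itself belongs to neither side of the split.
	return path[:i], path[i+1:]
}

main :: proc() {
	dir, filename := split_path_sketch("/usr/share")
	fmt.println(dir, filename) // /usr share
}
```

The point of the sketch is the last `return`: the directory ends before the separator, so `/usr/share` yields `/usr` rather than `/usr/`.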

Here is the new API summary. Given that it lives directly inside `os2`, the procedure names have been made verbose enough to outline their path-specific functionality, in line with the rest of `os2`'s procedures being fluently readable.

- `are_paths_identical` compares two paths for exactness (but not equivalency). This is to handle case sensitivity on different platforms.
- `clean_path` replaces `filepath.clean`.
- `is_absolute_path` replaces `filepath.is_abs`.
- `get_absolute_path` replaces `filepath.abs`.
- `get_relative_path` replaces `filepath.rel`.
- `split_path` replaces `filepath.split`, `filepath.dir`, and `filepath.base`.
- `join_path` replaces `filepath.join`.
- `split_filename`, when used with the filename result of `split_path`, replaces `stem` and `ext`.
- `split_filename_all`, when used with the filename result of `split_path`, replaces `short_stem` and `long_ext`.
- `join_filename` complements `split_filename_*`, as it made sense to me to have a join if we have split.
- `split_path_list` replaces `filepath.split_list`.

Note: `split_filename_*`, when given `".foo"`, will return `".foo", ""` instead of `"", "foo"`.

I reduced the various splitting mechanisms down to just a couple for the sake of clarity and performance, because if someone needs both the name and the extension split, it's easier to just index the separator once and return both sides of the split. You can discard the other if you don't need it.
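As a rough illustration of "index the separator once and return both sides", the following sketch mirrors the described behavior, including the dot-prefixed case from the note above. The procedure names are hypothetical, the code is not taken from `os2`, and the treatment of names such as `.foo.bar` is an assumption.

```odin
package split_filename_sketch

import "core:fmt"
import "core:strings"

// Hypothetical sketch: split at the LAST '.' and return both sides.
split_filename_sketch :: proc(filename: string) -> (base, ext: string) {
	i := strings.last_index_byte(filename, '.')
	if i <= 0 {
		// No dot, or only a leading dot (".foo"): the whole name is the base.
		return filename, ""
	}
	return filename[:i], filename[i+1:]
}

// Hypothetical sketch: split at the FIRST '.', ignoring a leading dot.
split_filename_all_sketch :: proc(filename: string) -> (base, ext: string) {
	if len(filename) < 2 {
		return filename, ""
	}
	// Search from index 1 so a leading dot stays part of the name.
	i := strings.index_byte(filename[1:], '.')
	if i < 0 {
		return filename, ""
	}
	i += 1 // Translate back to an index into `filename`.
	return filename[:i], filename[i+1:]
}

main :: proc() {
	fmt.println(split_filename_sketch("archive.tar.gz"))     // archive.tar gz
	fmt.println(split_filename_all_sketch("archive.tar.gz")) // archive tar.gz
}
```

A caller that only wants one side can discard the other with `_`, for example `base, _ := split_filename_sketch(name)`.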

I'm interested in hearing feedback about this API: whether it's sensible and matches expectations, particularly with regard to trimming final separators on split directories and treating dot-prefixed filenames as entire names. It's a convention in the POSIX world that dot-prefixed files are treated as hidden (so it feels more natural to me that the dot is part of the filename), but I'm not sure how Windows users feel about this particular handling.

gingerBill · 2025-03-24T09:55:20Z

core/os/os2/path.odin

+				}
+			case:
+				// Copy the path element verbatim and add a separator.
+				intrinsics.mem_copy_non_overlapping(raw_data(buffer[buffer_i:]), raw_data(elem), len(elem))


Any reason this style isn't used? I know mem_copy_non_overlapping is going to be faster in this case, but it is a little noisier.

copy(buffer[buffer_i:], elem)

See all other cases of this too.

gingerBill

I am not sure about the benefit of intrinsics.mem_copy_non_overlapping vs copy. However, everything else looks really really good!

Feoramund · 2025-03-24T17:09:58Z

> I am not sure about the benefit of intrinsics.mem_copy_non_overlapping vs copy. However, everything else looks really really good!

Initially I did this for speed, because I was under the assumption that memmove was creating an actual secondary buffer. I did some digging after reading your comments here, and I realized I misremembered a key point of the C standard, which is that memmove works as if it's copying to a temporary buffer, but it doesn't have to. Sane implementations just check whether the runs would overlap and copy backwards or forwards depending on that.

copy is completely fine and more readable.
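For reference, a minimal sketch of the `copy` style when joining elements into a preallocated buffer. `join_into` is a hypothetical helper written for this illustration, not a procedure from `os2`, and it assumes `/` as the separator and a buffer that the caller has sized large enough.

```odin
package copy_sketch

import "core:fmt"

// Hypothetical helper: writes `elems` joined by '/' into `buffer`.
// Assumes the caller made `buffer` large enough for all elements plus separators.
join_into :: proc(buffer: []byte, elems: []string) -> string {
	buffer_i := 0
	for elem, i in elems {
		if i > 0 {
			buffer[buffer_i] = '/'
			buffer_i += 1
		}
		// The built-in `copy` accepts a string source and returns the byte count copied.
		buffer_i += copy(buffer[buffer_i:], elem)
	}
	return string(buffer[:buffer_i])
}

main :: proc() {
	buffer: [32]byte
	fmt.println(join_into(buffer[:], []string{"usr", "share", "odin"})) // usr/share/odin
}
```

Because `copy` returns the number of bytes written, advancing the index and copying collapse into one statement, which is part of why it reads less noisily than the intrinsic call.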

Feoramund added 8 commits March 20, 2025 14:43

Assert that _Path_Separator is 7-bit ASCII

a495cd5

There are several places where this is assumed to be true, most visibly in `is_path_separator`, as it takes a `byte` argument. Note that the data type of `_Path_Separator` is a rune, which allows any Unicode value.
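A compile-time check of this kind might look like the sketch below. `PATH_SEPARATOR` and `is_separator` are hypothetical stand-ins for the system-specific `_Path_Separator` and `is_path_separator`; the exact form of the assertion in the commit is not shown in this thread.

```odin
package path_separator_sketch

import "core:fmt"

// Hypothetical stand-in for os2's system-specific `_Path_Separator` rune.
PATH_SEPARATOR :: '/'

// Compile-time check: the separator must be 7-bit ASCII so that it can be
// compared safely against individual bytes of a UTF-8 string.
#assert(PATH_SEPARATOR <= rune(0x7F))

// Hypothetical byte-based predicate, relying on the assertion above.
is_separator :: proc(c: byte) -> bool {
	return rune(c) == PATH_SEPARATOR
}

main :: proc() {
	fmt.println(is_separator('/')) // true
	fmt.println(is_separator('a')) // false
}
```

Without the assertion, a separator above 0x7F would span multiple bytes in UTF-8 and a byte-wise comparison could never match it.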

Add missing documentation to os2/path

d1d8623

Add new path API for os2

abe0c30

Decouple usage of filepath from os2

4e7f54c

Add tests for new os2 path API

3525e71

Remove if ODIN_OS == .Windows in file that can only be built on Windows

6a6980f

Make os2 Linux _is_path_separator compare against _Path_Separator

cfa3e97

Add require_results to getters in os2 path API

649376f

gingerBill reviewed Mar 24, 2025

gingerBill requested changes Mar 24, 2025

gingerBill merged commit 4a595f9 into odin-lang:master Mar 24, 2025
7 checks passed

laytan mentioned this pull request Feb 28, 2025

core:os tracking issue #4710

