Add float parser, generalise one_of/.optional() a bit #14

jsdw · 2023-11-15T17:42:24Z

add chars::float to parse floats
add chars::case_insensitive_eq for case insensitive token matching.
generalise one_of and .optional(); allow expressions passed to either to return Options or bools, not just Options.
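
To illustrate the third point, here is a minimal sketch of how a combinator can accept closures returning either a bool or an Option. The trait name `IsMatch`, the `optional` free function and the `&str` plus position cursor are all hypothetical stand-ins chosen for this example; they are not taken from the PR's code.

```rust
/// Hypothetical trait: anything that can report whether an attempt matched.
pub trait IsMatch {
    fn is_match(&self) -> bool;
}

impl IsMatch for bool {
    fn is_match(&self) -> bool {
        *self
    }
}

impl<T> IsMatch for Option<T> {
    fn is_match(&self) -> bool {
        self.is_some()
    }
}

/// Run `attempt` against a cursor into `input`. If the result says
/// "no match", rewind the cursor so that nothing is consumed.
pub fn optional<R: IsMatch>(
    input: &str,
    pos: &mut usize,
    attempt: impl FnOnce(&str, &mut usize) -> R,
) -> R {
    let start = *pos;
    let res = attempt(input, pos);
    if !res.is_match() {
        *pos = start;
    }
    res
}
```

With one trait bound, the same `optional` works for a skip-style closure returning `true`/`false` and for a parse-style closure returning `Some(value)`/`None`.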

jsdw · 2023-11-15T18:31:52Z

Easyoakland · 2023-11-16T22:53:41Z

Why implement these in an extension trait instead of the same way that a user of the library would implement these (free functions)?

Would it not make more sense for some of these functions to use a stack allocated buffer with a maximum size since there are a maximum number of digits? For example u8 max is 255 so 3 chars is sufficient.

jsdw · 2023-11-17T00:44:16Z

> Why implement these in an extension trait instead of the same way that a user of the library would implement these (free functions)?

I started with freestanding functions, but the extension trait felt ergonomically superior in the end. One example is the functions parse_f32 and parse_f64: as freestanding functions you need to call parse_f32::<_, String>(toks) (i.e. you need 2 generics, one for tokens and one for buffer. Granted, I could elide the tokens using impl syntax, but that doesn't feel right in a proper library because one may want to specify it). With an extension trait it's toks.parse_f32::<String>(), which is just nicer, and the one generic is easier to explain (plus nice symmetry with toks.parse::<f32, String>() or whatever).

Subjective I'm sure, but I tried both and preferred this!
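
The two call shapes being compared can be sketched as follows. This is a simplified stand-in, not the PR's code: it assumes `Peekable<impl Iterator<Item = char>>` in place of the library's token type, accepts only digits and `.` rather than a full float grammar, and the trait name `CharTokensExt` is made up for the example.

```rust
use std::iter::Peekable;

/// Free-function form: `I` (tokens) and `B` (buffer) are both generic
/// parameters, so a turbofish has to mention both.
pub fn parse_f32<I, B>(toks: &mut Peekable<I>) -> Option<f32>
where
    I: Iterator<Item = char>,
    B: Default + Extend<char> + AsRef<str>,
{
    let mut buf = B::default();
    // Simplification: collect digits and '.' only.
    while let Some(c) = toks.next_if(|c| c.is_ascii_digit() || *c == '.') {
        buf.extend(Some(c));
    }
    buf.as_ref().parse().ok()
}

/// Extension-trait form: the token type is `Self`, leaving only `B` to name.
pub trait CharTokensExt {
    fn parse_f32<B: Default + Extend<char> + AsRef<str>>(&mut self) -> Option<f32>;
}

impl<I: Iterator<Item = char>> CharTokensExt for Peekable<I> {
    fn parse_f32<B: Default + Extend<char> + AsRef<str>>(&mut self) -> Option<f32> {
        // Delegates to the free function above.
        parse_f32::<I, B>(self)
    }
}
```

The free function is called as `parse_f32::<_, String>(&mut toks)`, the method as `toks.parse_f32::<String>()`.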

> Would it not make more sense for some of these functions to use a stack allocated buffer with a maximum size since there are a maximum number of digits? For example u8 max is 255 so 3 chars is sufficient.

I didn't add any functions like that; only ones for f32/f64?

Easyoakland · 2023-11-17T01:37:06Z

> (i.e. you need 2 generics, one for tokens and one for buffer. Granted, I could elide the tokens using impl syntax, but that doesn't feel right in a proper library because one may want to specify it)

I'm not sure why that's preferable to impl syntax. If you want to specify a type why not do that earlier?

fn parser<B>(tok: &mut impl Token...){...}
let mut a: StrTokens<'static> = "1234".into_tokens();
parser::<String>(&mut a);
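
A compilable version of that idea might look like the sketch below. It assumes Rust 1.63 or later, where explicit generic arguments are allowed alongside `impl Trait` parameters, and it substitutes `Peekable<impl Iterator<Item = char>>` for the library's token type. The names `parse_float` and `demo` are hypothetical.

```rust
use std::iter::Peekable;

/// Hypothetical parser: only the buffer type `B` is a named generic.
/// The token type is hidden behind `impl Iterator`.
pub fn parse_float<B>(toks: &mut Peekable<impl Iterator<Item = char>>) -> Option<f32>
where
    B: Default + Extend<char> + AsRef<str>,
{
    let mut buf = B::default();
    // Simplification: collect digits and '.' only.
    while let Some(c) = toks.next_if(|c| c.is_ascii_digit() || *c == '.') {
        buf.extend(Some(c));
    }
    buf.as_ref().parse().ok()
}

/// The caller pins the token type when constructing it, then names only `B`.
pub fn demo() -> Option<f32> {
    let mut toks: Peekable<std::str::Chars<'static>> = "12.5".chars().peekable();
    parse_float::<String>(&mut toks)
}
```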

If the syntax that users use is different from the library's, I think that users defining new combinators (the main thing) become second class citizens. I'm of the opinion that combinators defined in the library should be useful as a big tutorial.

> I didn't add any functions like that; only ones for f32/f64?

Oh, apparently floats with excessive digits of precision are just rounded so the input can be arbitrarily long.
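
The difference can be checked with the standard library's own parsers. The helper names below are hypothetical; the behaviour shown is that of `str::parse` for `u8` and `f32`.

```rust
/// Number of decimal digits needed for the largest u8 (255).
pub fn u8_max_digits() -> usize {
    u8::MAX.to_string().len()
}

/// Parse a decimal string of any length as f32. Surplus digits are
/// rounded to the nearest representable value rather than rejected.
pub fn parse_f32_rounded(s: &str) -> Option<f32> {
    s.parse().ok()
}
```

So a fixed-size buffer bounds the digits of an integer's maximum value, but a float literal can carry any number of digits and still parse.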

Easyoakland · 2023-11-17T02:15:45Z

I also think the naming parse_f32(), parse_f(), float(), ... is worse than f32(), float(), skip_float().

Another advantage of free functions: if it really is useful to have both parse and skip versions of things, they can live in two modules named parse and skip.

I'm not sure it is though. Why not only have the skip versions and allow the user to .parse() if they want to?

Another alternative could be making these things Tokens. Then you could do f32(&mut toks).as_iter().count() != 0 (or make consume return a bool) for the bool case and f32(&mut toks).parse() for the parse case. For this to work, the macro for defining token newtypes should probably be made first. Or it could be a function returning a particular Tokens that is parameterized by its parse function.

At this point it might make sense to impl<T: Tokens> Tokens for &mut T and take everything by value, as in core::iter; then you can write f32(toks).parse().
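
A minimal sketch of that blanket impl, using a cut-down stand-in trait rather than the library's real `Tokens` trait (which has more methods). `CharCursor` and `count_digits` are hypothetical names for the example.

```rust
/// Cut-down stand-in for a token stream trait.
pub trait Tokens {
    type Item;
    fn next(&mut self) -> Option<Self::Item>;
}

/// Blanket impl, mirroring `impl<I: Iterator> Iterator for &mut I` in core:
/// a mutable reference to a token stream is itself a token stream.
impl<T: Tokens> Tokens for &mut T {
    type Item = T::Item;
    fn next(&mut self) -> Option<Self::Item> {
        (**self).next()
    }
}

/// A simple concrete token stream over the chars of a string.
pub struct CharCursor<'a>(pub std::str::Chars<'a>);

impl<'a> Tokens for CharCursor<'a> {
    type Item = char;
    fn next(&mut self) -> Option<char> {
        self.0.next()
    }
}

/// A combinator that takes its tokens by value. Passing `&mut cursor`
/// lends the stream; passing `cursor` gives it away.
/// Note: this sketch also consumes the first non-digit it sees.
pub fn count_digits(mut toks: impl Tokens<Item = char>) -> usize {
    let mut n = 0;
    while let Some(c) = toks.next() {
        if !c.is_ascii_digit() {
            break;
        }
        n += 1;
    }
    n
}
```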

jsdw · 2023-11-17T10:34:27Z

> If the syntax that users use is different from the library's, I think that users defining new combinators (the main thing) become second class citizens. I'm of the opinion that combinators defined in the library should be useful as a big tutorial.

I don't think it's a huge deal either way, but I do agree with the tutorial perspective and promoting freestanding funcs as a good way to do these things. I might revert to freestanding funcs with impl syntax again for this reason; I can't think of a good reason not to just use impl syntax here, really.

> I also think the naming parse_f32(), parse_f(), float(), ... is worse than f32(), float(), skip_float().

This is just opinion but I prefer the current names; parse_f32 is nicely consistent with Tokens::parse so I prefer that. float is consistent with methods like Tokens::token and Tokens::tokens in that it will return a boolean whether it's seen the thing in question. parse_f is not public anyway.

> I'm not sure it is though. Why not only have the skip versions and allow the user to .parse() if they want to?

I was going to just have float(), but these are already helper utils, and I feel like most of the time somebody would actually want to get the float value back. I was originally going to not expose float at all, only parse_f32/parse_f64, and I may still go that way until there is some use case for exposing float.

> Another alternative could be making these things Tokens. Then you could do f32(&mut toks).as_iter().count() != 0 (or make consume return a bool) for the bool case and f32(&mut toks).parse() for the parse case.

I guess right now I don't see the point, so I'd rather avoid that extra complexity. Would somebody really want to iterate over the tokens that are part of a float and do something with only a subset of them? I doubt it. (If I kept float(), somebody could do as parse_f does and take a slice of the tokens anyway if they really wanted to, but I don't see why they would, so it doesn't compel me to keep it.)
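
For readers following the float() versus parse_f32() distinction, here is a rough sketch of how a bool-returning skip function and a value-returning parse function can relate, with the parse version slicing out whatever the skip version consumed. It is an illustration under assumptions, not the PR's implementation: `Cursor` is a hypothetical `&str` cursor, and the grammar handled is only an optional sign, digits and an optional fractional part.

```rust
/// Hypothetical cursor over a string slice.
pub struct Cursor<'a> {
    input: &'a str,
    pos: usize,
}

impl<'a> Cursor<'a> {
    pub fn new(input: &'a str) -> Self {
        Cursor { input, pos: 0 }
    }

    /// The input that has not been consumed yet.
    pub fn rest(&self) -> &'a str {
        &self.input[self.pos..]
    }
}

/// Skip a float-like prefix. Returns true and advances the cursor if one
/// was found; returns false and leaves the cursor untouched otherwise.
pub fn float(cur: &mut Cursor<'_>) -> bool {
    let bytes = cur.rest().as_bytes();
    let mut i = 0;
    if matches!(bytes.first().copied(), Some(b'+' | b'-')) {
        i += 1;
    }
    let int_start = i;
    while i < bytes.len() && bytes[i].is_ascii_digit() {
        i += 1;
    }
    if i == int_start {
        return false; // no digits, so not a float
    }
    if bytes.get(i) == Some(&b'.') {
        let mut j = i + 1;
        while j < bytes.len() && bytes[j].is_ascii_digit() {
            j += 1;
        }
        // Only take the '.' if at least one digit follows it.
        if j > i + 1 {
            i = j;
        }
    }
    cur.pos += i;
    true
}

/// Parse version: skip with `float`, then parse the slice it covered.
pub fn parse_f32(cur: &mut Cursor<'_>) -> Option<f32> {
    let start = cur.pos;
    if !float(cur) {
        return None;
    }
    cur.input[start..cur.pos].parse().ok()
}
```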

jsdw added 5 commits November 15, 2023 17:40

Add float parser, generalise one_of/.optional() a bit

1634f7f

fmt, clippy, doc fixes

0b8e7c1

Change freestanding char fns to CharTokens extension trait

74467cd

no-std

de6a679

tweak json example docs

b5e911a

tweak doc comments on parse float fns

5a8f929

jsdw added 2 commits November 17, 2023 00:37

use float parsing in JSON example

9a209e3

doc tweak

12d0847

jsdw added 2 commits November 18, 2023 00:28

back to freestanding chars fns

1e8677f

doc tidy

0c400c2

jsdw merged commit 1221933 into master Nov 18, 2023
4 checks passed

jsdw deleted the float-parsing branch November 18, 2023 00:39
