Improve Character emulation #10113

zbynek · 2025-04-06T22:14:41Z

Fixes #9705
Fixes #1989 (not all methods are covered; some cannot be implemented without bundling the Unicode database into the compiled code)

Implements several APIs using Unicode character class escapes (https://developer.mozilla.org/en-US/docs/Web/JavaScript/Reference/Regular_expressions/Unicode_character_class_escape), which should be well supported in all browsers.

The implementation is mostly similar to https://groups.google.com/g/google-web-toolkit-contributors/c/73-aScAShs4/m/gZAhlUXiBAAJ, but the fallbacks for old Edge versions are not included.
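For readers unfamiliar with the approach, here is a minimal sketch of classifying code points through Unicode property classes instead of a bundled lookup table. It is not the code from this PR: it runs on the JVM with `java.util.regex`, whereas the emulation relies on the browser's `RegExp` (where `\p{...}` requires the `u` flag). The class name `PropertyEscapeSketch` and its methods are hypothetical.

```java
import java.util.regex.Pattern;

// Hypothetical illustration, not the PR's implementation: the regex engine's
// Unicode data answers the question, so no table ships with the program.
final class PropertyEscapeSketch {
  // General category L (any letter) and Nd (decimal digit number).
  private static final Pattern LETTER = Pattern.compile("\\p{L}");
  private static final Pattern DECIMAL_DIGIT = Pattern.compile("\\p{Nd}");

  // Turns the code point into a one-code-point string and matches it whole.
  private static boolean matches(Pattern pattern, int codePoint) {
    String s = new String(Character.toChars(codePoint));
    return pattern.matcher(s).matches();
  }

  static boolean isLetter(int codePoint) {
    return matches(LETTER, codePoint);
  }

  static boolean isDigit(int codePoint) {
    return matches(DECIMAL_DIGIT, codePoint);
  }

  public static void main(String[] args) {
    System.out.println(isLetter('A'));     // Latin capital letter
    System.out.println(isDigit(0x0663));   // Arabic-Indic digit three
    System.out.println(isLetter(0x1F600)); // emoji, not a letter
  }
}
```

The trade-off is that results follow whatever Unicode version the regex engine ships with, which is why a browser and a given JRE can disagree on recently added characters.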

niloc132 · 2025-04-30T16:33:02Z

user/super/com/google/gwt/emul/java/lang/Character.java

+  // Known differences between Java 17 and Chrome 135
+  // 11f50 .. 11f59, 16ac0 .. 16ac9, 1e4f0 .. 1e4f9, 1fbf0 .. 1fbf9


Should we leave this method out if it is still wrong? Which side do we consider to be "wrong" here, Java or Chrome? Would it make sense to have a test (possibly ignored) that shows this, so it is easier to reevaluate later?

Also, consider using hex in digit() above, so that it is the same convention as here, or change this to be decimal?

Maybe instead of this comment I should just link to the compatibility table https://docs.oracle.com/en/java/javase/24/docs/api/java.base/java/lang/Character.html#conformance and mention that JRE behaviour on any Java < 24 won't match recent browser releases.
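To make the discrepancy easier to reevaluate later, a small probe along these lines could be run on any JRE. This is only a sketch: it assumes the method under discussion is a digit classification such as `Character.isDigit`, and the class name `DigitRangeProbe` is hypothetical. The ranges are the ones quoted in the code comment above. No expected output is given, because the result depends on the Unicode version of the running JRE.

```java
// Hypothetical probe, not part of the PR. Reports how the running JRE
// classifies the code point ranges listed in the review thread.
final class DigitRangeProbe {
  // Inclusive {start, end} pairs copied from the quoted comment.
  static final int[][] RANGES = {
    {0x11F50, 0x11F59},
    {0x16AC0, 0x16AC9},
    {0x1E4F0, 0x1E4F9},
    {0x1FBF0, 0x1FBF9},
  };

  // Counts the code points in [start, end] that Character.isDigit accepts.
  static int countDigits(int start, int end) {
    int count = 0;
    for (int cp = start; cp <= end; cp++) {
      if (Character.isDigit(cp)) {
        count++;
      }
    }
    return count;
  }

  public static void main(String[] args) {
    System.out.println("java.version = " + System.getProperty("java.version"));
    for (int[] range : RANGES) {
      int size = range[1] - range[0] + 1;
      System.out.printf("%x .. %x: %d of %d reported as digits%n",
          range[0], range[1], countDigits(range[0], range[1]), size);
    }
  }
}
```

Running the same probe on several JDK releases shows which side of the comparison moves as the bundled Unicode data is updated.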

niloc132 · 2025-04-30T16:37:21Z

user/test/com/google/gwt/emultest/java/lang/CharacterTest.java


 /**
 * Tests for java.lang.Character.
 */
+@DoNotRunWith(Platform.HtmlUnitBug)


Can the specific failing tests be annotated with this, each with a comment about why it fails, rather than skip existing tests that pass?

It's pretty much all the tests that call any Character.is* method. They fail because the regexp doesn't even parse in HtmlUnit 2-4. If we want any meaningful test coverage, we need #10115 plus a custom build of HtmlUnit that includes mozilla/rhino#1848.

niloc132 · 2025-04-30T16:37:48Z

user/super/com/google/gwt/emul/java/lang/CaseMapper.java

+  }
+
+  // If String.toUpperCase produces more than 1 codepoint, Character.toUpperCase should
+  // act either as identity or title-case conversion (not supported in GWT).


Maybe add a failing test and mark it ignored so we can see about restoring it when it works?
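As a JVM-side illustration of the case described in that comment (not the emulation code itself), the sketch below compares the string and character conversions for U+00DF (ß), whose full uppercase mapping is the two-character string "SS". The class name `CaseMappingSketch` and its helper are hypothetical.

```java
import java.util.Locale;

// Hypothetical illustration of the multi-codepoint uppercase case.
final class CaseMappingSketch {
  // Number of code points String.toUpperCase produces for a single code point.
  static int upperCodePointCount(int codePoint) {
    String upper = new String(Character.toChars(codePoint)).toUpperCase(Locale.ROOT);
    return upper.codePointCount(0, upper.length());
  }

  public static void main(String[] args) {
    int sharpS = 0x00DF; // LATIN SMALL LETTER SHARP S
    // The string conversion expands to two code points ("SS").
    System.out.println(upperCodePointCount(sharpS));
    // The character conversion cannot expand, so it acts as identity here.
    System.out.println(Character.toUpperCase(sharpS) == sharpS);
  }
}
```

A test for the emulated `Character.toUpperCase` could use the same comparison: whenever the string result has more than one code point, the character result must not be taken from it.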

Co-authored-by: Colin Alworth <[email protected]>

zbynek marked this pull request as draft April 6, 2025 22:14

zbynek requested a review from Copilot April 6, 2025 22:25

zbynek force-pushed the character-emul branch 6 times, most recently from 051a0d0 to 1ab960c April 13, 2025 06:33

zbynek mentioned this pull request Apr 13, 2025

Update HtmlUnit to 4.11.1, update necessary dependencies #10115

Open

zbynek force-pushed the character-emul branch 2 times, most recently from d3a8f64 to 1ad6bc2 April 16, 2025 00:20

Improve Character emulation

2f5f97a

zbynek force-pushed the character-emul branch from 1ad6bc2 to 2f5f97a April 26, 2025 11:51

zbynek marked this pull request as ready for review April 26, 2025 11:51

Fix compile error

d9cd717

zbynek added the Category-JRE label Apr 27, 2025

niloc132 reviewed Apr 30, 2025

zbynek and others added 2 commits April 30, 2025 20:20

Call isDigit directly

32180fd

Co-authored-by: Colin Alworth <[email protected]>

Use hex codes, add documentation

2f55a39
