fix(sql): enforce UTF-8 when loading keyword resources #1260

renechoi · 2025-06-28T08:27:16Z

📝 Pull-Request Description

What & Why

Keywords.readLines loaded SQL keyword lists with the JVM’s default charset.
On environments configured for non-UTF-8 encodings (e.g. Windows CP-1252) this silently corrupted any keyword containing non-ASCII characters, leading to parsing errors in templates that rely on those lists.
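The corruption described above can be reproduced in isolation. The following is a minimal sketch, not code from this PR: the class name `CharsetMismatchDemo` and its `decode` helper are hypothetical, and it assumes the JDK in use provides the `windows-1252` charset. It decodes the same UTF-8 bytes twice, once correctly and once with the legacy charset.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

// Hypothetical demo class; not part of querydsl.
public class CharsetMismatchDemo {

    // Decodes raw bytes with the given charset.
    static String decode(byte[] bytes, Charset charset) {
        return new String(bytes, charset);
    }

    public static void main(String[] args) {
        // Unicode escapes keep this source file independent of the compiler's encoding.
        String original = "\u00C4\u00D6\u00DC"; // ÄÖÜ
        byte[] stored = original.getBytes(StandardCharsets.UTF_8); // 2 bytes per character

        // Assumption: windows-1252 is available on this JDK.
        Charset legacy = Charset.forName("windows-1252");

        String right = decode(stored, StandardCharsets.UTF_8);
        String wrong = decode(stored, legacy);

        System.out.println(right.equals(original)); // true
        System.out.println(wrong.equals(original)); // false
        System.out.println(wrong.length());         // 6: every byte became its own character
    }
}
```

ASCII-only keywords such as SELECT decode identically under both charsets, which is why the problem only surfaces for entries containing non-ASCII characters.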

This patch forces UTF-8 decoding for every /keywords/* resource, guaranteeing identical behaviour on all platforms.

Changes in this PR

Type	Module / File	Summary
🛠 Bug-fix	querydsl-sql/src/main/java/com/querydsl/sql/Keywords.java	Passes StandardCharsets.UTF_8 to InputStreamReader, replacing reliance on the default charset.
✅ Test	querydsl-sql/src/test/java/com/querydsl/sql/KeywordsEncodingTest.java	New regression test that loads a UTF-8 resource (encoding-test) and asserts the content is preserved.
📦 Resource	querydsl-sql/src/test/resources/keywords/encoding-test	Minimal UTF-8 test asset (SELECT + ÄÖÜ) used by the new unit test.
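For readers unfamiliar with the pattern, the fix amounts to naming the charset explicitly when wrapping the resource stream. The sketch below illustrates that idea under stated assumptions: `KeywordLoaderSketch` and its `readLines(InputStream)` signature are hypothetical, the trimming and blank-line handling are illustrative choices, and none of it is copied from `Keywords.java`.

```java
import java.io.BufferedReader;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.LinkedHashSet;
import java.util.Set;

// Hypothetical helper; not the actual Keywords implementation.
public class KeywordLoaderSketch {

    // Reads one keyword per line, always decoding the stream as UTF-8
    // instead of falling back to the JVM default charset.
    static Set<String> readLines(InputStream in) {
        Set<String> lines = new LinkedHashSet<>();
        try (BufferedReader reader =
                new BufferedReader(new InputStreamReader(in, StandardCharsets.UTF_8))) {
            String line;
            while ((line = reader.readLine()) != null) {
                line = line.trim();
                if (!line.isEmpty()) { // skipping blank lines is an assumption of this sketch
                    lines.add(line);
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return lines;
    }

    public static void main(String[] args) {
        // Stand-in for a UTF-8 keyword resource such as the encoding-test asset.
        byte[] resource = "SELECT\n\u00C4\u00D6\u00DC\n".getBytes(StandardCharsets.UTF_8);
        Set<String> keywords = readLines(new ByteArrayInputStream(resource));
        System.out.println(keywords.contains("\u00C4\u00D6\u00DC")); // true
    }
}
```

In real code the stream would presumably come from the classpath (for example via `Class.getResourceAsStream`) rather than from an in-memory byte array; the byte array is used here only to keep the example self-contained.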

Compatibility

Non-breaking – internal implementation detail only; public API unchanged.
Applies uniformly to all dialects that depend on Keywords.

Tests & CI

New JUnit test verifies UTF-8 decoding.
All existing tests continue to pass locally.
The current CI failure resolving easy-jacoco-maven-plugin is unrelated to this change; if desired, I can follow up with a version pin or a mirror configuration.

Keywords.readLines previously relied on the JVM default charset, which could mis-parse the word list on non-UTF-8 systems (e.g. CP1252).

Changes:
• Pass StandardCharsets.UTF_8 to InputStreamReader
• Add KeywordsEncodingTest to guard against regressions

Cross-platform behaviour is now deterministic.

