Fix some issues with custom format function #277

smaye81 · 2025-04-30T15:24:12Z

This fixes a few issues noticed from failing conformance tests:

Quotes were being escaped in string values in lists
Durations had the word duration added to their formatting (along with escaped quotes)
Doubles were not formatting the same as CEL-Go. Namely, insignificant digits were not being removed (1.0 should format as 1, not 1.0)

In addition, this adds some additional unit tests for doubles to ensure the string formatting of doubles is adhering to the same behavior that CEL-Go uses.

smaye81 · 2025-04-30T15:24:45Z

build.gradle.kts

+ this.testLogging {
+ events("failed")
+ exceptionFormat = org.gradle.api.tasks.testing.logging.TestExceptionFormat.FULL
+ showExceptions = true
+ showCauses = true
+ showStackTraces = true
+ }


Added this bc it's much easier to see failing tests in the terminal than having to navigate out to a browser link to see the failing tests.

smaye81 · 2025-04-30T15:29:09Z

src/test/java/build/buf/protovalidate/FormatTest.java

+ }
+
+ @Test
+ void testDouble() {


@jchadwick-buf this is what I meant by some nuance in testing. We could move these double tests to the conformance tests I guess, but not sure about things like testing the above thrown exception. Wdyt?

Personally, I'm not overly concerned about handling all of the error cases identically as long as the error cases do fail as-expected, so I don't see this as a huge problem for adding tests to conformance per-se.

pkwarren · 2025-04-30T16:00:45Z

src/main/java/build/buf/protovalidate/Format.java

 final class Format {
 private static final char[] HEX_ARRAY = "0123456789ABCDEF".toCharArray();
 private static final char[] LOWER_HEX_ARRAY = "0123456789abcdef".toCharArray();
+ private static final DecimalFormat decimalFormat = new DecimalFormat("0.#########");


DecimalFormat instances aren't thread safe - we should use a ThreadLocal to cache these (if we expect they'll be used a lot), add synchronization around them, or create them on demand:

Decimal formats are generally not synchronized. It is recommended to create separate format instances for each thread. If multiple threads access a format concurrently, it must be synchronized externally.

https://docs.oracle.com/javase/8/docs/api/java/text/DecimalFormat.html

pkwarren · 2025-04-30T16:02:08Z

src/main/java/build/buf/protovalidate/Format.java

+ builder.append(val.value());
 } else if (type == TypeEnum.Bytes) {
- formatBytes(builder, val);
+ builder.append(new String((byte[]) val.value(), StandardCharsets.UTF_8));


Mainly a question on CEL in general but can we always assume bytes can be converted to a valid UTF-8 string? Is there some fallback where it would display invalid UTF-8 strings as hex or something like that?

Our formatting is pretty inconsistent right now and we need to add more comprehensive tests to make sure it behaves the same. Right now, CEL-Go's built-in formatting function will throw a runtime error with invalid UTF-8, but our Java implementation will print placeholders �� when using %s and will just print hex digits when using %x. It's on the docket to unify all this behavior soon.

pkwarren · 2025-04-30T16:03:24Z

src/main/java/build/buf/protovalidate/Format.java

 } else if (type == TypeEnum.Bytes) {
- formatBytes(builder, val);
+ builder.append(new String((byte[]) val.value(), StandardCharsets.UTF_8));
+ } else if (type == TypeEnum.Int || type == TypeEnum.Uint || type == TypeEnum.Double) {


Shouldn't we only need to format float/double as decimals? It feels like for int/uint we could just output the string value.

Steve Ayers added 8 commits April 29, 2025 17:47

Duration

e4f1916

Add tests for duration format

3cab97f

Format

ea5ff72

Merge branch 'main' into sayers/fix_duration

5558c85

Tests

4dbc4f3

Adds some additional formatting tests

26665e1

Adds some additional formatting tests

aefec2e

Format

7b75aab

smaye81 commented Apr 30, 2025

View reviewed changes

smaye81 requested review from a user and pkwarren April 30, 2025 15:25

smaye81 commented Apr 30, 2025

View reviewed changes

pkwarren reviewed Apr 30, 2025

View reviewed changes

Feedback

1453f41

smaye81 requested a review from pkwarren April 30, 2025 17:56

pkwarren approved these changes Apr 30, 2025

View reviewed changes

smaye81 merged commit 04f235f into main Apr 30, 2025
4 checks passed

smaye81 deleted the sayers/fix_duration branch April 30, 2025 21:12

smaye81 changed the title ~~Fix some formatting issues~~ Fix some issues with custom format function May 1, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix some issues with custom format function #277

Fix some issues with custom format function #277

Uh oh!

smaye81 commented Apr 30, 2025

smaye81 Apr 30, 2025

smaye81 Apr 30, 2025

ghost Apr 30, 2025

pkwarren Apr 30, 2025

pkwarren Apr 30, 2025

smaye81 Apr 30, 2025

pkwarren Apr 30, 2025

Uh oh!

Labels

3 participants

Fix some issues with custom format function #277

Fix some issues with custom format function #277

Uh oh!

Conversation

smaye81 commented Apr 30, 2025

smaye81 Apr 30, 2025

Choose a reason for hiding this comment

smaye81 Apr 30, 2025

Choose a reason for hiding this comment

ghost Apr 30, 2025

Choose a reason for hiding this comment

pkwarren Apr 30, 2025

Choose a reason for hiding this comment

pkwarren Apr 30, 2025

Choose a reason for hiding this comment

smaye81 Apr 30, 2025

Choose a reason for hiding this comment

pkwarren Apr 30, 2025

Choose a reason for hiding this comment

Uh oh!

Labels

3 participants