lexicons: more string limits #1994

bnewbold · 2023-12-28T11:15:54Z

The most sensitive thing here is putting a (very long!) length limit on alt text. I believe this is long enough to not cause accessibility issues, but can update if needed.

The motivation is that very long embed image descriptions have been causing records to be discarded by Go code. We'll separately work around that by bumping limits in the CBOR library (cborgen), but I think communicating some reasonable limit on these fields is helpful. This isn't intended to be antagonistic; the current problematic records seem to have huge values by mistake, not intentionally.

While I was at it I did an informal review of all the "string" fields in records (I might have missed a couple).

These are technically violations of Lexicon stability, so they need a bunch of review and buy-in. I'm not strident about these changes, but I think they are worth re-visiting, and this is probably our last chance to push them through.

I think the URL fields being format=uri would be uncontroversial
The embed description limits will invalidate a small handful of real-world records
I don't know how many real-world records the alt-text limit might invalidate. I expect very few, but we might want to verify that before merging.

The specific values are somewhat arbitrary. I chose "large" ones to minimize the change in behavior (given that there are no limits right now).
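
As a rough illustration of what such a limit means in practice, the sketch below checks a string against a maximum length. The `StringLimit` shape, the function names, and the 10,000 value are hypothetical and chosen only for the example; they are not the values or code from this PR. It also assumes the limit is measured in UTF-8 bytes rather than characters.

```typescript
// Hypothetical constraint shape, for illustration only.
interface StringLimit {
  maxLength: number // assumed to be measured in UTF-8 bytes
}

// Byte length of a string when encoded as UTF-8.
function utf8Length(value: string): number {
  return new TextEncoder().encode(value).length
}

// Returns null when the value fits, otherwise a short error message.
function checkLimit(value: string, limit: StringLimit): string | null {
  const size = utf8Length(value)
  if (size > limit.maxLength) {
    return `string is ${size} bytes, limit is ${limit.maxLength}`
  }
  return null
}

// Placeholder value, not the limit proposed in this PR.
const exampleAltLimit: StringLimit = { maxLength: 10000 }

console.log(checkLimit('A short description of the image', exampleAltLimit))
console.log(checkLimit('x'.repeat(10001), exampleAltLimit))
```

Counting bytes rather than characters means non-ASCII text reaches the limit sooner, which is one reason to pick a generous value for alt text.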

Haven't run codegen, would do that just before merging.

cc: @whyrusleeping w/r/t embed description length

dholms

These all make sense to me 👍

I can't imagine any of these limits being hit in normal use

Let's get @pfrazee's opinion before merging tho

pfrazee · 2023-12-28T18:57:01Z

Runs the risk of invalidating some existing records but I don't know how many without, you know, scanning the whole network. I'd lean towards yoloing it.
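
For a sense of what such a scan would involve, here is a minimal sketch that assumes the records have already been fetched into memory. The `ImageRecord` shape and the `findOverLimit` name are hypothetical; real post records nest image alt text inside an embed, and nothing here reflects an existing tool.

```typescript
// Hypothetical, flattened record shape used only for this example.
interface ImageRecord {
  uri: string
  alt: string
}

// Returns the URIs of records whose alt text exceeds maxBytes (UTF-8).
function findOverLimit(
  records: readonly ImageRecord[],
  maxBytes: number,
): string[] {
  const encoder = new TextEncoder()
  const over: string[] = []
  for (const record of records) {
    if (encoder.encode(record.alt).length > maxBytes) {
      over.push(record.uri)
    }
  }
  return over
}

// Made-up sample data; the URIs and the limit of 20 are placeholders.
const sample: ImageRecord[] = [
  { uri: 'at://example/post/1', alt: 'short alt text' },
  { uri: 'at://example/post/2', alt: 'y'.repeat(50) },
]

console.log(findOverLimit(sample, 20))
```

Returning the record URIs rather than just a count would make it possible to check whether the oversized values look accidental.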

pfrazee · 2023-12-28T18:57:11Z

The limits look fine to me btw

The motivation is to have *some* size limit on every string in post records, to maximize interoperation. For example, we currently have a CBOR library rejecting some records because of too-long strings. We don't want to limit the ability of folks to be very descriptive in alt text, specifically, so we chose what seems to be a very large limit. If this is not large enough, based on feedback, we can bump it even higher. For context, this is the largest string length limit in all of our lexicons.

bnewbold · 2024-01-02T15:20:18Z

Updated PR with codegen and changeset. My current disposition is to merge and do some small dev comms.

bnewbold requested review from pfrazee, devinivy and dholms December 28, 2023 11:15

dholms approved these changes Dec 28, 2023

bnewbold added 5 commits January 2, 2024 16:11

limit external embed strings sizes

8d3ccaf

make thumbnail URL fields format=uri

fa067d3

This mostly results in checks against the string being empty, or unlimited size.
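
A loose check of that kind might look like the following sketch. The regular expression, the `looksLikeUri` name, and the 8192-byte cap are assumptions made for illustration; this is not the validator used by the codebase, and it does not attempt full RFC 3986 parsing.

```typescript
// Assumed size cap for illustration; not taken from this PR.
const MAX_URI_BYTES = 8192

// Loose pattern: a scheme, a colon, then a non-empty remainder without whitespace.
const LOOSE_URI = /^[a-zA-Z][a-zA-Z0-9+.-]*:\S+$/

// Rejects empty strings, oversized strings, and strings without a scheme.
function looksLikeUri(value: string): boolean {
  if (value.length === 0) return false
  if (new TextEncoder().encode(value).length > MAX_URI_BYTES) return false
  return LOOSE_URI.test(value)
}

console.log(looksLikeUri('https://example.com/thumb.jpg'))
console.log(looksLikeUri(''))
```

Because the pattern is deliberately permissive, it mainly catches empty or unbounded values rather than enforcing strict URI syntax.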

codegen: string limits

9967c3b

add changeset for string format lex updates

5a2f140

bnewbold force-pushed the bnewbold/lex-post-string-limits branch from eddda66 to 5a2f140 January 2, 2024 15:19

bnewbold merged commit ad0d976 into main Jan 2, 2024
10 checks passed

bnewbold deleted the bnewbold/lex-post-string-limits branch January 2, 2024 23:24

github-actions bot mentioned this pull request Jan 2, 2024

Version packages #2011

Closed

devinivy added a commit that referenced this pull request Jan 3, 2024

Revert "lexicons: more string limits (#1994)"

5560b7a

This reverts commit ad0d976.

estrattonbailey added a commit that referenced this pull request Jan 5, 2024

Merge remote-tracking branch 'origin/main' into eric/resume-session-h…

40689f8

…andling * origin/main: Revert "lexicons: more string limits (#1994)"

bnewbold mentioned this pull request Jan 30, 2024

Lexicon nits (Jan 2024 edition) #2111

Open

7 tasks

bnewbold mentioned this pull request Mar 20, 2024

lexicon nits: use string format uri in more places #2348

Merged
