
Description
The image facility is cell-based, and can be used for far more than just "images". It can be a fallback for fonts (which could be handy if alt-text or similar is available), font size (e.g. VT100 double-width/double-height), images of course, custom emojis, and much more. Example - this (multiplexed multihead) screen shows at least 20 images comprising the main picture (each text row is a separate image), plus 94 images, one for each CJK glyph.
A "64 images" max might be a "64 CJK glyphs" or "64 emojis". (I know the spec calls these minimums, but for the purpose of this discussion we should assume them to be a maximum.)
I think the minimum system requirement should be:
- 80x24 = 1920 distinct images screen (minimum VT100 screen size)
- Up to 1920*{cell width in pixels}*{cell height in pixels} total RGBA8888 pixels in storage, enough to fully fill an 80x24 region of screen at 32 bit depth.
This also supports the design philosophy that text cell operations work on images the same way, on a per-cell basis, rather than other protocols' intention for images and text to be fully distinct entities.