Clarify: RegExp scf meaning #3594

Jack-Works · 2025-05-11T03:53:24Z

There are two different wordings when mentioning the Simple Case Folding:

If the file CaseFolding.txt of the Unicode Character Database provides a simple or common case folding mapping for ch, ...

scf:

the Simple Case Folding (scf(cp)) definitions in the file CaseFolding.txt of the Unicode Character Database

The Unicode file mentioned above writes:

# The status field is:
# C: common case folding, common mappings shared by both simple and full mappings.
# F: full case folding, mappings that cause strings to grow in length. Multiple characters are separated by spaces.
# S: simple case folding, mappings to single characters where different from F.
# T: special case for uppercase I and dotted uppercase I
#    - For non-Turkic languages, this mapping is normally not used.
#    - For Turkic languages (tr, az), this mapping can be used instead of the normal mapping for these characters.
#      Note that the Turkic mappings do not maintain canonical equivalence without additional processing.
#      See the discussions of case mapping in the Unicode Standard for more information.
#
# Usage:
#  A. To do a simple case folding, use the mappings with status C + S.
#  B. To do a full case folding, use the mappings with status C + F.
#
#    The mappings with status T can be used or omitted depending on the desired case-folding
#    behavior. (The default option is to exclude them.)

The wording in Canonicalize is clear, it is C + S. I'm not sure if scf also refers to C + S (by the Usage comment), or it's just S (by the status field).

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Clarify: RegExp scf meaning #3594

Clarify: RegExp scf meaning #3594

Jack-Works commented May 11, 2025 •

edited

Loading

Clarify: RegExp scf meaning #3594

Clarify: RegExp scf meaning #3594

Comments

Jack-Works commented May 11, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Jack-Works commented May 11, 2025 •

edited

Loading