Skip to content

Unicode Chapter 5 #7

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Open
sffc opened this issue Apr 16, 2025 · 1 comment
Open

Unicode Chapter 5 #7

sffc opened this issue Apr 16, 2025 · 1 comment

Comments

@sffc
Copy link

sffc commented Apr 16, 2025

I didn't see this mentioned in your presentation, but you may be interested in perusing the section in the Unicode spec about how to efficiently sort UTF-16 in code point order.

https://www.unicode.org/versions/Unicode16.0.0/core-spec/chapter-5/#G21458

@mathiasbynens
Copy link
Member

Thanks @sffc!

Since JavaScript strings support lone surrogates as well (making them not quite UTF-16), I wanted to emphasize this note from the linked section:

Note that this mechanism works correctly only on well-formed UTF-16 text. A modified algorithm must be used to operate on 16-bit Unicode strings that could contain isolated surrogates.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants