Frans wrote on Feb 5, 2021, 14:50:
- it gets stored in MySQL, which along with its tables and string columns are set to UTF-8 (utf8_unicode_ci)
Which version of MySQL is it? Older versions of MySQL, and all versions of MariaDB, required the use of their specially named "utf8mb4" character set instead for true utf-8, one that also supports 4-byte characters such as all emojis.
Frans wrote on Feb 5, 2021, 14:50:
Separately, we still have a problem with UTF-8 characters in single stories. In fact, the two Cyberpunk Exposé stories showing up in the popular threads box mid-January is how I first noticed it. And one of them is still erratic on the Share/Comments links:
Currently for me, your #2 link consistently serves the title of the story as
Cyberpunk 2077 Expos0xE9
, which is invalid utf-8, and as such, consistently shows broken. Both in the h2 posting title, and the top-level
title tag which then ends up as the tab's title.
However, the issue is only on this /s/ "Share" link, which I gather was statically generated one time.
If I click over to the comment viewer
board.pl version, it is always correct. It's served as a proper
Cyberpunk 2077 Expos0xC3A9
…
If I had to guess, not all statically generated pages got refreshed after you have fixed whatever was borked in the perl settings earlier, and some corrupted static pages remain.
Thank you for all the work you do for Blue!