General Discussion is the text corruption on the front page being investigated?

View : : :
16.
 
Re: is the text corruption on the front page being investigated?
Feb 10, 2021, 14:58
16.
Re: is the text corruption on the front page being investigated? Feb 10, 2021, 14:58
Feb 10, 2021, 14:58
 
fds wrote on Feb 10, 2021, 08:41:
Which version of MySQL is it? Older versions of MySQL, and all versions of MariaDB, required the use of their specially named "utf8mb4" character set instead for true utf-8, one that also supports 4-byte characters such as all emojis.
5.7, and yes I'm aware of the mb4 charset situation, but was hoping to avoid that. And the eacute is 2-bytes so I'd think the mb4 angle wouldn't explain the problem.

But the OS is due for a major LTS upgrade anyway, so I have some small hope that that might sort out the problem as yet, if it isn't a coding bug in our scripts.

fds wrote on Feb 10, 2021, 08:41:
Currently for me, your #2 link consistently serves the title of the story as Cyberpunk 2077 Expos0xE9, which is invalid utf-8, and as such, consistently shows broken. Both in the h2 posting title, and the top-level title tag which then ends up as the tab's title.
How did you determine the received byte(s), packet sniffing?

fds wrote on Feb 10, 2021, 08:41:
However, the issue is only on this /s/ "Share" link, which I gather was statically generated one time.
No, the only truly static page is the old HTML archive index (and 3 more that aren't relevant here). And as noted, the frontpage is a static .html file served via the switch script.

Share links have been shortened for SEO purposes, but are executed (via Apache rewrite) by the board.pl script, action viewstory. This shows replacement characters for me as well.

fds wrote on Feb 10, 2021, 08:41:
If I click over to the comment viewer board.pl version, it is always correct. It's served as a proper Cyberpunk 2077 Expos0xC3A9
For title element and story header yes, but most comment subjects show ?'s for me. Although those come out of a separate table/column, and were copied from the news story header when that was still an incorrect 1-byte Latin1 version of eacute, thus the ?'s are to be expected.

fds wrote on Feb 10, 2021, 08:41:
If I had to guess, not all statically generated pages got refreshed after you have fixed whatever was borked in the perl settings earlier, and some corrupted static pages remain.
No, that's not it then.

But thanks for helping to investigate.
-- Frans
Avatar 1258
Date
Subject
Author
7.
Feb 1, 14:50Feb 1 14:50
8.
Feb 2, 23:10Feb 2 23:10
9.
Feb 3, 14:43Feb 3 14:43
 16.
Feb 10, 14:58Feb 10 14:58
  Re: is the text corruption on the front page being investigated?
12.
Feb 6, 08:27Feb 6 08:27