Changes between Initial Version and Version 2 of Ticket #62172
- Timestamp:
- 10/05/2024 07:18:14 PM (2 months ago)
Legend:
- Unmodified
- Added
- Removed
- Modified
-
Ticket #62172 – Description
initial v2 18 18 - HTML is sent to the browser and a `<meta charset="">` tag is produced to instruct the browser how to interpret the bytes it receives. This may or may not match the HTML which WordPress is generating, as most block code and most filters are hard-coded PHP strings in UTF-8 or are at least isomorphic to it up to US-ASCII. 19 19 20 So as is the case with deprecating XHTML and HTML4 support, deprecating UTF-8is mostly about being honest with ourselves and making space officially to remove complex and risky parts of the codebase that often do more harm and help. There's a good chance today that WordPress is already extremely fragile when working with non-UTF-8 systems, and deprecating it would make it possible to fix those existing issues.20 So as is the case with deprecating XHTML and HTML4 support, deprecating non-UTF-8 support is mostly about being honest with ourselves and making space officially to remove complex and risky parts of the codebase that often do more harm and help. There's a good chance today that WordPress is already extremely fragile when working with non-UTF-8 systems, and deprecating it would make it possible to fix those existing issues. 21 21 22 Deprecating UTF-8means WordPress can stop attempting to support an N-to-M text-encoding architecture and replace it with an N-to-1 architecture, where strings that need to be converted are converted at the boundary of the system while everything inside the system is UTF-8, harmonizing all of the different levels of encoding and code.22 Deprecating non-UTF-8 support means WordPress can stop attempting to support an N-to-M text-encoding architecture and replace it with an N-to-1 architecture, where strings that need to be converted are converted at the boundary of the system while everything inside the system is UTF-8, harmonizing all of the different levels of encoding and code.