Opened 10 years ago
Closed 10 years ago
#29573 closed defect (bug) (fixed)
Hebrew post slug is not properly cut on Edit Post screen
Reported by: | KingYes | Owned by: | |
---|---|---|---|
Milestone: | 4.1 | Priority: | normal |
Severity: | normal | Version: | 4.0 |
Component: | Permalinks | Keywords: | has-patch fixed-minor commit dev-feedback |
Focuses: | Cc: |
Attachments (8)
Change History (43)
#1
follow-up:
↓ 3
@
10 years ago
- Component changed from General to Permalinks
- Keywords has-patch added
- Milestone changed from Awaiting Review to 4.1
#2
follow-up:
↓ 4
@
10 years ago
- Summary changed from Summery permelink in Edit Page not work with Hebrew to Hebrew post slug is not properly cut on Edit Post screen
#3
in reply to:
↑ 1
@
10 years ago
Replying to SergeyBiryukov:
This patch should not go into version 4.0.1?
#4
in reply to:
↑ 2
;
follow-up:
↓ 11
@
10 years ago
Replying to SergeyBiryukov:
Why don't we have mb_strlen() fallback in compat.php? I think it makes things easier and you've already had one: ticket:21013:21013.2.patch
#6
@
10 years ago
It is not necessary to check existence of mb_strlen twice. I suggest expanding the patch to make the code easier to read with more if statements.
#7
follow-up:
↓ 9
@
10 years ago
- Keywords has-patch removed
- Version changed from trunk to 4.0
Sergey, this bug was introduced in [28948] and needs to be reverted in 4.0.1.
#8
@
10 years ago
@SergeyBiryukov, You can add seems_utf8()
but you need to add my if statements
too.
#9
in reply to:
↑ 7
;
follow-up:
↓ 10
@
10 years ago
- Milestone changed from 4.1 to 4.0.1
Replying to miqrogroove:
Sergey, this bug was introduced in [28948] and needs to be reverted in 4.0.1.
You're right.
Replying to bainternet:
And Its a major major bug which should be fixed in the next release.
Note that it's only a cosmetic bug with with the representation, the actual permalink is correct,
trac_29573.patch can still display an invalid character if mbstring is disabled. I think a correct fix would be not to cut UTF-8 slugs in that case, see 29573.patch. Working on a unit test.
#10
in reply to:
↑ 9
@
10 years ago
Replying to SergeyBiryukov:
You're right.
#11
in reply to:
↑ 4
@
10 years ago
Replying to tenpura:
Why don't we have mb_strlen() fallback in compat.php? I think it makes things easier and you've already had one: ticket:21013:21013.2.patch
Well, we only use it twice in core (the other instance is in POMO_Reader
), and it ended up not being required for #21013.
#13
@
10 years ago
- Keywords commit added
To summarize, there are two issues:
- If mbstring is disabled, we don't properly cut non-latin slugs with
strlen()
more than 30. This part is not a regression, see [6633], [6794], and [9986]. - Since [28948], the same applies to non-latin slugs with
mb_strlen()
of 30 or less andstrlen()
more than 30, if mbstring is enabled. This part is a regression.
29573.2.patch fixes both of these issues.
#14
@
10 years ago
- Keywords commit removed
I disagree. This patch would disable string truncation when mb_strlen is unavailable. Please add another unit test:
$post_name = 'This is a very long post name that is both UTF-8 and > 30 chars.';
If you could run that with the mb_strlen function disabled, you would find the string is not 'abridged'.
I think for 4.0.1 the best solution is to revert [28948].
#15
@
10 years ago
ASCII is a subset of UTF-8, so if the reason you used ! seems_utf8() is to skip multibyte UTF-8 characters, it should not work as intended.
#16
@
10 years ago
Two more takes:
29573.3.patch uses the same approach as we use in wp_trim_words() if mb_strlen()
is not available.
29573.4.patch introduces a compat version of mb_strlen()
to make things easier.
Replying to miqrogroove:
I think for 4.0.1 the best solution is to revert [28948].
I think so too.
#17
@
10 years ago
- Keywords commit added
I agree with reverting [28948] for 4.0.1. Marking that part for commit and will handle.
This ticket was mentioned in IRC in #wordpress-dev by SergeyBiryukov. View the logs.
10 years ago
This ticket was mentioned in IRC in #wordpress-dev by nacin2. View the logs.
10 years ago
#24
@
10 years ago
- Keywords commit added
Either 29573.3.patch or 29573.4.patch should be good to go. We just need to decide whether we want to add a compat version of mb_strlen()
, or just fix the issue at hand. I'm leaning towards 29573.4.patch.
#25
@
10 years ago
Maybe I'm missing something but: 29573.4.patch includes if ( ! in_array( $charset, array( 'utf8', 'utf-8', 'UTF8', 'UTF-8' ) ) )
which can be reduced to 'UTF-8' === $charset
because of _canonical_charset()
.
We should do the same for _mb_substr()
and wp_check_invalid_utf8()
.
#27
@
10 years ago
Shouldn't we be shortening that string from JS? Seems more appropriate and would work better. Perhaps try that in 4.2?
#28
@
10 years ago
- Keywords dev-feedback added
Seems like we need a decision and a new patch here. Any takers?
#29
@
10 years ago
IMHO best would be to just revert [28948] like in 4.0.1 for now and split and shorten that string from JS in 4.2.
If I remember right some years ago we had mb_strlen()
compat function but dropped it later (not sure if it made it to a release). Seemed nearly all hosts that host non English sites have the mb_* functions enabled. For 29573.3.patch, the u
modifier might not work on some hosts (PCRE is without UTF-8 support).
#30
@
10 years ago
Thinking more about this: why do we stick the ellipsis in the middle of the slug? Probably because at the time this was added, the CSS text-overflow: ellipsis;
wasn't working in all browsers.
However all currently used browsers support that now. The CSS ellipsis are used everywhere, including about 20 other places in the admin, and almost all users are familiar with them. Seems it is time to update the slug truncation too.
Fixes