Opened 18 years ago
Closed 16 years ago
#3206 closed defect (bug) (duplicate)
strip initial exclamation and question marks in permalinks
Reported by: | pandem | Owned by: | |
---|---|---|---|
Milestone: | Priority: | normal | |
Severity: | minor | Version: | 2.0.4 |
Component: | General | Keywords: | formatting i18n |
Focuses: | Cc: |
Description
These two symbols ¡ and ¿ are very common in Spanish titles, but they are not stripped out for the post slug. A similar bug has been reported and patched for 2.0.5 (http://trac.wordpress.org/ticket/2735), so this one could probably be fixed for the same release.
Change History (14)
#5
@
17 years ago
- Milestone 2.5 deleted
- Resolution set to fixed
- Status changed from new to closed
Going by foolswisdom's comment this can be closed.
#7
@
17 years ago
- Milestone changed from 2.5 to 2.7
- Resolution fixed deleted
- Status changed from closed to reopened
Based on reading #2735, I think the ticket suggests that these should be stripped a together, and this issue isn't fixed.
#8
@
17 years ago
The problem is that the ¡ and ¿ characters should be stripped out, just like the ! and ? already are. These characters are used in Spanish to start a question or an exclamation.
#9
@
17 years ago
Should we change the behavior to white list allowed characters rather than playing whack-a-mole every time someone points out a character that's 'undesirable' in URLs?
#10
@
17 years ago
Wouldn't we have to whitelist most of UTF-8? That's a big list. :-)
There's also the problem that changing what is stripped will break existing slugs made using the old code.
#11
@
17 years ago
I hoped I'd get that response ;-) I don't think the whitelist needs to be that large. To work, whitelist would limit the slug to just (see RFC 3986 Appendix A)
ALPHA / DIGIT / "-" / "." / "_" / "~"
I'm not sure that'd be well received but if we could find a way to make it work.
I think we have three options.
- Do nothing. This seems inconsistent, we have done something for other characters.
- Whitelist the small set of characters that won't be percent encoded according to RFC 3986
- Whack-a-mole filtering of characters as and when people ask for it.
From 4636, %c2% is left in the permalink in place of ¿