Opened 8 years ago
Last modified 6 years ago
#39190 new defect (bug)
RSS feed not valid: U+001A or 0x1a
Reported by: | pavelevap | Owned by: | |
---|---|---|---|
Milestone: | Awaiting Review | Priority: | normal |
Severity: | normal | Version: | |
Component: | Formatting | Keywords: | |
Focuses: | Cc: |
Description
I am not sure how is it possible, but one of our users copy/paste text from another website into TinyMCE and content is now filled with strange non-visible Unicode characters U+001A
/ 0x1a
(001a
in JSON response).
It is not visible in TinyMCE or frontend, but it is saved in database and also RSS feed is considered as invalid (displayed without problem in browser, but characters are there).
RSS feed is processed by other custom scripts and we are receiving following messages (different validators):
PCDATA invalid Char value 26
Input is not proper UTF-8, indicate encoding! Bytes: 0x1A
I can replace those characters and add a check before saving content into database, but it should not be possible for users to break their feeds by copying content into TinyMCE?