﻿id	summary	reporter	owner	description	type	status	priority	milestone	component	version	severity	resolution	keywords	cc
13590	Inserting a 4-byte UTF-8 character truncates data	sardisson		"WordPress 2.9.2 on Apache 2.2.15, MySQL 5.1.46-log, PHP 5.2.13 

I was writing a post in which I used the ""tetragram for advance"" (U+1D319) in both the post title and in the body of the post (as a raw UTF-8 glyph rather than an entity).

When I had WordPress save a draft of the post, both the title and the post body were truncated at the point where U+1D319 had been (U+1D319 was also removed).  

(In addition, in the permalink field, WP generated something that was represented by the glyph for ""invalid codepoint"" [black diamond with ? inside] on Mac OS X, rather than successfully percent-encoding the glyph as WordPress does for other non-ASCII characters in post titles when generating permalinks.  And, although I manually percent-encoded the glyph for the URL, the permalink ended up being 404.  I suspect there's a whole host of places where unexpected glyphs cause problems?)

On the one hand, this is very much a dataloss issue (I lost 1/3 of my post), but on the other hand it's probably not likely to happen often in real-world usage, so I've left priority and severity set to default values ;)"	enhancement	closed	normal		Database	2.9.2	normal	maybelater	has-patch	aercolino
