#20263 closed defect (bug) (fixed)
Backticks in dbDelta cause warning and actually causes a query to alter all columns and indexes to run even if none have changed
Reported by: | Driskell | Owned by: | ocean90 |
---|---|---|---|
Milestone: | 4.6 | Priority: | normal |
Severity: | normal | Version: | 1.5 |
Component: | Database | Keywords: | has-unit-tests has-patch |
Focuses: | Cc: |
Description (last modified by )
Backticks in a CREATE TABLE in a dbDelta call are causing the below warning, and running a query to alter all columns and indexes with backticks even if they haven't changed:
Notice: Undefined offset: 1 in /home/backuptechnology/public_html/wp-admin/includes/upgrade.php on line 1544
Seems this started a long long time ago after this was fixed: #8014
Seems there is a match on line 1587 of WordPress 3.3.1 wp-admin/includes/upgrade.php:
// For every field in the table foreach ($tablefields as $tablefield) { // If the table field exists in the field array... if (array_key_exists(strtolower($tablefield->Field), $cfields)) { // Get the field type from the query preg_match("|".$tablefield->Field." ([^ ]*( unsigned)?)|i", $cfields[strtolower($tablefield->Field)], $matches);
The preg_match bit is what fails to match because it is looking for "fieldname" instead of "fieldname
".
So this is where the warning occurs, and why it assumes the type of the field is wrong.
This means the previous patch 3 years ago probably meant when you have backticks in your CREATE TABLE, when you run dbDelta it actually runs an ALTER TABLE for every single column in the table, and then an ALTER TABLE ADD INDEX / PRIMARY for every single KEY and PRIMARY (there's more than just the above code that needs adjusting), causing duplicate key errors, but ignored. Guess it kinda works though...
Also seems during the DESCRIBE it doesn't use backticks so if someone has a table with a reserved name (stupid I know) it would break dbDelta completely, but that's me digressing I just think getting rid of the warnings for CREATE TABLES with backticks will be good.
But explains why my plugin takes a few seconds to activate hahaha. :-)
I'll provide a patch at some point this week or next unless someone has already done so.
Attachments (10)
Change History (41)
#2
@
13 years ago
- Cc kpayne@… added
- Keywords has-patch added
I can replicate the original problem. Here's the sample plugin I used to test this:
https://gist.github.com/2151282
Just activate the plugin and look for errors. Deactivate the plugin to reset.
#3
@
13 years ago
- Version changed from 3.4 to 1.5
Definitely not new, setting version to 1.5 when dbDelta came around.
Adding ` to the list of characters to trim() might be enough, yes?
#4
@
13 years ago
Adding `
to the trim list won't fix it because a backtick can occur in the middle of the string:
`user_ip` int(11) DEFAULT 0,
#5
@
12 years ago
What's so complicated? Took me 10 seconds:
preg_match("|`?" . preg_quote($tablefield->Field, '|') . "`? ([^ ]*( unsigned)?)|i", $cfields[strtolower($tablefield->Field)], $matches);
This also fixes problems with tablenames that contain "special" characters, in MySQL for example foo.bar.[b-a-z] is a perfectly valid tablename, besides using non-sanitized external input is always a very bad idea.
However, this Regex is pretty fragile anyways, it can't handle multiple spaces, it can't handle tabs, it can't handle spaces between the parentheses enclosing the column length, etc... someone should really rewrite that whole stuff, it might explode by a slight sneeze.
#7
@
10 years ago
It's been couple of years. Can this be finally fixed because my plugins spits a ton of errors on each update. @timfs proposed a reasonable patch.
#8
@
10 years ago
Little context: Here with a mentor (Mark Jaquith) on contributor day WordCamp London 2015 and just looking to get my first commit going, found this ticket :)
We did this assumption: When you supply a CREATE TABLE statement, dbDelta()
should handle what MySQL gives you when you export the CREATE TABLE statement
I tried to simplify this case by first removing the first appearance of the column name from the query, and then ltrim()
from the remainder any remaining spaces and an optional backtick (in case the column name was enclosed in backticks). The string should now start with the data type, maybe having " unsigned" behind it. The remaining preg_match()
is quite simple now.
This ticket was mentioned in Slack in #glotpress by ocean90. View the logs.
9 years ago
#12
@
9 years ago
- Keywords needs-unit-tests added
We noticed the same with GlotPress which has a colum "references", see https://github.com/GlotPress/GlotPress-WP/pull/343.
It seems to work with BackPress because BP_SQL_Schema_Parser::delta()
supports backticks, see https://backpress.trac.wordpress.org/browser/trunk/includes/class.bp-sql-schema-parser.php?marks=118,130,144#L117.
#13
@
9 years ago
Just joining the conversation here, hoping this gets resolved since it seems like an easy fix. Still an issue in WP 4.4 :(
#14
@
9 years ago
20263.2.diff adds a simple unit test but it probably needs a few more.
@davidmosterd: I noticed that the strtolower()
for $tablefield->Field
got removed in 20263.diff. Not sure if that was intended or not.
#15
@
9 years ago
- Description modified (diff)
- Keywords has-unit-tests added; needs-unit-tests removed
- Milestone changed from Future Release to 4.6
- Owner set to ocean90
- Severity changed from minor to normal
- Status changed from new to reviewing
#16
@
9 years ago
Rather than doing the ltrim()
magic, I'd be inclined to add the `
s to the regex. That's the same as how wpdb::get_table_from_query()
behaves, as well as the BackPress example that @ocean90 linked to.
#18
@
9 years ago
20263.3.diff is 20263.patch plus an unit test.
#19
@
9 years ago
- Keywords needs-patch added; has-patch early removed
Turns out the approach in 20263.patch doesn't work because it would break a query where backticks are required, like for reserved keywords. 20263.4.diff includes a test for that.
In 20263.4.diff I tried to escape every field in our custom tables which doesn't quite work because of the array_search()
call.
I'll take another look tomorrow and will probably copy parts of BP_SQL_Schema_Parser:: get_index_definition()
and https://backpress.trac.wordpress.org/browser/trunk/includes/class.bp-sql-schema-parser.php?marks=118-143#L117 to wpdb
.
This ticket was mentioned in Slack in #glotpress by ocean90. View the logs.
9 years ago
#21
@
9 years ago
- Keywords has-patch added; needs-patch removed
20263.5.diff normalizes the index definitions so they match with the result of SHOW INDEX FROM
in https://core.trac.wordpress.org/browser/trunk/src/wp-admin/includes/upgrade.php?rev=37538&marks=2274-2337#L2274.
The regular expressions are copied from BackPress, slightly changed to have named capture groups.
This patch fixes also #34959, #34871, #34873 and #34869. #34874 would be possible too.
#22
@
9 years ago
In 20263.7.diff:
- Updated the patch to apply cleanly after [37574].
- Added
SPATIAL
to the indexpreg_match()
and tidied it up a little. - Merged two
trim()
calls at the start offoreach ( $flds...
. (The sequential trims didn't operate quite the same as previous usage.)
I like the direction of this patch. My primary concern is that the new index matching chunk of code is fairly dense, it could do with some whitespace, and maybe even some comments (forbid the thought!). I don't relish the idea of trying to remember how this all works when our next dbDelta()
maintenance spree comes around.
Oh, and I forgot to fix it it my patch - after foreach ( $index_columns...
we need to unset( $index_column )
.
#23
@
9 years ago
20263.8.diff is 20263.7.diff plus whitespace and a few more comments. It also includes the missing unset()
part.
#24
follow-up:
↓ 25
@
9 years ago
In 20263.9.diff I changed the regex to match a column/index name from \w
to (?:[0-9a-zA-Z$_-]|[\xC2-\xDF][\x80-\xBF])+
which is also used by wpdb::get_table_from_query()
.
Technically, MySQL supports more characters. From the docs:
Permitted characters in unquoted identifiers:
- ASCII: [0-9,a-z,A-Z$_] (basic Latin letters, digits 0-9, dollar, underscore)
- Extended: U+0080 .. U+FFFF
Permitted characters in quoted identifiers include the full Unicode Basic Multilingual Plane (BMP), except U+0000:
- ASCII: U+0001 .. U+007F
- Extended: U+0080 .. U+FFFF
Should we support all characters?
#25
in reply to:
↑ 24
@
9 years ago
Replying to ocean90:
Should we support all characters?
Let's split that out into a new ticket, because it may turn into a rabbit hole, and we'll need to change WPDB to add the same support.
#27
@
9 years ago
@ocean90 Cool :)
So, in the end my solution was not used at all? I was not able to do more on this; wish I could. I did not get props I see, which I get if my code was not good enough, just want to make sure it's no mistake ;)
#28
@
9 years ago
@davidmosterd You got props in [37538] for the initial patch. :) The patch was good, thanks for that, but as mentioned in comment:16 adding the backtick to the regex directly is a way less magical.
Remove backticks when comparing fields and indexes