#57898 closed defect (bug) (fixed)
Google Doc saved as .docx file not allowed in media library
Reported by: | winterstreet | Owned by: | audrasjb |
---|---|---|---|
Milestone: | 6.4 | Priority: | normal |
Severity: | normal | Version: | 6.1.1 |
Component: | Upload | Keywords: | has-patch has-screenshots changes-requested commit |
Focuses: | Cc: |
Description
On a site with no plugins, and a default theme, I get denied uploading a .docx file created from a Google Doc. If I upload a .docx file downloaded from Microsoft it works fine.
Attachments (6)
Change History (26)
#3
@
18 months ago
I have tested this in both WP 6.1.1 and WP 5.7.2 - I was able to reproduce the issue
Env
- WordPress 5.7.2
- Chrome Version 110.0.5481.177 (Official Build) (arm64)
- MacOS Monterey
- Theme: Twenty Twenty Three
Steps to test
- Add one .docx file downloaded from Google
This ticket was mentioned in Slack in #core-test by juhise. View the logs.
18 months ago
#5
follow-up:
↓ 10
@
18 months ago
<?php File: wp-includes/functions.php 3116: 3117: // Validate files that didn't get validated during previous checks. 3118: if ( $type && ! $real_mime && extension_loaded( 'fileinfo' ) ) { 3119: $finfo = finfo_open( FILEINFO_MIME_TYPE ); 3120: $real_mime = finfo_file( $finfo, $file ); 3121: finfo_close( $finfo ); 3122:
The main issue is that finfo_file()
is returning "application/vnd.openxmlformats-officedocument.wordprocessingml.documentapplication/vnd.openxmlformats-officedocument.wordprocessingml.document" mime_type for google docs. As we can see, it is just a redudant of mime type "application/vnd.openxmlformats-officedocument.wordprocessingml.document" because of that the validation is failing.
This ticket was mentioned in PR #4228 on WordPress/wordpress-develop by @mi5t4n.
18 months ago
#6
- Keywords has-patch added
Trac ticket: https://core.trac.wordpress.org/ticket/57898
#8
@
18 months ago
Hi! I am trying to deploy the patch, but I was stuck in errors.
Looks like the patch was not merged. Please check https://github.com/WordPress/wordpress-develop/pull/4228
#9
@
17 months ago
- Keywords has-screenshots added; needs-testing removed
- Owner set to audrasjb
- Status changed from new to accepted
I reproduced the issue as well. I tested the proposed patch and it appears it fixes the issue.
By the way, I'm curious about how did we get a redundant file info at the first place… 🤨
#10
in reply to:
↑ 5
@
17 months ago
- Keywords 2nd-opinion added
Replying to mi5t4n:
The main issue is that
finfo_file()
is returning
application/vnd.openxmlformats-officedocument.wordprocessingml.documentapplication/vnd.openxmlformats-officedocument.wordprocessingml.document
mime_type for google docs. As we can see, it is just a redudant of mime type
application/vnd.openxmlformats-officedocument.wordprocessingml.document
because of that the validation is failing.
Sounds like a bug in finfo_file()
?
Frankly I'm not sure about the patch in https://github.com/WordPress/wordpress-develop/pull/4228 as this is somewhat security related. If this is a bug in finfo_file() that seems to happen only for *.docx
files saved by Google Docs, lets add an exception for this particular case only. I.e. match the whole wrong mime type string and replace it with the right one.
#11
@
17 months ago
@audrasjb The finfo_file()
returns redundant mime type for google docs.
@azaozz You're right, it's an issue with the finfo_file()
function. It will be better to handle that specific edge case only. I have pushed new changes.
#12
@
15 months ago
Checked into this a bit, and wanted to confirm that it does seem to be an upstream bug:
https://bugs.php.net/bug.php?id=77784
The ticket indicates it may affect Excel / xlsx
files as well, returning application/vnd.openxmlformats-officedocument.spreadsheetml.sheetapplication/vnd.openxmlformats-officedocument.spreadsheetml.sheet
.
A commenter in the above ticket notes that they believe it's an issue in libmagic
. I had difficulty searching for references in the tracker for libmagic
for this issue, but if anyone is able to do so, please feel free to reference here.
If there isn't yet an open (or resolved) issue there, it'd likely be a good idea to open one so that it can be fixed upstream.
In the meantime, I'm not opposed to fixing the exceptions here like in the proposed patch.
#13
@
15 months ago
- Milestone changed from 6.3 to 6.4
I uploaded several files from my Google Disk - .docx and .xlsx and some of them have correct mime types and were uploaded fine and some of them have this duplicated string. I opened .xlsx file in Excel and resaved it, and it got the correct mime type. I wonder if we need to make a workaround for errors which were made by some other tool instead of addressing the source of this thing — Google Docs or any function/library they are using while saving a file as .docx or .xlsx. All .pptx filed I tried were uploaded successfully, but it doesn't mean that this can be the case also.
Solution is questionable and patch is not ready. And if there will be a patch, I believe that it will need Unit test as well. So, I am moving this ticket for 6.4 for further consideration.
#14
@
13 months ago
- Keywords changes-requested added
@azaozz and @mikeschroder we need a decision, what we are doing. I would have preferred to fix this in the source of the problem, but it looks like there things are not going anywhere, and due to inconvenience for WordPress users, I am suggesting to continue with the patch, but make a solution which will work for all doubled mime types which can accrue and make a proper description for this work around to make clear why we need such strange thing in the first place.
I didn't manage to get such doubled mime time with pptx, but I think, it needs to be checked for such possibility as well.
#15
@
13 months ago
@oglekler Since, it is an issue with docx, xlsx and some of the odt files as well. Should I have revert the PR which contains a solution which catches all the duplicated mime issues?
#17
@
13 months ago
- Keywords commit added; 2nd-opinion removed
The patch (PR) looks good imho. Fixes exactly this problem without affecting anything else.
@audrasjb commented on PR #4228:
13 months ago
#19
committed in https://core.trac.wordpress.org/changeset/56497
Hi,
I also reproduced the issue. the file extension is the same but file created by google can't upload.