Opened 9 years ago
Closed 9 years ago
#32125 closed defect (bug) (fixed)
Emoji script: multiple second execution time when using AngularJS to insert text
Reported by: | brad989 | Owned by: | azaozz |
---|---|---|---|
Milestone: | 4.3 | Priority: | normal |
Severity: | normal | Version: | 4.2 |
Component: | Formatting | Keywords: | |
Focuses: | Cc: |
Description
I'm building a theme using AngularJS. After updating to 4.2, I experienced a significant slowdown when viewing posts (handled by custom javascript, not php theming functions), due to the emoji script. The script execution time appears to scale with the length of the post - longer posts making the execution time several seconds, with single word posts only needing a few ms.
Chrome's JS Profiler points at the parseNode function in wp-emoji-release.min.js (I unminified to inspect)
That function was called over 2000 times for one of my articles. I think the way that angular manipulates the DOM may be triggering duplicate calls.
Attachments (4)
Change History (29)
#1
follow-up:
↓ 2
@
9 years ago
I, uh, accidentally uploaded a picture of my dog instead of the chrome profiler at first and I'm not sure how to delete it. It's my first time here. :)
#2
in reply to:
↑ 1
;
follow-up:
↓ 3
@
9 years ago
Replying to brad989:
I, uh, accidentally uploaded a picture of my dog instead of the chrome profiler at first and I'm not sure how to delete it. It's my first time here. :)
I've removed it for you :-)
#3
in reply to:
↑ 2
@
9 years ago
Replying to DrewAPicture:
Replying to brad989:
I, uh, accidentally uploaded a picture of my dog instead of the chrome profiler at first and I'm not sure how to delete it. It's my first time here. :)
I've removed it for you :-)
Haha, thank you!
#4
@
9 years ago
- Milestone changed from Awaiting Review to 4.3
Seems there are two ways this can be optimized:
- Get the new or modified nodes
textContent
and dotwemoji.test()
before runningparse()
. That is still a big regex but will be much faster in case there are no emoji chars. - Implement something like Underscore's
_.throttle()
that will execute as soon as called but delay the next execution by 100ms -- 200ms.
#6
follow-up:
↓ 7
@
9 years ago
How many DOM manipulations are you making? Is it there anywhere we can see it? If you're using JS for the theme it might be better to remove the script and parse user generated content as strings before using it.
#7
in reply to:
↑ 6
@
9 years ago
Replying to iseulde:
How many DOM manipulations are you making? Is it there anywhere we can see it? If you're using JS for the theme it might be better to remove the script and parse user generated content as strings before using it.
So, weirdly, I only experience this issue on posts where I've copy pasted text from the internet into the new post dialog. Loading a post where I've manually typed text into the new post dialog works as expected.
You can see this here: ((link deleted - i've removed the script demonstrating the bug from the site, but I'm not doing outrageous DOM manipulation))
Might also be worth noting that I'm loading the data using the WP API.
How would I go about removing the emoji script for the time being?
EDIT: Just going to use this for now. https://wordpress.org/plugins/disable-emojis/
#8
follow-up:
↓ 9
@
9 years ago
In 32125.patch: test if any added nodes contain text with emoji chars before parsing them. That optimizes/speeds up processing of added nodes quite a bit. The testing regex in Twemoji is quite large but still much better than parsing nodes needlessly.
@brad989: can you check if this fixes it in your case.
#9
in reply to:
↑ 8
@
9 years ago
Replying to azaozz:
In 32125.patch: test if any added nodes contain text with emoji chars before parsing them. That optimizes/speeds up processing of added nodes quite a bit. The testing regex in Twemoji is quite large but still much better than parsing nodes needlessly.
@brad989: can you check if this fixes it in your case.
The patch does indeed speed up the execution time for posts without special character codes quite a bit (from ~80ms to 4ms).
However, when the posts include special character codes that aren't emojis (’ for apostrophes, for instance), I'm still getting 1000ms+ execution times when angular tries to compile it.
#10
follow-up:
↓ 12
@
9 years ago
Related: #32197
More info: The problem I posted there only occurs when the "Enable full-height editor and distraction-free functionality." is enabled and you're in Text mode, not in Visual mode. Same profiler results.
#12
in reply to:
↑ 10
;
follow-up:
↓ 16
@
9 years ago
Replying to Otto42:
Related: #32197
Yes, suspecting this is the exact same problem with Chrome and probably WebKit. Note that the Twemoji regex is used for both testing and replacing the chars. It is a huge difference, according to the fiddle over 2800 times slower! @brad989, could you confirm the performance in Firefox and IE11.
This will eventually be fixed in the affected browsers, not sure how we can mitigate it until then. Perhaps we can try to make a shorter/faster regex to test for emoji chars, even if it is not as precise?
#13
@
9 years ago
FYI, I reported this issue to Chromium and created a jsfiddle to demonstrate the issue:
https://code.google.com/p/chromium/issues/detail?id=482998
https://jsfiddle.net/7vo8wteu/
Try that jsfiddle in Firefox and a few other browsers to see a marked difference. I'm convinced this has something to do with the v8 irregexp code, but no idea what. Maybe it has a different code path there for unicode optimization or something.
(for those who may have missed it in the other ticket)
#14
@
9 years ago
@azaozz: Results for that jsfiddle on my Windows 8.1 laptop:
Firefox 37.0.2: 2 ms and 65ms. Results very consistent. Almost no variation.
IE 11.0.18: 525ms and 482ms. The results vary quite a bit, but always hover around 500ms for both tests. No appreciable difference. The unicode case appears to be slightly faster on most runs.
Chrome 43.0.2357.45 beta-m (64-bit): 2ms and 2880ms. Almost no variation.
#15
@
9 years ago
Yep, pretty much the same results here. Chrome 43/Win7 1ms and 2840ms...
Going to try to make a shorter regex, perhaps using ranges? Maybe will get better results in Chrome then.
#16
in reply to:
↑ 12
@
9 years ago
Replying to azaozz:
@brad989, could you confirm the performance in Firefox and IE11.
Yep, I only experience the big delay on Chrome. Firefox, IE, and Safari are all much quicker.
#17
@
9 years ago
Came up with 32125.2.patch. Includes all lower emoji (apparently there are quite a few of them, see http://www.unicode.org/Public/emoji/1.0/emoji-data.txt), the "Supplementary characters" and the "Private Use Area # 2".
#18
@
9 years ago
Actually we can probably reduce the second regex to /[\uDC00-\uDFFF]/
(test for the second half of the surrogate pair) as we only need to check if there are emoji chars in the string, not match them.
#19
follow-up:
↓ 20
@
9 years ago
In 32125.3.patch: added couple of missing chars to the singles, reduced the pair regex to test only for the second half of the surrogate pairs.
Tested with http://www.unicode.org/Public/emoji/1.0/emoji-style.html. Detects 1251 emoji, skips "U+00A9 copyright sign" and "U+00AE registered sign" (do we want to replace these with images too?). Needs more testing :)
JS Profiler