#17267 closed defect (bug) (fixed)

twentyeleven_url_grabber() needs work

Reported by:	nacin	Owned by:
Milestone:	3.2	Priority:	normal
Severity:	normal	Version:	3.2
Component:	Bundled Theme	Keywords:	has-patch close
Focuses:		Cc:

Description

/**
 * Grab the first URL from a Link post
 */
function twentyeleven_url_grabber() {
	global $post, $posts;

	$first_url = '';

	ob_start();
	ob_end_clean();

	$output = preg_match_all('/<a.+href=[\'"]([^\'"]+)[\'"].*>/i', $post->post_content, $matches);

	$first_url = $matches [1] [0];

	if ( empty( $first_url ) )
		return false;

	return $first_url;
}

The regex needs some work:

a.+href would match area href. While that isn't much of a concern, the greedy .+ would also skip past the first link to a later one on the same line.
Likewise, we should probably use a non-greedy (.+?) rather than ([^\'"]+) for what we're capturing.
We probably want the s pattern modifier there, in case there's a \n somewhere, such as after <a and before href.
No need for the .*> at the end, just grab the href and bail.

Something like this might work: '/<a\s[^>]+href=[\'"](.*?)[\'"]/is'.
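
To make the difference concrete, here is a small standalone sketch that runs the current pattern and the suggested one against the same markup. The sample content and variable names are made up for illustration. It also tries a variant with [^>]*? in place of [^>]+, on the assumption that a plain <a href=...> with nothing between the whitespace and href should still match.

```php
<?php
// Hypothetical sample content: two links on a single line.
$content = '<p><a href="http://example.com/first">One</a> and '
	. '<a class="x" href="http://example.com/second">Two</a></p>';

// Current pattern: the greedy .+ runs on to the last href on the line.
$current = '/<a.+href=[\'"]([^\'"]+)[\'"].*>/i';

// Pattern as suggested in the ticket: [^>]+ needs at least one
// character between the whitespace and href.
$suggested = '/<a\s[^>]+href=[\'"](.*?)[\'"]/is';

// Variant (an assumption, not from the ticket): [^>]*? also allows
// href to follow the whitespace directly.
$variant = '/<a\s[^>]*?href=[\'"](.*?)[\'"]/is';

preg_match( $current, $content, $m_current );
preg_match( $suggested, $content, $m_suggested );
preg_match( $variant, $content, $m_variant );

$current_url   = $m_current[1];   // second link
$suggested_url = $m_suggested[1]; // second link: first <a has nothing before href
$variant_url   = $m_variant[1];   // first link
```

With this input, only the [^>]*? variant returns the first link, so the quantifier may be worth adjusting before the pattern goes in.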

Also:

No need for the $posts global to be called upon.

Rather than preg_match_all, we should just use preg_match, since we want the first one. If ! preg_match, then we should return false. No need to assign the result to $output.

We should arguably use get_the_content(), since this is being called within the loop, rather than the raw, unfiltered data.
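
Taken together, the points above could look roughly like the sketch below. It is not the attached patch. The function name example_url_grabber() and its $content parameter are hypothetical, chosen so the snippet runs outside WordPress; in the theme, the content would presumably come from get_the_content() instead. The pattern uses [^>]*? rather than [^>]+, which is an assumption so that href may directly follow the whitespace.

```php
<?php
/**
 * Return the first link URL found in $content, or false if there is none.
 *
 * Hypothetical standalone variant for illustration only.
 */
function example_url_grabber( $content ) {
	// preg_match() stops at the first match, so neither preg_match_all()
	// nor an $output variable nor the $posts global is needed.
	if ( ! preg_match( '/<a\s[^>]*?href=[\'"](.*?)[\'"]/is', $content, $matches ) ) {
		return false;
	}

	// Keep the original behaviour of returning false for an empty URL.
	if ( '' === $matches[1] ) {
		return false;
	}

	return $matches[1];
}

// Usage: a newline between <a and href is handled by \s.
$found   = example_url_grabber( "<p>See <a\nhref=\"http://example.com/\">this</a>.</p>" );
$missing = example_url_grabber( '<p>No links here.</p>' );
```

Here $found holds the link URL and $missing is false, so callers can test the return value before using it.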

Anyone know what the output buffering was designed to do?

Attachments (2)

17267.diff (951 bytes) - added by duck_ 14 years ago.
17267-2.diff (868 bytes) - added by lancewillett 14 years ago: Refreshed patch with better comment text