I want to let the user type in tags:

windows linux "mac os x"

and then split them up by white space but also recognizing "mac os x" as a whole word.

Is this possible to combine the explode function with other functions for this?

There has to be a way.

share|improve this question

1  
Tell the user to use mac-os-x :) – AntonioCS Dec 19 '09 at 10:23
I would create a dictionary file of tags and use the levenshtein function to find the best match. – Kevin Dec 20 '09 at 4:46
feedback

5 Answers

up vote 2 down vote accepted

As long as there can't be quotes within quotes (eg. "foo\"bar" isn't allowed), you can do this with a regular expression. Otherwise you need a full parser.

This should do:

function split_words($input) {
  $matches = array();
  if (preg_match_all('/("([^"]+)")|(\w+)/', $input, $reg)) {
    for ($ii=0,$cc=count($reg[0]); $ii < $cc; ++$ii) {
      $matches[] = $reg[2][$ii] ? $reg[2][$ii] : $reg[3][$ii];
    }
  }
  return $matches;
}

Usage:

$input = 'windows linux "mac os x"';
var_dump(split_words($input));
share|improve this answer
feedback

When the user is entering the string "mac os x" you can automatically detect the white space and change to string to "mac-os-x" then you can still explode this way:

$os = "metasys solaris mac-os-x";
$strings = explode(' ', $os);

You can do this using the replace function.

share|improve this answer
This implies that the user is entering tags one at a time, in which case, it's possible to keep the tags separate from the beginning. Also making the user convert spaces to "-" isn't very usable. – Justin Johnson Dec 19 '09 at 12:25
Even if the user will be entering the tags one at a time you aren't going to be sending it to the server immediately. And by the way i talking about entering a tag and just hitting the enter button. It's user friendly and it it is equivalent to spaces in between. Users just have to hit ENTER in my case and SPACE BAR in the other case. – Colour Blend Dec 19 '09 at 14:59
feedback

Either have the user separate their tag values with commas as Elzo Valugi suggested, or improve on your UI so that users enter one tag at a time (similar to Google Wave or Wordpress's tagging UI). I suggest the later.

If you really want to stick with your proposed entry format (which I don't suggest), you could maintain a list of multi-word tags (those that aren't supposed to be split). Compare the combined tag string provided by the user against this list and make sure that you don't split those terms. If you're set on sticking to this method, I could go into the details more, but I don't think it's a good idea as the entry format itself is flawed.

share|improve this answer
The underscore _ could also be used in tags in place of spaces, then a simple str_replace done. – Zahymaka Dec 19 '09 at 10:22
That's not really a legitimate request to make of a user – Justin Johnson Dec 19 '09 at 12:21
feedback

I would ask the user to enter the tags commas separated and explode with comma delimiter:

$string = "windows, linux, mac os x";
$pieces = explode(',', $string);

This is they way most tag system work anyway.

otherwise you'll need to construct a parser because explode cannot cope with what you want. Regex is an overkill in my opinion.

share|improve this answer
feedback

You could do a regex. I'm not the best at writing them, but someone else here should be able to match the 'words' breaking them on spaces that aren't in quotes.

share|improve this answer
feedback

Your Answer

 
or
required, but never shown
discard

By posting your answer, you agree to the privacy policy and terms of service.

Not the answer you're looking for? Browse other questions tagged or ask your own question.