Want to read Slashdot from your mobile device? Point it at m.slashdot.org and keep reading!


Forgot your password?

Submission + - Regex Golf, XKCD And Peter Norvig (i-programmer.info)

mikejuk writes: A recent xkcd cartoon has started some deep academic thinking. When AI expert Peter Novig gets involved you know the algorithms are going to fly. Code Golf is a reasonably well known sport of trying to code an algorithm in the shortest possible code. Regex Golf is similar, but in general the aim is to create a regular expression that accepts the strings in one list and rejects the strings in a second list. The xkcd cartoon in question http://xkcd.com/1313/ revealed that this is but the first step. Programmers like recursion and a regex is a string after all and a regex can process a string so a regex can process a regex and this means you can have meta-regex golf and meta-meta-regex golf.... Yes my friend, it's regexes all the way down!
The hover over text gives a regular expression that matches the last names of the elected US presidents, but not the losers. This started Peter Norvig, the well-known computer scientist, director of research at Google and wearer of brightly colored shirts, thinking about the problem. Is it possible to write a program that would create a regular expression to solve the xkcd problem? The result is an NP hard problem that needs AI like techniques to get an approximate answer.
To find out more read the complete description, including Python code, at Peter Norvig's blog post http://nbviewer.ipython.org/url/norvig.com/ipython/xkcd1313.ipynb which ends with the challenge:
"I hope you found this interesting, and perhaps you can find ways to improve my algorithm, or more interesting lists to apply it to. I found it was fun to play with, and I hope this page gives you an idea of how to address problems like this."

This discussion was created for logged-in users only, but now has been archived. No new comments can be posted.

Regex Golf, XKCD And Peter Norvig

Comments Filter:

Top Ten Things Overheard At The ANSI C Draft Committee Meetings: (8) I'm on the committee and I *still* don't know what the hell #pragma is for.