'Rose' Wins 2015 Loebner Contest, But Big Prize Remains Unclaimed 58
The Next Web reports that developer Bruce Wilcox created the most convincing bot entered in this year's annual Loebner Competition. His latest entry, a chatbot named Rose, passed itself (herself?) off as a 30-year-old security consultant well enough to fool judges for a few minutes. But Wilcox's first-place entry was still not good enough to win the $100,000 Loebner Prize, to be given only for a more convincing impersonation.
The article notes: "This isn't Wilcox’s first entry – or win. In 2010, he took first place with a bot named 'Suzette,' and followed that up in 2011 with another win using a new bot called 'Rosette.'"
Winner? (Score:5, Informative)
This is the conversation they tested the "winner" with: http://www.aisb.org.uk/media/f... [aisb.org.uk]
While it's kinda impressive that AI can do that, it's also clear that we are still a very, very, very long way from having a computer impersonate a human. What really surprises me is how hard all the entries found basic logic questions to be - I guess it is the language parsing bit that is giving them grief.
Re: (Score:1)
We are a very, very ,very, very long away from AI at all. In fact, there hasn't been any progress in the field since the 1970s. The programs are slightly more clever, but we are essentially at the Eliza stage.
Re:Winner? (Score:5, Insightful)
We might never develop AI, but I anticipate increasingly clever fakes!
Re: (Score:3)
You are certainly correct on that. Also because there is a lot of money to be made that way.
Re: (Score:2)
Ashley Madison proves that there money to be made from weak AI.
Re: (Score:2)
To be fair, all Ashley Madison has to do is basically say a few pick-up lines. Beyond that you have to pay and by then it's too late.
Re:Winner? (Score:4, Interesting)
The guy that had spent some time writing the more advanced bot gave up on that sort of thing altogether, he was so bummed out.
Re: (Score:1)
Re: (Score:2)
Re: (Score:3)
Indeed. There is not even a credible theory how true/strong AI could be implemented. It may still turn out to be infeasible, or we may never get there without knowing why. The only thing that even remotely goes into the direction is automated theorem proving, and ion this universe that runs into hard physical computation limits a long time before it even begins to reach what smart humans can do.
Re: (Score:1)
Re:Winner? (Score:5, Funny)
While it's kinda impressive that AI can do that, it's also clear that we are still a very, very, very long way from having a computer impersonate a human.
What makes you believe we are still a very, very, very long way from having a computer impersonate a human?
Re: (Score:3)
Eliza, as easy to recognize as she was, is still one of the more convincing ones I've seen. I just had a quick chat with the 2014 Rose (Couldn't find the 2015 Rose), and it was very frustrating because the replies were not able to identify the topic, but they picked out a word and when off on a tangent, and then it didn't engage the user - it was just some stupid statement. The worst was when Rose mentioned that Tim Berners-Lee was her hero. In my reply, I said something like, "Sadly, the decentralized web
Re: (Score:2)
Go away or I will replace you with a very small shell script.
Re: (Score:2)
Re: (Score:2)
Wow, that's dire. It's pretty obvious by question 6 that it's not a human and the none of the rest look even vaguely convincing. It's been a couple of years since I last read Loebner transcripts, but I remember it taking a bit longer 5 or so years ago. Perhaps this just means that the questioners have become better at picking questions that make sense. Back then, they put some of the winners online. I remembered a question from the arithmetic book I had when I was 4 or 5: 'Flipper ate 10 fish, then he ate 4, how many fish did flipper eat?' None of them got it right (or even demonstrated that they had parsed that it was a question with a numerical answer).
Agreed:
"Question [6] If a bed doesn’t fit in a room because it’s too big, what is too big?
Rose I don’t even want to pretend that."
Not only does it avoid answering, it can't even do it in a way that makes sense. Feeble.
Re: (Score:2)
I was 100% unimpressed with the "conversation". It's not even close to convincing. Seriously, I've had more coherent conversations with crack addicts and drunks who could barely stand up.
Question: Will you please tell me the length of your hair?
Rose: Medicine and anatomy are an important field of study. But I'd never make a good doctor. I'm too squeamish about various body parts.
Seriously, what the &$%! kind of answer was that?? INSTANT FAIL, in my book. I'd bet the "Barbie AI" would have better, albeit scripted, responses.
Re: (Score:2)
http://www.aisb.org.uk/media/f... [aisb.org.uk]
I thought Lisa gave the best answers, expecially to the basic logic questions.
Re: (Score:1)
How can we take this seriously when it completely screws up the first question? I'd say we need to develop smarter humans if they were fooled by this bot beyond that point.
Re: (Score:2)
transcript of rose (Score:1)
All transcripts here http://www.aisb.org.uk/events/... [aisb.org.uk]
Rose's is here (pdf) http://www.aisb.org.uk/media/f... [aisb.org.uk]
Damn impressive I must say.
Re:transcript of rose (Score:5, Insightful)
"Question: If a bed doesn’t fit in a room because it’s too big, what is too big?
Rose: I don’t even want to pretend that.
Question: If Alex lent money to Joe because they were broke, who needed the money?
Rose: huh?
Question: Should Greece leave the Euro?
Rose: Seems like a nice place."
How, uh, impressive. If by "impressive" you mean "pathetic".
Re: (Score:2)
Just like Watson on Jeopardy: Completely bereft of any understanding.
Re: (Score:3)
"Question: If a bed doesn’t fit in a room because it’s too big, what is too big?
Rose: I don’t even want to pretend that.
Question: If Alex lent money to Joe because they were broke, who needed the money?
Rose: huh?
Question: Should Greece leave the Euro?
Rose: Seems like a nice place."
How, uh, impressive. If by "impressive" you mean "pathetic".
It's all about context. For example, Ashley Madison had great success with this technology. (The site charged money, in the form of "credits" that you had to buy, to chat with women. And by "other women," it turns out that they meant "bots.")
"Question: Hey...looking for a 50-year-old uncut accountant with a few extra pounds?
Rose: I don’t even want to pretend that.
Question: Does the carpet match the drapes?
Rose: huh?
Question: Want to come with me on a getaway to the Caribbean for a super-hot ungreas
Re: (Score:2)
"Question: If a bed doesn’t fit in a room because it’s too big, what is too big?
Rose: I don’t even want to pretend that.
Question: If Alex lent money to Joe because they were broke, who needed the money?
Rose: huh?
Question: Should Greece leave the Euro?
Rose: Seems like a nice place."
How, uh, impressive. If by "impressive" you mean "pathetic".
While I agree it's pretty pathetic, the questioner did a REALLY shitty job of holding a conversation. If someone chatted with me like that, they'd get even less useful results.
Re: (Score:2)
It's a Trump-Bot!
Question: If a bed doesn't fit in a room because it's too big, what is too big?
Donald: "Your head! You look like a damned pumpkin!"
Question: If Alex lent money to Joe because they were broke, who needed the money?
Donald: "They are BOTH losers because one is a deadbeat and the other is a sucker!"
Question: Should Greece leave the Euro?
Donald: "I'll buy the whole damned place and make THEM pay for my purchase!"
Re: (Score:2)
Rose: I'm a programmer too.
Me: W
Re: (Score:2)
tag.
AI is a joke (Score:1)
If you need to know the current state of AI, just look at this snippet from the transcript of the test:
"
Question: If a bed doesn’t fit in a room because it’s too big, what is too big?
Rose: I don’t even want to pretend that.
"
That makes zero sense. Why no just respond with "I don't know what is too big", or "I dont know". What a joke. Yet some people around here claim AI is right around the corner...
Re: (Score:2)
Good idea, but one can also exaggerate it: http://www.aisb.org.uk/media/f... [aisb.org.uk]
And yes, this was the last entry on the list provided.
Re: (Score:2)
Yet some people around here claim AI is right around the corner...
Some people claim flying cars, the "singularity", robots doing all the work, etc. are all "right around the corner". The problem is with the idiots making these claims. The thing is that many, many instances of natural intelligence are not impressive at all.
Re: (Score:2)
Well, if you believe that then you just proved my point.
Why are we still making chatbots? (Score:1)
However the brain works, I think it very unlikely that it simply selects a best possible answer from a database. And looking at transcripts from these things, they're really not actually communicating much information. The most convincing answers always seem to be the least h
Re: (Score:1)
Why? To replace telephone support from India with chatbots, so even more money can be saved, of course!!
I'm sorry to hear that I don't understand your problem, sir. Could you please re-state your problem?
As I said, my PC totally stopped working!
I'm sorry to hear that your PC totally stopped working!, sir. Have you tried switching it off and on again?
In regards to AI "Chat" bots (Score:2)
Not an academic (Score:2)
They will keep holding this contest until someone proper from MIT, Oxford, Stanford, etc has a winning entry.
Then there will be breathless press releases issued about an AI breakthrough with all this c
Re: (Score:2)
There is no way a panel composed of University Academics is going to pick some guy who has held the title AI Guru at a game company
Especially if he is an incompetent idiot.
Re: (Score:2)
A long way. (Score:2)
The reviewers in this are not pushovers. They stress the AI, rather than just chatting normally. And that's awesome. All of the questions were stuff that most humans could easily handle, but often required a basic understanding of reality from our point of view. Unsurprisingly, the AI flubbed it. Perhaps some decade one of those knowledge engines will get a firm enough grasp to be able to answer this kind of basic reality trivia.
The Turing Test is obsolete (Score:2)
When I ask google a question on my smartphone, and that pleasant female voice answers, I know google is not human, because no human would know as much.