Man vs App on Jeopardy tonight.
Tonight is the debut of "Watson", the Jeopardy-playing app.
Quote:
|
Read this about it the other day, written by Bob Harris, 8-time Jeopardy winner and participant in Jeopardy Masters Tournament. I think it's a good summation of what the event "means", if anything.
|
So here's something I've been trying to figure out, but haven't seen anything that explains it. How does Watson understand what the categories mean? Especially the punny type categories. Does it have some definition of the kinds of puns and clue structures that Jeopardy has? Does it try to figure it out from the title? Or does it just ignore it and go purely from question content?
Judging from what we saw tonight, I'm guessing option C. Compare Watson's performance in the "APB" category to the decades category. It kicked butt in the APB category since, while the clue structure was weird, it was chock full of keywords and had very straightforward answers. Just plug in the keywords, voila, answer. Whereas in the decade category, just plugging in the keywords that are in the clue would perhaps bring up a bunch of dates, but wouldn't lead directly to answering in the form of a decade (and indeed, on a few of them, the Watson display showed it wasn't even considering a decade as an answer). So, if that's the case, I'm more impressed that it's really trying to figure out the KINDS of answers that it should be looking for, rather than just being fed the category definitions. Nifty! And they never show enough detail on the server racks on these things. What kind of storage are they using? What about networking? I presume the servers are clustered, and using 1GbE, but is there a separate storage network? Fibre channel? iSCSI? Infiniband? (hahahaha.........okay, I'm the only one who thinks that's funny) |
Infiniband!!! Bwah hahahahaha !
Infiniband? |
Quote:
Assuming it's not a typo, Watson has 4TB of storage (the same as my home PC) and 16TB of memory. Quote:
|
WTF! The final jeopardy question was leaked, I'm thinking the fix is in.
Spoiler:
|
Sorry, your question must be phrased in the form of an answer.
|
This has been so interesting! Watson is killing them. I think the buzzer thing just wasn't calibrated well to allow them to get in.
Watching what Watson gets wrong is so illuminating. Seems the quiz writers put in some extra effort on the Final Jeopardy question and succeeded in flummoxing Watson. Even so, his interestingly particular betting ability kept him out of trouble. The alternate answers chart on the screen including confidence in answers makes this even more fun. It cracks me up to see the computer scientists in the audience applauding their baby. |
Well, unless they built in an uncertainty factor (and I haven't been paying attention much) Watson should essentially always win a buzzer battle. That particular piece of technology (responding immediately to a visual signal) is pretty old and should trigger at microsecond levels.
Out of curiosity, did they eliminate video clues from the game or is Watson up to those as well? |
There were no video clues in the first go around.
|
Quote:
Obviously that is not the case. Watson is clearly winning the battle of the button press time and time again when all contestants clearly know the question long before Alex finishes reading the answer. Still the Watson App does have to come up with the correct answer very quickly and is doing that extremely well. |
But all Watson has to do is react to an electrical signal telling him when the buzzers are unlocked, which if it has already decided to answer should be an instant reflex.
For the humans there is a light that goes on to tell when the buzzers are unlocked. That unlocking is done by an assistant producer who tries to time it to Trebek finishing the question. So if the player waits until the light goes on then they're at an obvious huge disadvantage in terms of reflex. But good players don't do that; they try to get a sense of the timing and guess when the light will come on so that they have started the press before it is actually possible to buzz in. If they time it correctly it is theoretically possible they could slip into the window of whatever lag Watson has, but that has to be an incredibly small opportunity. And the humans have a disadvantage that Watson doesn't: if the human guesses slightly wrong and buzzes in a microsecond early then they are locked out of buzzing in for a significant fraction of a second (that's why sometimes you see someone mashing the buzzer even when nobody else is trying). So I don't see how, on those questions where Watson has decided it will answer, it can consistently be beaten to buzzing in. |
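To put rough numbers on the buzzer argument above, here is a toy simulation. Every figure in it is an assumption made up for illustration (the 50 ms human timing error, the 250 ms lockout, the 10 ms machine latency); none of them comes from the show or from IBM. It only models the mechanism as described: the human aims at the unlock instant, an early press costs a lockout, and the machine presses a fixed short time after the unlock signal.

```python
import random

def human_buzz_time(rng, timing_sd=0.05, lockout=0.25):
    """Seconds after unlock at which an anticipating human's press registers.

    timing_sd and lockout are assumed values, not measured ones.
    """
    t = rng.gauss(0.0, timing_sd)  # aim at the unlock instant, with timing error
    # A press before the unlock (t < 0) triggers the lockout penalty; we assume
    # the player keeps pressing and the next valid press lands one lockout later.
    while t < 0:
        t += lockout
    return t

def human_win_rate(trials=100_000, machine_latency=0.01, seed=0):
    """Fraction of races where the human registers before the machine."""
    rng = random.Random(seed)
    wins = sum(human_buzz_time(rng) < machine_latency for _ in range(trials))
    return wins / trials

if __name__ == "__main__":
    print(f"human wins about {human_win_rate():.1%} of buzzer races")
```

Under these made-up numbers the human only gets in when the press lands in the tiny window between the unlock and the machine's latency, which is the "incredibly small opportunity" in the post.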
Quote:
Still from another point of view, the question being asked by the Watson project is can a machine answer questions as well and as fast as the best humans. In that respect I think the answer so far is yes. |
Sure, but to the extent it is also a test of whether a machine can win at Jeopardy it has an unfair advantage due to the buzzer mechanism.
And we're not actually learning if Watson can answer more Jeopardy questions correctly than Ken Jennings. It could be that Jennings knew 58 answers to Watson's 52 but it doesn't matter if Watson gets to answer all of the 50 they have in common. But yes, it is certainly a great demonstration of the technology, I just quibble as to what exactly it is showing. |
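The 58 versus 52 example above works out like this, as a quick sketch. It assumes the extreme case from the post, where Watson wins the buzzer on every clue both of them know; the function name is just for illustration.

```python
def answered_counts(a_known, b_known, shared):
    """Clues each player actually gets to answer if B wins every shared buzz.

    A only gets the clues that B does not know; B gets everything it knows.
    """
    return a_known - shared, b_known

jennings, watson = answered_counts(a_known=58, b_known=52, shared=50)
print(jennings, watson)  # 8 52
```

So the player who knows more answers ends up responding to 8 clues against 52, which is why the final score says little about who knows more.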
Quote:
|
From the Jeopardy perspective, sure (though I don't expect I'd be that entertained, for the reasons mentioned, they already knew the computer could answer the questions or it wouldn't be on the show, once that's the case most suspense is gone). But from the computer science perspective I think they are important distinctions.
|
Quote:
Quote:
|
We TiVod the episodes (I was out of town for a couple of days) and we just watched the first episode last night.
I was surprised to learn that Watson gets the questions as a text file: I would have thought speech recognition has evolved enough to use. But then I started wondering - how was Watson getting the trigger to know when to push the button? But at least he has to push the same button as the players (albeit, electro-mechanically driven). And yes, I am changing my view just a little on the show - it is a little more educational than I expected. It can be amusing to see some of the responses that Watson has been coming up with. |
On the trigger. When the production assistant activates the buzzers after Trebek is done reading the question, this turns on a light that the human players see; for Watson it sends a digital signal.
As mentioned, that gives Watson an advantage since it is capable of essentially instantaneous reflexes and can't (as human players risk when trying to time the activation) buzz in early, causing a lockout. The other advantage is that by getting a text file Watson was probably often well on its way to an answer before the human players had even had a chance to begin comprehending what it was asking. Like I said, it is possible that the humans knew more answers than Watson, but that is almost irrelevant if on any question that Watson also knows he is almost guaranteed to win at the buzzer. |
So, as I see it, the two flaws in this game are:
|
Quote:
Quote:
|
What Moonie said, a computer can still react faster and (more importantly) more consistently to a light than a human. There's not a whole lot that can be done about it. The BEST they could do is introduce an artificial random delay to mimic human hesitation - but it still doesn't leave Watson susceptible to the "early trigger lockout" hazard that humans are.
|
I saw an article that asked about speech recognition and the engineers said getting an adequate recognition capability is still a decade away.
As for responding to the light that would really be just as instantaneous as responding to a signal sent directly to him (since all the signals involved are moving at the speed of light). Oops, missed the next page with the two previous responses. |
Quote:
But let's assume the article is right and there really is still a big gap to traverse to get fast enough voice-to-text, how about visual processing? Just have Watson read the clue off the screen. My cheap-o scanner does darn good OCR, it couldn't be that difficult to get Watson to read the very legible Jeopardy board. |
True, I'm sure there was hyperbole in that number and probably none of the people on the Watson team are experts on the state of voice recognition. But I could see it being not so much of a problem with accurate transcription as the processing time of the transcription. 1 second of lag for visual voicemail isn't noticeable but one second of lag for Watson would have just flipped the buzz-in advantage since unlike the human players Watson wouldn't be able to read faster than Trebek talks.
OCR would be fine, but once in place, dealing with a completely standardized font, it probably wouldn't be a significantly slower interface than just getting it as a text file. But that part really isn't a big deal; as Ken Jennings has said, the best players almost always know the answer (or have comprehended the question well enough to know they will know the answer and want to buzz in) before the buzzers are active, so it all comes down to that. To eliminate the buzz-in advantage I think what I might have done (though I haven't thought this through very much) is run response-time tests with the two human players to see what their average buzz-in times were after activation for questions they knew the answer to, along with the standard deviation, and then program a random delay into sending the signal to Watson that matched that statistical distribution. Then we'd have a true test of Jeopardy skills instead of the already known fact that a person who knows a lot of answers but always wins the buzz-in will usually beat the person who knows all the answers but can't buzz in if anybody else does too. I could beat Ken Jennings at Jeopardy if I always had first option to answer. |
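The random-delay idea above could look something like this. It is only a sketch: it assumes the human buzz-in times are roughly normally distributed, and the sample times below are invented, not real measurements from any player.

```python
import random
import statistics

def make_delay_sampler(human_times, seed=None):
    """Return a function that draws a delay (seconds) to apply to the machine.

    Fits a mean and standard deviation to measured human buzz-in times and
    samples from a normal distribution with those parameters (an assumed
    model), clipping at zero so the delay is never negative.
    """
    mu = statistics.mean(human_times)
    sigma = statistics.stdev(human_times)
    rng = random.Random(seed)

    def sample_delay():
        return max(0.0, rng.gauss(mu, sigma))

    return sample_delay

# Hypothetical measurements, in seconds after buzzer activation.
measured = [0.18, 0.22, 0.20, 0.25, 0.15]
sample_delay = make_delay_sampler(measured, seed=42)
print([round(sample_delay(), 3) for _ in range(5)])
```

Each time the buzzers unlock, the signal to the machine would be held back by one sampled delay. As noted elsewhere in the thread, this still would not expose the machine to the early-press lockout that humans risk.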
So basically, the best players at Jeopardy are the ones who time the button pushing the best, not necessarily the people who know the answers the best. Yes, you have to have the knowledge, but a less knowledgeable person could theoretically edge out a more knowledgeable one based on button-push timing alone.
|
Quote:
|
Quote:
I figure that on the average board I know 65-80% of the answers. Assuming that I could resist the temptation to buzz in when I didn't, first crack should give me at least the lead going into Final Jeopardy every time (there's still the variable of where the ones I know are distributed and who gets the daily doubles). |
Quote:
|
Quote:
But I did say "there's still the variable of where the ones I know are distributed." |