ELO

Catch-all for all development not having a specific forum.
Ksero
Posts: 87
Joined: Sun Feb 15, 2004 8:00 am
Location: Sweden

Post by Ksero »

FreeBeer wrote:QUOTE (FreeBeer @ Jul 13 2006, 10:42 PM) Die. Just die, okay? I am, of course, referring to the fact that the singular of "dice" is "die". /mrgreen.gif" style="vertical-align:middle" emoid=":D" border="0" alt="mrgreen.gif" />
I counter your argumentum ad verecundiam with argumentum ad populum! Quoth the wiktionary:
QUOTE Using the plural dice as a singular instead of die when the plural is dice is considered incorrect by most authorities, but has come, especially in gambling, into widespread use.[/quote]
/tongue.gif" style="vertical-align:middle" emoid=":P" border="0" alt="tongue.gif" />
"Better than Light Booster 1"
MrChaos
Posts: 8352
Joined: Tue Mar 21, 2006 8:00 am

Post by MrChaos »

Ksero wrote:QUOTE (Ksero @ Jul 13 2006, 02:43 PM) Consider another situation: If you want to see if a particular dice is weighted, you can roll the dice a hundred times and see how often each face comes up. So one way to test it would be to take two teams and let them play against each other 100 times. Then compare the win-percentages to what ELO predicted. But that's not feasible. Instead of throwing the same dice many times, we're throwing a new dice each time, since every pickup game is unique.
But what if we can group similar dice... I mean games... together? For example, we could check all the games where the estimated outcome was between 75-25 and 65-35. Then calculate how many games were won by the underdogs. That would be one way to measure the accuracy of ELO. If it deviates significantly from 30, then we should become suspicious.

/wub.gif" style="vertical-align:middle" emoid=":iluv:" border="0" alt="wub.gif" /> Ksero

The predicted outcomes aren't like dice (plural) at all since we use the evaluation criteria, ranking, to effect the individual rankings those also our predictive model. It's not a WTFMrCS© (What the $#@! is MrChaos Saying) honestly and it's at the heart of the matter.

The classic example given for die throwing is this one: Any number on a thrown die has a 1 in 6 chance of coming up, no matter how many times the die ha been thrown or the number has appeared previous.

Thrown enough times all numbers 1 to six will come up,1/6 of the time with a fair/balanced die. Any individual throw all bets are on so speak. Follow?

Pay even money that 1 will be rolled next throw even if 1 has been thrown ten times I'll take the bet everytime (die fair assumed)

Checking for a weighted dice requires using a mean, confidence interval, standard deviation approach. Since it is CERTAIN that you WILL throw 100 1s in a row if you throw a die (a fair balanced one) enough times.

There is NO reason it cannot happen on throw 1 to 100 then it can on throw 20000 to 20099.

More data more confidence but even 1,000,000 1s in a row doesn't CONCLUSIVELY prove anything . Im pretty darn confident but you don't KNOW.

Taking some rather good advice, STFU, and waiting for more discussion.

MrChaos <---- squaded up today
Ssssh
FreeBeer
Posts: 10902
Joined: Tue Dec 27, 2005 8:00 am
Location: New Brunswick, Canada

Post by FreeBeer »

Of course, I was just going for the joke. /mrgreen.gif" style="vertical-align:middle" emoid=":D" border="0" alt="mrgreen.gif" />

And just because it's popular, doesn't make it right (see: politics, voting patterns) /mrgreen.gif" style="vertical-align:middle" emoid=":D" border="0" alt="mrgreen.gif" />

Sorry for the thread drift. I shall now cease further off-topic ramblings.
[img]http://www.freeallegiance.org/forums/st ... erator.gif" alt="IPB Image">

chown -R us base
MrChaos
Posts: 8352
Joined: Tue Mar 21, 2006 8:00 am

Post by MrChaos »

FreeBeer wrote:QUOTE (FreeBeer @ Jul 13 2006, 07:39 PM) Of course, I was just going for the joke. /mrgreen.gif" style="vertical-align:middle" emoid=":D" border="0" alt="mrgreen.gif" />

And just because it's popular, doesn't make it right (see: politics, voting patterns) /mrgreen.gif" style="vertical-align:middle" emoid=":D" border="0" alt="mrgreen.gif" />

Sorry for the thread drift. I shall now cease further off-topic ramblings.
New Brunswick hmmmm don't you guys make sports equipment there? /wink.gif" style="vertical-align:middle" emoid=";)" border="0" alt="wink.gif" />

(HEY blame FREEBEER he replied) I created 20 imaginary players who's ELO was between 850 to 2500 using Excel's Data Analysis tool. Then randomly selected six players and gave them 100% time in game. The other 14 got time played as randomized variable. I then recompute dropping a single player based on least time in game.
'13 people in total on team 2 (only six at any one time) then 12 etc

Just viewing the results
(no tests ran) shows no surprising outcomes.

Im often full of IT FREEBEER both words and it. Feel free to put us back on course if Im causing things to go haywire.

MrChaos
Ssssh
Ksero
Posts: 87
Joined: Sun Feb 15, 2004 8:00 am
Location: Sweden

Post by Ksero »

MrChaos wrote:QUOTE (MrChaos @ Jul 14 2006, 01:09 AM) More data more confidence but even 1,000,000 1s in a row doesn't CONCLUSIVELY prove anything . Im pretty darn confident but you don't KNOW.
Yeah... but we can make a good estimation of whether the dice is fair / the ELO estimates are accurate. And then we can estimate how good our estimation is /wink.gif" style="vertical-align:middle" emoid=";)" border="0" alt="wink.gif" />

So we'll sort out all the games with two teams where the estimated win % is in a small interval around 70. Each game result can then be seen as an outcome of a stochastic variable with Binomial distribution. Now... why did I leave my statistics book at my apartment over the summer... :\
"Better than Light Booster 1"
MrChaos
Posts: 8352
Joined: Tue Mar 21, 2006 8:00 am

Post by MrChaos »

Ksero wrote:QUOTE (Ksero @ Jul 14 2006, 05:26 AM) Yeah... but we can make a good estimation of whether the dice is fair / the ELO estimates are accurate. And then we can estimate how good our estimation is /wink.gif" style="vertical-align:middle" emoid=";)" border="0" alt="wink.gif" />

So we'll sort out all the games with two teams where the estimated win % is in a small interval around 70. Each game result can then be seen as an outcome of a stochastic variable with Binomial distribution. Now... why did I leave my statistics book at my apartment over the summer... :\
Now THAT got the wheels turning thanks.

Well you'll have to include every result to get proper results. Cherry picking data gives false results.

Also binomial is a yes/no test with an assigned probability based on that assumption.

Coins=gravy. Elo check not so much.

The results are weighted based on the expected outcome by percent. Maybe we could do it that way. Since we got success/failures and we have a weight too. Hmmmm

MrChaos
Ssssh
Post Reply