geek
The Hotness
Games|People|Company
Dominion - Stash Promo Card
Runewars
Dominion: Alchemy
Thunderstone
Dominion
Dungeon Lords
Alexander the Great
Agricola
Twilight Struggle
Murder at the Four Deuces
Stronghold
The Republic of Rome
Race for the Galaxy
Small World
Arkham Horror
Founding Fathers
Race for the Galaxy: The Brink of War
Through the Ages: A Story of Civilization
Vapor's Gambit
Battlestar Galactica
Chaos in the Old World
Le Havre
Last Night on Earth: The Zombie Game
Mystery Express
Puerto Rico
Endeavor
Power Grid
Vasco da Gama
Warhammer: Invasion
Descent: Journeys in the Dark
Space Hulk (3rd Edition)
Pandemic
Hansa Teutonica
Carson City
Campaign Manager 2008
War of the Ring
Livingstone
Cosmic Encounter
Stone Age
Le Havre: Le Grand Hameau
Tobago
Cosmic Encounter: Cosmic Incursion Expansion
Twilight Imperium 3rd Edition
Summoner Wars
War of the Ring Collector's Edition
Macao
Carcassonne
Dominion: Prosperity
Neuland
Steam
Stats: Dimensionalizing BGG
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
Recommend
10
In some of the other stats threads, folks have occasionally wondered how many dimensions there are underlying the game ratings. What do I mean? Think of politics: we all understand that there's a left-right dimension in politics that has something to do with economics, but there are also other dimensions. For example, people's views on abortion don't necessarily track with their views on economics. So there are at least two dimensions in politics that vary fairly freely with regard to each other: economics and abortion.

The following analysis is like that but for games. I have taken all of the games with more than 1500 ratings (except Carcassonne: The Expansion because it's an...expansion) and downloaded the ratings of all of the players who have rated all twenty of them, and there happen to be exactly twenty such players. I then put this data into Hierarchical Clustering Explorer 3.0, a freeware tool designed originally for identifying dimensions in genetic data (which I once used to dimensionalize human rights in my dissertation).

The way this works is that the program first identifies the most highly associated pair of games, then hierarchically identifies the most highly associated game with the initial cluster of two, and so on. If any pairs of games that have not yet joined the initial cluster are as similar or more similar than the growing cluster, then another cluster starts building alongside the first.

Where to set the line for distinguishing the number of clusters/dimensions in the data is more art than science,* so I decided to distinguish dimensions (starting from the top of the list, in which all the games are clustered together) to the point at which one game would be on its own. This left five dimensions, with a minimum similarity of 0.66 (using the simplest approach to measuring simultaneous similarity: Pearson correlation).

See http://www.analytictech.com/networks/hiclus.htm for a fairly clear explanation of a more intuitive kind of hierarchical clustering, and http://www.cs.umd.edu/hcil/hce/ for the freeware Hierarchical Clustering Explorer software.

The question here is what the games in each cluster have in common that would distinguish them from the other four clusters. I look forward to your speculation.

The dimensions are (highest rated game pictured -- that is, highest rated by BGG, not necessarily by the sample):

* There are significance tests for dimensionalizing data, but there are probably too few gamers here for any of these clusters to be statistically significantly distinct.
Your Tags: Add tags
Popular Tags: [View All]
Posted Mon Apr 11, 2005 2:01 am
1. Board Game: Ticket to Ride [Average Rating:7.51 Overall Rank:55]
[Shop]
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
The big dimension. What do they have in common that distinguishes them from the others?

Acquire, Carcassonne, Ticket to Ride, Citadels, Lord of the Rings, Bohnanza, Tikal, Lost Cities, Axis and Allies, Settlers of Catan Card Game

The next dimension to pop out of the analysis would be Axis and Allies and the Settlers of Catan card game on the one hand and the rest of this cluster on the other (at a similarity level of 0.67). Lost Cities would be the next to break away at 0.72.
Matt Crawford
United States
San Francisco
California
flag msg tools
Avatar
mbmbmbmbmb
Sort of the lightish group of BGG-type games, wouldn't you say? Tikal is pretty heavy, and A&A is garbage, but in general, if I had to choose a description for this group, I would say it's the family games set.
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
Yeah, I think if Tikal and Settlers were reversed from their current positions, the distinction would be a lot like light and heavy.
2. Board Game: Puerto Rico [Average Rating:8.30 Overall Rank:2]
[Shop]
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
Settlers of Catan (note: not with the card game!), Princes of Florence, Euphrat & Tigris, Puerto Rico, El Grande

Is it an accident that these are close to the five most popular games on BGG? The trendy dimension? Or is there a deeper similarity at work?

This cluster breaks from the first at a similarity level of 0.65, just before Scrabble becomes a loner.

El Grande pops out of this cluster as a singleton at 0.71, and Euphrat & Tigris and Puerto Rico as a pair break from Settlers and Princes of Florence at 0.78.
A. P.


msg tools
Avatar
patron0405
Quote:
Is it an accident that these are close to the five most popular games on BGG?


Heck no...it's an artifact of your selection process: you "downloaded the ratings of all of the players who have rated all twenty of [the twenty most-often-rated games]." Since this site is designed for, caters to, and promotes itself towards a particular type of gamer -- let's call that a "heavy Euro-gamer," shall we? -- then not surprisingly those games which share the characteristics favored by that type of gamer all have high ratings.
There's no mystery here, nor in any of the recent heavily stats-driven geeklisting and such. Why is everyone acting like they've stumbled upon some new data? I'm really not trying to sound judgmental nor superior -- I just want to know, is there any other dimension to this stuff I'm missing due to my statistical illiteracy? Or are y'all just re-proving what I and others have been saying for years: that BGG is, as a product and as a community, heavily slanted in favor of a particular school of gaming?
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
I didn't mean to assert that I'd discovered anything radically new...BUT I still think it's interesting that the Eurogames actually don't all fall into one dimension. These five are in a separate group from Ticket to Ride, Carcassonne, etc. And that group (TtR, Carcassonne, etc.) also contains some non-Eurogames, unlike this group. So perhaps these are the Eurogames par excellence, and the first cluster has something else in common?
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
And besides, do you really want to read another ultimate/white elephant/uber/superduper/supercalifragilisticexpialidocious trade list? At least you got a chance to vent.

As for the game-clique-ishness of BGG, see my comment to #5. I don't deny it at all. Heck, I don't even like Puerto Rico!

To explain a bit further. The dimensions could've broken down in more obvious ways. If people who tended to rank tile placement games highly liked card-driven wargames less than those who tended to rank auction games highly, then that consistent difference in voting behavior would've showed up as two different dimensions. (Example from my work on human rights: I get a separate dimension for women's rights than for civil liberties generally. Is this surprising? Not really, to me anyway. But many people and international organizations assume that all human rights necessarily travel together and assume that trying to enhance property rights procedures will help women.)

So it could've been the case that all the Eurogames clustered together, but they didn't. The big five are in this cluster, but they're distinct on some basis in the voting behavior from those in the first cluster.

I suspect that if I restricted this analysis to Eurogames, then it would be easier to identify more dimensions, because with all these games included the obvious dimensions will consume most of the differentiating power of the model. Maybe next list. :)
3. Board Game: Magic: The Gathering [Average Rating:7.29 Overall Rank:116]
[Shop]
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
Chess, Magic the Gathering

Interesting, no? I've heard these two compared before, but as I haven't played Magic, I'm not the best person to explain what connects these two and distinguishes them from the others.

These two and Scrabble become distinct from the first two clusters at a similarity level of 0.61. These two would break into singleton clusters at 0.71.
4. Board Game: Scrabble [Average Rating:6.46 Overall Rank:691]
[Shop]
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
Scrabble

The loner.

This and the third cluster become distinct from the first two clusters at a similarity level of 0.64.
5. Board Game: Risk [Average Rating:5.61 Overall Rank:4779]
[Shop]
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
Risk, Monopoly

The not-so-fondly-remembered-childhood dimension?

Keep in mind that correlation does not measure whether games tend to have the same ratings, but the degree to which gamers' ratings can be used to predict other gamers' ratings -- either positively or negatively. If my ratings are the exact opposite of yours, then we're perfectly correlated. Also, if my distribution of scores resembles yours, proportionally speaking, but uses a more compressed range of ratings (say, 6-9 versus your 1-10), then we will also be highly correlated (and that's true whether it's 6-9 vs. 1-10 or 9-6 vs. 1-10).

Risk and Monopoly are the second most associated pair in the analysis, next to Settlers of Catan and Princes of Florence, and break away from all of the other games taken as a single cluster at a similarity level of 0.44.
Jorgen
United States
Ferndale
California
flag msg tools
Avatar
mbmbmbmbmb
Why are you lumping RISK with that piece of white trash from a trailer park? :what: ;)
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
Blame the statistics! :D

But seriously, I suspect that by selecting only the twenty gamers who have played and rated all of these games (well, I guess it's a sloppy assumption to say that they've played all of them ;) ), I'm hitting on people who have pretty similar views, broadly speaking, on games. These are the BGG'iest of BGG'ers, insofar as this place has a common personality (at least with respect to the ratings). But there are still these dimensional differences even within the games they like.
11 comments [Hide]
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
The software I used for this can handle mismatched data: data where not all the games are rated by all the members, in our case. It would be fun to dimensionalize the whole top 50 (of ratings or of number of ratings, or whatever) (more than that would make for an unwieldly presentation), but I can't download all of the ratings for all of the games in one sheet. Or at least I don't think I can.

I also note that my taste in pictures has only one-dimension: zoomed-in-ness. :)
Matt Crawford
United States
San Francisco
California
flag msg tools
Avatar
mbmbmbmbmb
Very interesting list! The couple of critical comments are probably with merit, but that's just room for improvement. So even with those caveats, this is an interesting idea for a list.

Basically I think it taps into my desire for the "ratings correlations" to come back. I still wish we could use the correlations with other users to predict what games I would like. This is sort of along the same lines.
Ray
United States
Carpentersville
Illinois
flag msg tools
Avatar
04050607080910
mbmbmbmbmb
My strawman would be that these 5 groupings represent the game categories with different ratings lifecycles. (A ratings lifecycle is how when you first play a game you may rate it an 8, then on your second playing it may rate a 9, and by your 20th playing it may rate a 7). I think these patterns come out with your data because each game may be at a different position in its rating lifecycle for each user. Here are your 5 groupings:

1) This is the High-Low rating lifecycle category. These are games with great wow factor, but that show themselves to be lighter (or at least more random) with repeat playings.

2) This is the High-High rating lifecycle category. These are games with great wow factor that even with repeat playings remain difficult to solve (are deep)

3) This is the Low-High rating lifecycle category. These are the games that while fun initially take on a whole new level when serious time is invested in learning them. They also are the games most likely to impress by watching more experienced players play them.

4) This is the diffuse category. In this case the rating for scrabble depends more on the unrelated skill of depth of vocabulary. Deep game theory is not the issue.

5) This is the Low-Low rating lifecycle category. These are the games that are not popular with the folks that rate games. Its possible that this is a member of category 1, but since everyone left the high ratings stage so long ago that doesn't show in the data.
L Z
Italy
Unspecified
Unspecified
flag msg tools
Avatar
Absolutely take no offense.
All geeklist, by me , have the right to exist here on BGG.
This said, am I the only one that finds Statistical geeklists incredibly boring? :snore:
David Yamanishi
United States
Mt. Vernon
Iowa
flag msg tools
patron05060709
I'm sure you're not, but at least you don't have to wade through as many of them to get to the good stuff as I have to wade through trade lists to get to the good stuff. ;)

And back to the boring: Ray, your comments seem pretty plausible. It's sometimes hard to remember when you're analyzing clusters that you're looking for consistently different voting patterns for games rather than consistently different voting patterns by gamers. If you were taking my research methods class right now, you'd be doing well. ;)

Aldie has again proven what a wonderful web genius he is by adding an option to the stats tool to permit downloading all of the people who've rated any one game in a set of games. Note that he had to put some value in for missing games, and he picked zero. So depending on what program you're using, you should replace all of the zeroes with the missing value marker for your program or tell the program to ignore zeroes, or any calculations will be wildly off the mark. I already have the set for the top 50 rated games (except expansions), and it has 8,866 people in it. I'll see what it has to say later today or tomorrow.
Front Page | Welcome | Contact | Privacy Policy | DMCA | Advertise | Support BGG | Feeds RSS
BoardGameGeek and the BoardGameGeek logo are trademarks of BoardGameGeek, LLC.