Thanks to the SQL gurus who responded so quickly to my “question”:https://crookedtimber.org/2005/04/22/sql-query-query/. Their help allowed me to get the data I wanted, namely, a table showing how often each of our authors has posted in each of our categories. A matrix like this allows for a “correspondence analysis”:http://www.statsoft.com/textbook/stcoran.html of the joint space of authors and topics, in the spirit of “Pierre Bourdieu”:http://www.amazon.com/exec/obidos/ASIN/0804717982/kieranhealysw-20/ref=nosim/.
I’ve updated the analysis from the original post, following some of my own advice to amalgamate the categories that different people were using to post a short joke or trivial item (like “Look like flies” or “Et Cetera” and so on). I grouped all of them into a “Trivia” category. “Books” and “Literature” were grouped together, as were “Internet”, “Intellectual Property” and “Information Technology”. Finally, I also grouped “British Politics” and “UK Politics” into a single category. Unfortunately I had to drop Jon Mandle from the analysis (sorry Jon!) because none of his posts has a category.
Correspondence analysis lets you represent two kinds of entity simultaneously in two dimensions, allowing you to see how the elements of each entity are related to one another, and to those in the other entity. The idea is to reduce high-dimensional spaces (many authors, many categories) to low-dimensionsal ones with minimal loss of information. The figure below (also available in “a larger size”:http://www.kieranhealy.org/files/misc/dudi-ct.png and in “PDF format”:http://www.kieranhealy.org/files/misc/dudi-ct.pdf) gives the results for the CT data.
The results are interesing. You can assess the similarity of authors and categories to one another by their closeness on the diagram — or, more specifically, by the size of the angle formed between any two entities and the origin. Entities further out from the origin are more influential in structuring the dimensions that the figure is constructed on.
The philosophers and political theorists, with the exception of Chris, are on roughly the same dimension. Tom and Micah in are quite similar in their concerns. (Not coincidentally, they’re two of the least-frequent posters to CT.) Brian and John Holbo also go together somewhat, covering the academia/philosophy angle. Harry is right on top of the “family life” category, which no-one has posted on (under that title) bar him. Chris has the most differentiated posting profile, being distinctively associated with several topics (history, political theory, UK-related stuff), but also influencing the positin of topics like Music, Architecture (more like Henry in this respect) and the World Economy (more like John Quiggin and Daniel). Unsurprisingly, John Q and Daniel are associated most strongly with a cluster of topics around economics, international finance, the environment and globalization.
Maria’s distinctive cluster is around IT issues and European Politics, which is unsurprising given her location and job, but also health care and religion. Henry is associated with Irish Politics, Science and Literature. Belle clusters on four categories — sexual politics books, humor and blogging (it’s underneath the others). Ted’s most strongly associated with U.S. Politics, as am I, but with Sociology added. Ted, Eszter and I have the dubious disctinction of being quite closely associated with trivia, too.
So rougly speaking we have four groups of people here: the philosophers (Harry, John H, Brian, Tom, Micah); the economists (Daniel, John Q); a fairly heterogenous Social Science/Blogging group (Henry, Belle, Eszter, Ted, Kieran, perhaps also Maria); and Chris in a class by himself — though you could also make a case for Maria in this respect.
{ 1 trackback }
{ 10 comments }
John Quiggin 04.22.05 at 9:36 pm
Cute! I really need to get back into quantitative stuff.
Kieran Healy 04.22.05 at 10:19 pm
I redid the analysis with slightly clumpier categories. A problem with the data is that it might be picking up, in part, the tendency of authors to choose more or less fine-grained categories for themselves: some might bother to do this, while others might not. E.g., a lot of the stuff I class as “Sociology” could have been more finely classified. So it may be that Chris’s status as a renaissance man may just reflect a higher propensity to file his posts accurately.
But then I would say that, being relatively homogenous.
a 04.23.05 at 2:04 am
This will be filed under the category “navel gazing”, I hope.
John Quiggin 04.23.05 at 2:48 am
Does the fact that the top-right quadrant is nearly empty have any significance, or could you change this by shifting the axes?
My pattern-recognition genes are kicking in to suggest that the group would be rounded out by an anti-Quiggin
rc 04.23.05 at 4:40 am
Can you post the contingency table?
lakelobos 04.23.05 at 5:52 am
A table will be more informative than the plot.
Ben Hyde 04.23.05 at 7:44 am
Kieran’s planning something and clearly his natural ally is Ted. As they consolidate power they will be wanting to eliminate, marginalize or possibly meerly deamonize Harry. In the comming elections we can expect a temporary increase in family-life postings from these two.
The political science version of this[1] toy can b e used to craft bills. Each bill cuts a line thru this space. Voters are predicted to vote yes or no depending which side of the line they appear. That, in turn, tells you who your natural allies are. In iterative rounds of voting parties form. Which leads to realizing that charts of this kind can explain why you get such strange bed fellows inside of a party. They also tell you who to drive out/in, and where to create the polarization or where to work your message to solidify your base.
It’s always fun to try and name/describe the X and Y axis on these things. Could we have the catagories enumerated in order with scores for the two axis?
[1] http://enthusiasm.cozy.org/archives/2003/01/rightleft-wing
Kieran Healy 04.23.05 at 9:36 am
_My pattern-recognition genes are kicking in to suggest that the group would be rounded out by an anti-Quiggin_
Right. Someone in the top right quadrant would be maximally different from you. Right now Brian and John H come closest to this role.
Matt Brubeck 04.23.05 at 8:20 pm
‘This will be filed under the category “navel gazingâ€, I hope.’
It’s filed under “blogging,” which should be close enough. ;)
John Emerson 04.25.05 at 2:40 pm
The chart os almost illegible on my browser. How are the four quarters defined, especially the Empty Quarter?
Comments on this entry are closed.