by John Q on November 21, 2005
The availability of search engines like Google provides an easy way of checking on factual claims you may find questionable – just enter the relevant keywords into a search engine and see what comes up. If such a search produces nothing to support the claim, or evidence to refute or qualify it, then it’s time to start demanding evidence.
This started me thinking about a more general problem with search engines. Using search engine results in the way I suggest rests on the assumption that a given query will produce given results. The same is true if I want to say “Site X is the top result on engine Y for query Z”. But what happens if, as is already possible, search results are personalised, based on, say, previous search history and choice among search results. The same search, undertaken by someone else, might produce completely different results.
[click to continue…]
by Eszter Hargittai on November 16, 2005
A few months ago I posted an entry called Google World in which I talked about the amount of information Google and other companies such as Yahoo!, MSN and AOL are amassing about their users.
This week’s launch of Google Base is another step in the direction of building elaborate profiles of users. Moreover, it is an interesting move by the company to get users to fill up Google’s own Web property with lots of valuable material for free.
Google Base is a collection of content submitted by users hosted on Google’s site. Let’s say you have some recipes (I mention these as that part of my own Web site seems to be one of its most popular sections and Google Base already in this early stage has a section on that), instead of simply hosting the recipes on your own site and having Google (and other search engines) drive traffic to it, the recipe can now live on Google’s own Web property. Other types of content range from classifieds about housing and jobs to course syllabi. Some have suggested it is like a gigantic expanded version of the popular Craig’s List, which I mention in case that is a service with which you are familiar. Google Base will be a collection of information that users provide for free, but for which Google gets credit when people find it.
It is hard not to wonder how much more prominent Google Base content will be in Google’s search results compared to other content on the Web.
[click to continue…]
by Kieran Healy on October 26, 2005
Speaking of “website gadgets”:https://crookedtimber.org/2005/10/26/map-of-ct-readers/, yesterday I tried out “Library Thing”:http://www.librarything.com, a service that lets you catalog your books online. Think of it as “Flickr”:http://www.flickr.com for your books. About 70 percent of the books in my office are already in a “Delicious Library”:https://crookedtimber.org/2004/11/13/delicious-monster/ catalog, which Library Thing can import, so I uploaded the lot. Like Delicious Library, the most obviously useful feature of a catalog is as yet unavailable — namely, the ability to do a full-text search on the books you own. Something like Amazon’s Search Inside. Maybe in the future there will be a way for applications like this to talk to Search Inside or “Google Print”:http://print.google.com/.
In the meantime, Library Thing lets you explore an affiliation network. You’re tied to other users through ownership of the same books, and in your “profile”:http://www.librarything.com/profile.php?view=kjhealy you can see who overlap the most with. It turns out that the user I’m closest to none other than “Chris Brooke”:http://users.ox.ac.uk/%7Emagd1368/weblog/blogger.html, of the Virtual Stoa. “He”:http://www.librarything.com/profile.php?view=chrisbrooke and I share 38 titles. This may partly be a size effect, as Chris has more than three times as many books cataloged as I do. But it may also index up our relative closeness in “Blau Space”:http://icc.oxfordjournals.org/cgi/content/abstract/13/1/263. Further evidence of affinity in tastes comes from the fact that “Chris’s photo”:http://www.librarything.com/profile.php?view=chrisbrooke and “mine”:http://www.librarything.com/profile.php?view=kjhealy come from the “same source”:http://www.planearium2.de/flash/spstudio.html.
by Eszter Hargittai on October 21, 2005

It’s been too long since we’ve had some geeky goodness around here. But wait! You don’t have to be a geek to appreciate and benefit from the following so read on regardless of your geek quotient.
I’ve been a big fan of Firefox since last Fall and given its wonderful features (better security [update: see comments for suggestions as to why this may not be the case], all sorts of functionality) I try to do my best to encourage others to use it as well.
In that vein, I have put together a page with a list of my favorite extensions. Firefox extensions are little programs that add features to the browser. Some of my favorites include being able to search for a street address without having to retype the address or pull up a map first, tabbed browsing, better use of browser space, etc. I know some of these features are available in other programs as well, but it’s great to have it all come together so nicely in one program. Feel free to list additional favorites in the comments to this post.
I have also put together a detailed tutorial on how to install the program (on Windows) for those who do not feel comfortable downloading programs. Feel free to pass along these page to your parents, cousins, friends, etc.
This Webuse.Info site contains some additional information so to recap:
Enjoy!
UPDATE: Since the comments have gone in all sorts of directions, I have highlighted in green sections of posts that refer to additional extensions for those who want quick access to that info.
by Eszter Hargittai on September 26, 2005
Inspired by this post on Digg, I started running searches on Google to see what would yield a really high number of results. A search on “www” yields results “of about 9,160,000,000”. This is curious given that according to Google’s homepage, the engine is “Searching 8,168,684,336 web pages”. Perhaps they are extrapolating to sites that they are not searching. Or perhaps those “of about” figures are not very accurate. In general, those numbers are hard to verify since Google won’t display more than 1000 results to any query. The figures may be helpful in establishing relative popularity, although it’s unclear whether the system can be trusted to be reliable even to that extent.
by Henry Farrell on September 26, 2005
“David Kopel”:http://www.insidedenver.com/drmn/news_columnists/article/0,1299,DRMN_86_4105129,00.html argues, rightly, that there is something very nasty about the willingness of companies like Google and Yahoo! to knuckle under to authoritarian regimes such as China by banning words from search engines, snitching out democracy activists and so on. He’s also correct when he “claims”:http://www.volokh.com/archives/archive_2005_09_25-2005_10_01.shtml#1127700874 that “the greedy and immoral policies of these corporations directly endanger Americans.” However, his claim that “[p]erhaps only consumer and shareholder pressure can persuade the American companies to change their evil ways” seems to me to be quite mistaken. Consumer and shareholder pressure simply isn’t likely to have much of an impact, when measured against the power of the Chinese government to ban these companies from access to a quite enormous and important marketplace. Nor does it seem likely to me that many large shareholders are likely to raise a fuss in any event. More generally, when firms weigh the power of consumers to use exit and protest against the ability of powerful states to impose heavy sanctions, and completely block access to important markets, they are usually going to do what the state(s) want them to do. The only solution that would have some chance of biting would be if the US passed legislation requiring US-based firms not to cooperate with Chinese government authorities on pain of substantial penalties, and enforced this regulation vigorously, transforming it into a battle between powerful states with big markets.
We’re going to see more and more of these problems cropping up. People used to think that the Internet would empower firms and other private actors against the state, helping the spread of democracy, free markets and all that. What we’re seeing instead is that firms and private actors have an interest in keeping powerful states happy, regardless of the impact on global prosperity, freedom and so on. This has always been the case – but it’s being exacerbated by the Internet. I’ve just written a “paper”:http://papers.ssrn.com/sol3/papers.cfm?abstract_id=799269 which talks about what this means for international politics (although it doesn’t discuss the particulars of the Yahoo! case).
by Eszter Hargittai on September 15, 2005
A serious problem with content filters – whether add-on software or the “safe” search mode of systems – is that they often block legitimate content that should not be filtered out. These false positives can include important information that most would have a hard time defending as harmful. Paul Resnick and colleagues have done some interesting work on this regarding filtered health information.
Now comes to us a helpful little tool (found through ResearchBuzz) that lets you run searches to see what content is blocked in the safe-search modes of Google and Yahoo!. Type in a search term and see what sites would be excluded from the results when running the safe mode on the two engines.
Curiously, Google blocks the TheBreastCancerSite.com when you turn to safe mode for a search on “breast cancer” while Yahoo! doesn’t. (The Breast Cancer Site does not seem to have objectionable material, its noted mission is to raise funds for free mammograms.)
By the way, Google’s and Yahoo!’s results can be quite different regardless of what gets filtered. Dogpile has a nifty little tool that visualizes some of the differences. I discussed it here while guest-blogging over at Lifehacker a few weeks ago.
by Eszter Hargittai on September 7, 2005
This post is a follow-up to another from a few days ago.
First, the bullet-point version of this post:
- A one-question survey has very limited utility
- Most respondents have tweaked their default homepage
- Several types of default pages are popular with respondents
- We cannot generalize findings from one blog’s readership to another
- When trying to learn about people’s Web uses, it can be very helpful and interesting to ask them for details
Second, thanks to the 784 of you who took the survey! Read on for more.
[click to continue…]
by Chris Bertram on August 30, 2005
Wow! Just had my first “Google Talk”:http://www.google.com/talk/ conversation. I’m sure most of you are old hands at this voice-over-internet stuff but it was my first time. Set it up, invited some friends and then up pops a mate from Novosibirsk (equipped with headset) for a chat, as if he was just down the road. It works, it’s simple and easy to use. Fantastic.
by Eszter Hargittai on August 29, 2005
Anecdotally, I still often hear people say (like I did this weekend, or like I’ve read in CT comments) that it wouldn’t take that much for a new company to enter the search-engine market. But we are not in the late 1990s and it would take tremendous resources to enter this market.
The major players at this point are AOL, Ask Jeeves, Google, Microsoft and Yahoo!. (Note that in contrast to much anecdotal evidence in the press and among other commentators, Google does not have nearly the market share that many people suggest. I’ve discussed this on CT before.)
Among the above search engines, AOL, Google, MSN, and Yahoo! represent much more than just search engines. They are vast empires of Internet-related products that continue to innovate and introduce new services.
This does not mean that there is no room for innovation. In fact, we seem to be undergoing a second boom these days (somewhat reminiscent of the late 90s, but in a much more realistic manner). Numerous interesting and innovative services have sprung up in the last few years. However, you will notice that many of these are eventually acquired by one of the companies above. Examples: Google’s acquisition of Blogger and Yahoo!’s acquisition of Flickr.
And to be sure, we have even seen new entrants in niche markets of search, for example, the searching of recently added content. Here, Technorati and Feedster come to mind. While offering valuable services – an almost immediate inclusion of blog content in search results – these engines focus on a very small segment of Web content.
It would take tremendous amount of resources in this day and age to even come close to the computation and labor resources that drive the above-mentioned companies and allow them to index Web content at a more general level. It is unlikely that we will see independent new entrants in the near future. If we do, they will likely be acquired by one of the companies above.
by Eszter Hargittai on August 29, 2005
I am guest-blogging over at Lifehacker this week while regular editor Gina Trapani takes a breather. Lifehacker is part of Nick Denton‘s Gawker Media empire that has managed to make money out of blogging. (We’re not all in it for the $s as you can tell by the lack of ads on CT, but it’s nice to know that some people who don’t necessarily have other main sources of income are able to pull it off.) CT readers are probably most familiar with Gawker’s Wonkette, but there are about a dozen Gawker sites at this point addressing all sorts of topics.
Lifehacker focuses on ways to make your life more productive. Many of the posts feature downloads (e.g. Firefox, Flickr), shortcuts and pointers to helpful Web sites. There is a whole category of advice pieces as well ranging from how to deal with various situations at work to ideas for getting things done more effectively.
If you have any lifehacking tips, please send them along to me this week by writing to tips@lifehacker.com.
by Henry Farrell on August 24, 2005
Sean Carroll “reports”:http://cosmicvariance.com/2005/08/24/arxivorg-joins-the-blogosphere/ that the “arxiv”:http://arxiv.org/ pre-print series has started to integrate itself into the blogosphere; this strikes me as a Very Big Deal indeed for academic blogging. Non-physicists may not be familiar with arxiv (I know that I certainly wasn’t before I started getting interested in network topology) – it’s effectively replaced journal publication as the primary means for physicists to communicate with each other. Journal publication is still important – but as an imprimatur, a proof of quality, rather than a way to disseminate findings to a wider audience. arxiv has now introduced trackbacks – people visiting the abstract of a paper on arxiv can see what blogs have commented on the paper, and read what they have had to say. Furthermore, arxiv has “rss feeds”:http://arxiv.org/help/rss of recent papers, classified by subject matter, making it much easier to keep up with new publications in a subfield.
This seems to me to be the nucleus of something like the new approach to academic publishing that John Holbo has advocated, in which blogs and bloglike tools become an integrated part of academia, creating conversation around interesting recent papers, filtering the good ones from the not-so-good ones etc etc. I can see potential problems down the line (trackback spam, attempts to game the system etc) – but the promise that this holds for physicists (and for non-physicists when we get around to creating arxiv equivalents) seems to me to be nothing short of extraordinary.
Update: It appears as though “Jacques Distler”:http://golem.ph.utexas.edu/~distler/blog/archives/000638.html had a lot to do with this.
by Eszter Hargittai on August 24, 2005
I am back from a five-stop two-week trip and am finally catching up on CT and various things Web. I missed the discussion John started a few days ago about Google. Instead of adding to that thread, I’ll add a whole post. To think of Google as just a company focusing on search is outdated, in my opinion. Google is becoming much more than that. Since the beginning they have been an expert at using network analysis to their advantage. With the various services they are rolling out, they can use that ability not only in the realm of search, but in the realm of building profiles of their users.
The title of this post does not refer to a new Google program. Rather, it’s what I suspect the company is aiming at overall. That is, they are introducing (whether through internal development or buyouts) new services constantly, many of which suggest that they have their eyes on doing much more than providing search. Today, they launched Google Talk so now they are in the instant messaging market. For Google Talk, you need a Google Account, which is the same as your GMail account if you already have one. If you don’t, you may consider getting one since now they offer over 2.5 gigabytes of storage. Of course, you may never need that amount of space for email (although I learned a long time ago never to say never when it comes to storage space) in which case you may just want to use it as a backup for files.
One of the great features about GMail is that it checks for new email regularly (several times a minute) so as long as you stay logged on, you can get regular email updates. Of course, as long as you stay logged on, Google can track all of your online activities connected to its services, which include searches run on its search engine. Not only do they have information about all of your emails, they also know what searches you run and what results you choose.
Being able to scan your email (as they do for the purposes of displaying Google Ads) doesn’t only give them information about what topics you discuss, they also know with whom. They can develop very nice maps of people’s networks. Now that they have launched Google Talk they will also know which of your email contacts are strong enough that you also tend to contact them through chat (assuming you are using Google Talk for IMing). They will have more data on which to draw for a network map of your connections. And since the use of Google Talk requires a Google Account from both users, they can construct network maps of those people as well. So your network map is not just about your direct connections.
[click to continue…]
by John Q on August 20, 2005
Google is about to issue 14 159 265 more shares (the number chosen is derived from the decimal expansion of pi) aiming to raise about $4 billion at an average price of about $250 a share. Given that I argued that Google was overvalued at the initial offer price of around $80, it might be time to take another look, both at Google as an investment and at the issues raised by its position in the Internet. In this post, I’ll stick to the first issue.
[click to continue…]
by Eszter Hargittai on July 28, 2005
Yahoo! responds to Google Video with The Yammys aka the Yahoo! Video Search Awards launched yesterday. Get those creative juices flowing in the following categories: Road Trips, Office Humor, Bloopers, Pets and Misc (so just about anything else). It’s a nice publicity idea to get some attention for their video search service. That is, assuming you – or anyone you know who likes to send around links:) – get a kick out of watching a dog on a skateboard. [thanks]