January 2009

Wikipedia in the news (again): reputation and status

Yesterday I noted this Guardian article, ‘Wikipedia editors may approve all changes’.

Now, as it turns out, that headline is highly inaccurate. It seems that the ‘Flagged Revisions’ system would in fact only apply to a certain subset of Wikipedia articles (biographies of living people) and (I think) to new/unregistered users. The article doesn’t tell you this, and in spite of the fact that this is an article about the internet, it contains no links to further external sources of information that might.

Today, the [sarcasm alert] well-known Web2.0 expert Marcel Berlins has waded into the debate. His piece does at least contain some links to Wikipedia pages, possibly because it’s a CIF (“C*ntishness is Free”) blog post/op-ed, rather than a news article. But still no links to the sources of the story.

Why is it too much to ask for a link to the actual web page where Jimmy Wales and the Wikipedians are discussing this right now*, so we can find out for ourselves what’s going on? If you want that, you don’t search the news sites; your best bet is to head for the blogosphere (eg).

This isn’t just a dig at the Graun; check out the story at other newspaper sites – how many do any better (except from within their blogs, eg here)? The Telegraph doesn’t here; nor does the Times here – and the latter contains the line “In his blog, Mr Wales said the “nonsense” of the false reports would have been “100% prevented by Flagged Revision” and said he wanted the changes to be implemented as soon as possible.” So you can have a direct quote from an online source, but still no link. On top of which, the source isn’t a blog, it’s Jimbo’s user talk page at Wikipedia.

Wikipedia articles, in contrast, generally contain extensive external links and references, not to mention having discussion pages and a full history of revisions just a click away. Wikipedia may be unreliable, but it’s transparently unreliable, and at least it attempts to document its sources (and its creation process).

Newspaper websites continue to get away with far more shoddy practices (I use the word deliberately: frankly, I think there’s no excuse for the absence of links to important sources in online versions of newspaper articles – and even in print versions too – and I’d also like to see wiki-style access to the full history of articles). OK, they’re in the business of news – the ‘rough draft of history’ – not reference.** The genre does make a difference.

But it’s also because of reputation and status, and assumptions about ‘amateurism’ vs ‘professionalism’.

Marcel Berlins (if I must):

The brutal fact is that a work of reference which depends mainly on volunteer amateurs, whose good faith, ability and expertise are unknown, and whose contributions are largely unchecked, cannot be other than unreliable.

Marcel perhaps needs to read some of the research comparing Wikipedia with works of reference by paid professionals. (And, as I noted very recently, the idea that ‘contributions are largely unchecked’ is phooey.) The issue is not ‘reliable’ vs ‘unreliable’ – it’s ‘how unreliable?’, and ‘is it good enough for a given purpose?’ (And of course, ‘how can you tell, if you’re not an expert in the subject?’)

Newspaper journalism, by default, is trusted to get things roughly right, to be good enough – regardless of how often we see mistakes in subjects we know something about, regardless of how many articles are just regurgitated press releases and uncritical plugs for somebody or something (cynics might say that newspapers can’t afford to have a general policy of linking to sources, because if they did, we’d be shocked by just how much of their content is of this type). Wikipedia, by default, isn’t trusted. And it’s still got a long way to go.

* I found this fascinating, but I also find Wikipedia’s ‘proposed deletions’ discussion logs fascinating. So YMMV.

** And no small part of the issue in this particular case is that Wikipedia itself is blurring the line between news and reference.


Darwin 200

Don’t forget: Darwin Day on 12 February. In preparation, you could do worse than to read Blogging the Origins. John Whitfield, a science journalist, is reading On the Origin of Species and blogging each chapter.


The futures of wikipedia and of digital humanities

In future, Wikipedia editors may edit all changes. First thoughts – Wikipedia continues on the path to conventionality and more editorial control. Which is probably a good thing, if it improves standards, but it used to be more fun. Second thoughts – read the comments. There are some beauties in there.

The future’s bright, the future is: Manifestoes! So here’s A Digital Humanities Manifesto, from the UCLA Mellon Seminar in Digital Humanities. Has its moments, but I’m having real trouble taking it seriously. (Also has some good comments. And CommentPress is nice.)


Recently noted around the web

What I’ve been reading online lately…

Writers’ email accounts of themselves
  What might Selected Editions of writers' email correspondence look like?

Google Hopes to Open a Trove of Little-Seen Books
  the latest from the NYT on Google Books (again)

5 Best Data Visualization Projects of the Year
  the medium is the message

Make-work
  brilliant post on what the New Deal projects of the 30s actually made


The hard work starts tomorrow

But today we dream.

And you can get the inauguration soundtrack here.


Thanks for the memories, Tony

Tony Hart has died.


(Warning: contains plasticine violence.)

RIP, Morph.


Irony. Or something.

I just discovered a post dated yesterday at the Society of Professional Journalists’ blog, entitled “History/Fact-Checking“. This consisted of a listing of online history-related resources, including this entry:

Old Bailey Online: History Site
From the famous British courthouse, this online database covers more than 22,000 trials dating between 1714 and 1759. Read the actual transcripts from historic proceedings, view the original documents, etc.

Only about, ooh, 5 years out of date there then. If I were a nice person I’d email them and quietly point it out, wouldn’t I? But I’m not, so I’m just going to point and laugh on my blog.


Inauguration Specials

OK, so you want something really special to commemorate the big day on the 20th? How about an inauguration thong?


Visualization and FlowingData

Data Visualization is one of the most exciting growth areas on the web. Who wasn’t captivated by Wordle and blown away by this graph of Box Office Receipts 1986-2008?

So one of my favourite new blog discoveries is FlowingData. Where I discovered that the Box Office visualization is powered by Stream Graph.

If you want to try out a wide range of visualization options without needing to install any software, Many Eyes is absolutely fab.

More data visualization tools.


Old Bailey Proceedings 2.0

Well, it’s been up for a few weeks now (I meant to post about it over Christmas. Strangely, that didn’t happen…) – we’ve launched a new community wiki for users of the Old Bailey Proceedings.

The wiki is intended as a supplement to the main site itself: both a resource for researchers, teachers and students and a community space for sharing information related to the history of the Old Bailey and the people who appeared there between the 17th and 20th centuries.

There are several main areas for contributions:

1. Biographical material about individuals (and families) who are documented in the Old Bailey Proceedings and Ordinary’s Accounts, drawing on source materials beyond the Proceedings themselves.

For example, perhaps you’ve researched your family history and found some of your ancestors in the trial reports at the Old Bailey Proceedings Online. The wiki may help you to find people who have uncovered different parts of the same puzzles as yourself, whilst also offering additional information to readers of the Proceedings that we could never provide. Here’s a nice example someone’s already posted, about a Thomas Dobyns who appeared as prosecutor in this trial.

2. Information supplementing the Historical Background sections of the main site. There are many unfamiliar things mentioned in the Proceedings – eg, forgotten places, objects, clothing, food and drink, London’s local histories and communities, the development of criminal justice and policing. We’d also welcome information about other primary sources of relevance to the Proceedings – newspapers, criminal biographies, archival sources, etc.

3. OBP-related teaching resources. We’ve already placed copies of the old Schools pages at the wiki for teachers to use, either to update those pages or simply to give some ideas for entirely new pages. We hope that the wiki can become a major resource for teachers and students at all levels of the education system.

4. A new version of the OBP Bibliography (to which users can also add items, although this is a slightly more complex procedure than the rest of the wiki).

5. Last but not least, you can let us know about errors in the OBP transcriptions and data.

Moreover, we plan to create extensive links between the information at the wiki and the main site so that contributions to the wiki, large or small, can enrich the experience of OBP visitors. I’ll try to keep you all updated on progress.

It’s going to be interesting!


Who writes Wikipedia?

Who the hell writes Wikipedia anyway? asks Henry Blodget. (H-T.)

Jimmy Wales has asserted that Wikipedia is overwhelmingly the work of a fairly small core Wikipedia community. But it seems that’s only true if you count numbers of edits. If you look at content, Aaron Swartz argued, it’s a different story.

When you put it all together, the story become clear: an outsider makes one edit to add a chunk of information, then insiders make several edits tweaking and reformatting it. In addition, insiders rack up thousands of edits doing things like changing the name of a category across the entire site — the kind of thing only insiders deeply care about. As a result, insiders account for the vast majority of the edits. But it’s the outsiders who provide nearly all of the content.

That is perhaps particularly striking in the light of the many complaints I’ve read from academics and other specialists who’ve contributed their knowledge to Wikipedia and then painfully seen their work trampled, chewed over (and sometimes spat out) by people with far less understanding of the subject in question – but far more understanding of how Wikipedia works.

If Swartz is right (and NB that it does appear to be based on a very small sample of pages), then this is a crucial dynamic, and I suspect not just for Wikipedia but for many wikis and similar Web2.0 sites. Which is of particular interest to me right now for reasons that I meant to post about before Christmas and, um, forgot. (Watch this space.)


Good Digital Things for a New Year

Back to work on Monday. Bah. Do not want. Apart from that, Happy New Year everyone!

Sarah Werner’s Wynken de Worde, a fine blog with an emphasis on early modern books and reading, has won the Cliopatria History Blogging Award for Best New Blog.

Mercurius Politicus has found some splendid manuscript, palaeography and book history resources. Manuscript sources are the next big growth area for digital early modernists, as digitizers work out that although it’s not that easy, it’s not that hard either.

But there are losses in digitization. Diapsalmata has been reading a special issue of Image and Narrative on Digital Archives, in which several articles engage with the relationship between digital humanities and the archive.

Whereas Ted Vallance has some thoughts on those quaint paper book thingummies in 2008. (His New Year’s resolution is to buy fewer rubbish books in 2009.)

Chris at the Virtual Stoa has a New Year’s Resolution I can really get behind. Eat more butter. Mmmmmmm.