Archive for January, 2005

Tour Down Under 2005

We went and saw the final stage of the 2005 Tour Down Under. Some of my photos are available on Flickr.

Comments

Disambiguation in Folksonomies

Tim Bray's post on Technorati Tags highlighted
that it wasn't just me that though disambiguation
in folksonomies
is important.

I spent a while thinking about it, and I came up with a half-backed scheme where uses could qualify their tags with a
parent tag, and an elaborate justification for why this was okay to do.

Then I realised that the disambiguation data is already there in the form of additional tags on the same data item.

In Del.icio.us you can tag a page with more than one tag. If the user interface presented each one of these tags as
related concepts in a dynamically constructed hierarchy then it would make browsing simple.
Query based access could be done by simple Boolean joins.

In Tim Bray's example, he wants to disambiguate Petroleum+Geology->Drills and Military+Training->Drills by using different
classification schemes. My current job is in the digital library area, so I, too feel the pull of well defined classification
schemes.

However, I think a better (in the Web 2.0, loosely bound sense) way to disambiguate would be to
have a service that returns data for a query like:

(Petroleum+Geology AND Drills)

This query would return all records that have been tagged with both
Petroleum+Geology AND Drills tags.

But what about Drills records that should be Petroleum+Geology records but haven't been tagged as such?
Well now you get a choice. You can have highly specific data that you know is correct by ignoring those records, or else
you can offer lower quality data but still try to remove data you know to be irrelevant:

Drills NOT (Military+Training AND Drills)

Browsing would use the same kind of queries. For instance, look at the Del.icio.us
programming tag. Currently the first item is tagged as
python, strings and programming. Obviously these concepts must be linked in
some way, so they should be presented as such in the user interface:

-->programming
programming > python
programming > strings

Now, if you go to programming > python you will get a dynamic page constructed using the query:

programming AND python

This is useful because it will remove all pages about python snakes from view.

The user interface would now change again:

programming
-->programming > python
programming > python > strings

Going to the programming > python > strings page will get you the data from the query:

programming AND python AND strings

I think this idea would provide quite a useful advancement on the current tagging mechanisms used in Folksonomies.
Now I just need to build this thing..

UPDATED: As it happens Del.ici.us already has support for querying on intersection of tags using the tag1+tag2 syntax. del.icio.us/tag/programming+python is the python programming tag.

Comments

Technorati Tags

Technorati Tags are cool, but I wonder how they will disambiguate (is that a word) them?

For instance, the Java Tag shows pictures of coffee, has blogs about programming and is only saved from a some fairly heated political discussions by the fact there are few Indonesian bloggers.

There are (clustering) algorithms that will solve this for you, but Technorati doesn't appeare to be using them at the moment.

Comments

My new toy

RC Car

Radio-controlled, with proper steering and only A$35.

It's called a SuperSlicks Mini RC car. You can get faster motors, tires and different 1:43 scale bodies. I probably won't do that, but it is a great little toy.

Comments

The Java Co-Processor

Azul Systems is set to release a “Network attached processor” for Java applications.

The key to network attached processing is Azul virtual machine proxy technology. This patent-pending technology, initially targeted at Java and J2EE platform-based applications, transparently redirects application workload to the compute pool. No changes are required to applications, or the existing infrastructure configuration. The Azul technology works with J2EE platform products including BEA® WebLogic® and IBM® WebSphere® application servers. Compute pool appliances are simply connected to the network and Azul software is installed on the application hosts. Suddenly every application has access to a virtually unlimited set of compute resources.

Each compute pool consists of two or more redundant compute appliances—devices designed solely to run massive amounts of virtual machine-based workloads. Each appliance has up to 384 coherent processor cores and 256 gigabytes of memory packed in a purpose-built design that delivers the benefits of symmetric multiprocessing with tremendous economic benefits. The massive SMP capacity of these appliances enables applications to dynamically scale, responding to varying workload and spikes without the pain of having to reconfigure or provision application tier servers. The targeted design provides small unit size, high rack density, low environmental costs, and simple administration.

Azul Systems

According to The Register, Azul has a custom multicored processor, which contains 24 cores.

Comments