Detecting duplicate blog entries with Bloom filters

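Blog search engines seem to have trouble detecting duplicate feeds (and duplicate entries). Bloom filters might be worth investigating as a cheap way to flag those posts: a Bloom filter can say that an entry has definitely not been seen before, or that it has probably been seen, using far less memory than storing every entry. If I get time, I'll try to put together a demo.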
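To make the idea concrete, here is a minimal sketch in Python of how a Bloom filter could flag probable duplicate entries. Everything in it is an assumption for illustration: the `BloomFilter` class, the `entry_key` normalization (lowercase, collapsed whitespace), the filter size, and the number of hash positions are all hypothetical choices, not what any real search engine does. A hit only means "probably seen before", so a real system would still need an exact check on flagged entries.

```python
import hashlib


class BloomFilter:
    """A small Bloom filter backed by a bytearray used as a bit set."""

    def __init__(self, num_bits=8192, num_hashes=5):
        # Sizes are arbitrary illustration values, not tuned for real feeds.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Double hashing: derive all bit positions from two 64-bit
        # slices of a single SHA-256 digest.
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1  # force odd step
        return [(h1 + i * h2) % self.num_bits for i in range(self.num_hashes)]

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        # True means "probably present"; False means "definitely absent".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


def entry_key(title, body):
    # Hypothetical normalization: lowercase and collapse whitespace so
    # trivially reformatted copies map to the same key.
    return " ".join((title + " " + body).lower().split())


def find_probable_duplicates(entries, bloom=None):
    """Return the entries whose normalized key was probably seen earlier."""
    if bloom is None:
        bloom = BloomFilter()
    duplicates = []
    for title, body in entries:
        key = entry_key(title, body)
        if key in bloom:
            duplicates.append((title, body))
        else:
            bloom.add(key)
    return duplicates


if __name__ == "__main__":
    posts = [
        ("Hello World", "First post"),
        ("Other news", "Something different"),
        ("hello   world", "first POST"),
    ]
    for title, _ in find_probable_duplicates(posts):
        print("probable duplicate:", title)
```

This only catches entries that are identical after normalization. Near-duplicates with edited text would need a different key, such as a hash over shingles, which is outside this sketch.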
