third myth in the article compares B-trees to hash tables: "Trees are better In ...

alayne · on Oct 2, 2012

It looks like the paper doesn't refer to B-trees. It refers to binary search trees.

brey · on Oct 2, 2012

and so the article confusing B-trees with binary search trees doesn't exactly increase confidence here ...

kroidkrensen · on Oct 2, 2012

There are two articles that are linked to in the text, one that describes comparisons of binary tress, self-adjusting tress, and hash tables. There's a second article that compares B-trees, hash tables, and a few other structures -- that's this one: http://ww2.cs.mu.oz.au/~jz/fulltext/acmtois02.pdf

tptacek · on Oct 2, 2012

What "confidence" do you need for a straightforward comparison of hash tables and binary trees? These aren't Apple rumors; this is CS 101.

brey · on Oct 2, 2012

my point is merely that as written, the article is confusing.

to paraphrase: "hashtables are faster. here's some research showing b-trees are slower than hashtables. why would you ever use a tree? prefix matching, that's why."

which misses the point that the primary reason for using a B-tree as a general purpose database index is not about prefixes, it's about block devices (prefixes is an important reason, but it's not the main one)

if the purpose of the article was to ignore implementation and spinning disks and just look at in-memory performance, then don't mention B-trees. they're an implementation-specific optimisation, and it just confuses.