I’ve been playing around with the spiffy sales rank tracker Matthew Beckler wrote, because I’m a great big dork, and enjoy playing with graphs. Here’s a graph of the sales rank vs. time through 2pm EST today (plotted in Excel from the data table at the bottom of the page):
As I noted in my previous post on this, the downward-going jumps are striking, and probably indicate discrete book purchase events. There also seems to be a clear trend that jumps starting at higher numbers are larger than jumps starting at lower numbers. If we assume that’s the case, what does that tell us about the relationship between number of sales and sales rank?
Here’s a graph of the size of the downward-going jump as a function of the sales rank at the time of the jump (I took the absolute value, so all the axes increase in the normal directions):
That’s remarkably linear for such dodgy data– astronomers and social scientists would kill for graphs this clean. If you fit a line to the whole set, the slope is around 0.42; if you throw out the four really small points in the 70,000 range, it’s 0.48. So the sale of a single book (assuming those jumps are individual sales) changes a book’s ranking by about 400 spots for every 1,000 places in the initial ranking. The two outliers above the line turn out to be glitches in the file, places where one point didn’t get read in properly (2 errors out of 160-odd points is pretty good for Excel).
Of course, this model predicts the effect of a single sale should go to zero at a rank of about 14,000 (18,000 if you throw out the outliers), so it’s obviously not perfect, but I’m amazed that it looks that linear.
What about the other direction? Here’s a plot of the change in rank for an hour with no sales, as a function of the rank at that time:
That’s much messier, but I wouldn’t expect it to be very good. Looking at the data, some sort of exponential decay with a time constant of 8-12 hours is probably a better model. That sort of data massage is not something I know an easy way to do in Excel, though, so it’ll have to wait until I’m at work and have access to SigmaPlot.