English, British subtitles

← Why might the estimated number of results change when going from page 1 to page 2?

Get Embed Code
40 Languages

Showing Revision 2 created 10/10/2014 by Sherry Chen.

  1. Today's question comes from
    San Francisco, California.
  2. Blind Five Year Old asks,
  3. "How reliable is the site colon,
  4. query, and why does the total

  5. count sometimes change from
  6. Not Synced
    page one to page two?"
  7. Not Synced
    Well, I wouldn't really put it in terms of
  8. Not Synced
    the site colon query!
  9. Not Synced
    Instead, I would pivot and narrow
  10. Not Synced
    and focus down on the
  11. Not Synced
    results estimates.
  12. Not Synced
    So there's a few things to know.
  13. Not Synced
    Number one, if you haven't noticed,
  14. Not Synced
    you should notice
  15. Not Synced
    that results estimates are
  16. Not Synced
    only accurate to about three
  17. Not Synced
    significant digits.
  18. Not Synced
    So they are estimates.
  19. Not Synced
  20. Not Synced
    As we're looking for queries,
    at some point we say, OK, we
  21. Not Synced
    have enough to constitute the first page,
    and we see how far
  22. Not Synced
    we've gone through
    our posting list - the
  23. Not Synced
    amount of data we have--
  24. Not Synced
    therefore, we can estimate
    there's about this many
  25. Not Synced
    results left.
  26. Not Synced
    Now when you click from page one to page two, suddenly
  27. Not Synced
    you're going deeper.
  28. Not Synced
    And so you have a better estimate or you have a
  29. Not Synced
    slightly different estimate, because you're starting to
  30. Not Synced
    look not just a little bit of the way through all those
  31. Not Synced
    documents, you're actually starting to
  32. Not Synced
    go deeper and deeper.
  33. Not Synced
    So it's relatively common for our results estimates to
  34. Not Synced
    change going from page one to page two,
    typically becoming
  35. Not Synced
    more accurate.
  36. Not Synced
    So it's not usually a factor of
    the site colon query.
  37. Not Synced
    It's usually a factor of us
    digging deeper through our
  38. Not Synced
    data, and as a result,
    we have a better or at least a
  39. Not Synced
    different estimate for how many results there are.
  40. Not Synced
  41. Not Synced
  42. Not Synced
  43. Not Synced