Review - Aqua: A Fast Decision Support Systems Using Approximate Query Answers.

H. V. Jagadish: Review - Aqua: A Fast Decision Support Systems Using Approximate Query Answers. ACM SIGMOD Digital Review 1: (1999)

Review

The need for producing quick answers to large queries is well-recognized, even if these answers be approximate. See, for instance, [2]. There are two primary ways to accomplish this. One is to sample "on-the-fly", as suggested in [3]. The other is to recognize that typical large queries involve large aggregations, so one could estimate values for these aggregates with the help of pre-computed information such as histograms.

Whereas the first technique is more flexible in the types of queries it can support and in the extent of approximation error allowed, the second technique requires less change to the existing query processing system of a database, and is likely to be substantially faster and more robust.

The Aqua project, being reviewed here, is the leading implementation of the second technique. I was impressed by the demonstration I saw at the VLDB conference, and believe this is a project worth paying attention to.

References

[1]: Swarup Acharya, Phillip B. Gibbons, Viswanath Poosala: Aqua: A Fast Decision Support Systems Using Approximate Query Answers. VLDB 1999: 754-757
[2]: Daniel Barbará, William DuMouchel, Christos Faloutsos, Peter J. Haas, Joseph M. Hellerstein, Yannis E. Ioannidis, H. V. Jagadish, Theodore Johnson, Raymond T. Ng, Viswanath Poosala, Kenneth A. Ross, Kenneth C. Sevcik: The New Jersey Data Reduction Report. IEEE Data Eng. Bull. 20(4): 3-45(1997)
[3]: Joseph M. Hellerstein, Peter J. Haas, Helen J. Wang: Online Aggregation. SIGMOD Conference 1997: 171-182