Information Gathering in the World-Wide Web: The W3QL Query Language and the W3QS System.
David Konopnicki, Oded Shmueli:
Information Gathering in the World-Wide Web: The W3QL Query Language and the W3QS System.
ACM Trans. Database Syst. 23(4): 369-410(1998)@article{DBLP:journals/tods/KonopnickiS98,
author = {David Konopnicki and
Oded Shmueli},
title = {Information Gathering in the World-Wide Web: The W3QL Query Language
and the W3QS System},
journal = {ACM Trans. Database Syst.},
volume = {23},
number = {4},
year = {1998},
pages = {369-410},
ee = {http://doi.acm.org/10.1145/296854.277639, db/journals/tods/KonopnickiS98.html},
bibsource = {DBLP, http://dblp.uni-trier.de}
}
BibTeX
Abstract
The World Wide Web (WWW) is a fast growing global information resource. It contains an enormous amount of information and provides access to a variety of services. Since there is no central control and very few standards of information organization or service offering, searching for information and services is a widely recognized problem. To some degree this problem is solved by "search services," also known as "indexers," such as Lycos, AltaVista, Yahoo, and others. These sites employ search engines known as "robots" or "knowbots" that scan the network periodically and form text-based indices. These services are limited in certain important aspects. First, the structural information, namely, the organization of the document into parts pointing to each other, is usually lost. Second, one is limited by the kind of textual analysis provided by the "search service." Third, search services are incapable of navigating "through" forms. Finally, one cannot prescribe a complex database-like search. We view the WWW as a huge database. We have designed a high-level SQL-like language called W3QL to support effective and flexible query processing, which addresses the structure and content of WWW nodes and their varied sorts of data. We have implemented a system called W3QS to execute W3QL queries. In W3QS, query results are declaratively specified and continuously maintained as views when desired. The current architecture of W3QS provides a server that enables users to pose queries as well as integrate their own data analysis tools. The system and its query language set a framework for the development of database-like tools over the WWW. A significant contribution of this article is in formalizing the WWW and query processing over it.
Copyright © 1998 by the ACM,
Inc., used by permission. Permission to make
digital or hard copies is granted provided that
copies are not made or distributed for profit or
direct commercial advantage, and that copies show
this notice on the first page or initial screen of
a display along with the full citation.
CDROM Version: Load the CDROM "Volume 4 Issue 1, Books, VLDB-j, TODS, ..." and ...
DVD Version: Load ACM SIGMOD Anthology DVD 2" and ...
BibTeX
[Abstract and Index Terms]
[Full Text in PDF Format, 1323 KB]
References
- [Abiteboul et al. 1993]
- Serge Abiteboul, Sophie Cluet, Tova Milo:
Querying and Updating the File.
VLDB 1993: 73-84 BibTeX
- [Alta Vista Home Page 1996]
- AltaVista.
http://www.altavista.com/ BibTeX
- [Beck 1995]
- ...
- [Beeri and Kornatzky 1990]
- Catriel Beeri, Yoram Kornatzky:
A Logical Query Language for Hypertext Systems.
ECHT 1990: 67-80 BibTeX
- [Berners-Lee 1994]
- Tim Berners-Lee:
Request for Comments: 1738, Uniform Resource Locators (URL).
(1994) http://www.w3.org/Addressing/rfc1738.txt BibTeX
- [Bush 1945]
- Vannevar Bush:
As We May Think.
The Atlantic Monthly 176(1): 101-108(1945) BibTeX
- [Consens and Mendelzon 1989]
- Mariano P. Consens, Alberto O. Mendelzon:
Expressing Structural Hypertext Queries in GraphLog.
Hypertext 1989: 269-292 BibTeX
- [De Bra and Post 1994]
- ...
- [Fielding et al. 1996]
- Roy T. Fielding, Henrik Frystyk Nielsen, Tim Berners-Lee:
Internet Draft: Hypertext Transfer Protocol - HTTP/1.1.
http://www.w3.org/Protocols/HTTP/1.1/spec.html BibTeX
- [Graham 1994]
- ...
- [Grobe 1994]
- ...
- [Halasz 1988]
- Frank G. Halasz:
Reflections on NoteCards: Seven Issues for the Next Generation of Hypermedia Systems.
Commun. ACM 31(7): 836-852(1988) BibTeX
- [Java Language Home Page 1995]
- Java Language Home Page.
http://java.sun.com/ BibTeX
- [Johnson 1996]
- ...
- [Konopnicki and Shmueli 1995]
- David Konopnicki, Oded Shmueli:
W3QS: A Query System for the World-Wide Web.
VLDB 1995: 54-65 BibTeX
- [Lackshmanan et al. 1996]
- Laks V. S. Lakshmanan, Fereidoon Sadri, Iyer N. Subramanian:
A Declarative Language for Querying and Restructuring the WEB.
RIDE-NDS 1996: 12-21 BibTeX
- [Lycos 1995]
- Lycos.
http://www.lycos.com BibTeX
- [McBryan 1994]
- ...
- [MetaCrawler Home Page 1996]
- MetaCrawler.
http://www.metacrawler.com BibTeX
- [Mihaila 1996]
- ...
- [Minohara and Wanatabe 1993]
- Tatsuo Minohara, Ryuichi Watanabe, Mario Tokoro:
Queries on Structures in Hypertext.
FODO 1993: 394-411 BibTeX
- [Netscape-Net Search 1994]
- ...
- [Pinkerton 1994]
- ...
- [Vromans and Design 1983]
- ...
- [Yahoo Home Page 1994]
- Yahoo.
http://www.yahoo.com/ BibTeX
Referenced by
- Ravi Kumar, Prabhakar Raghavan, Sridhar Rajagopalan, D. Sivakumar, Andrew Tomkins, Eli Upfal:
The Web as a Graph.
PODS 2000: 1-10
- Soumen Chakrabarti, Martin van den Berg, Byron Dom:
Distributed Hypertext Resource Discovery Through Examples.
VLDB 1999: 375-386
- Frédérique Laforest, Anne Tchounikine:
A Model for Querying Annotated Documents.
ADBIS 1999: 61-74
BibTeX
ACM SIGMOD Anthology - DBLP:
[Home | Search: Author, Title | Conferences | Journals]
TODS, ACM SIGMOD Anthology: Copyright © by ACM (info@acm.org), Corrections: anthology@acm.org
DBLP: Copyright © by Michael Ley (ley@uni-trier.de), last change: Wed Jun 4 19:23:49 2008