- The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution sites, journal sites, and researcher homepages. No index covers all of the available literature, and the major web search engines typically do not index the content of Postscript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use.
- Lawrence, Steve, Kurt D. Bollacker & C. Lee Giles & (1999). "Indexing and Retrieval of Scientific Literature." Proceedings of the Eighth International Conference on Information and Knowledge Management, p.139-146, November 02-06, 1999, Kansas City, Missouri, United States [^]
- Lawrence, Steve & C. Lee Giles (1998). "Searching the World Wide Web," Science, April 3, 1998, pp. 98-100. [^]
- Bollacker, Kurt D.; Steve Lawrence & C. Lee Giles (1998). "CiteSeer: An Autonous Web Agent for Automatic Retrieval and Identification of Interesting Publications," Proceedings of the Second International Conference on Autonomous Agents, May 10-13, 1998, Minneapolis, Minnesota, United States, pp. 116-123. [^]
- Brin, Sergey & Lawrence Page (1998). "The Anatomy of a Large-scale Hypertextual Web Search Engine," Proceedings of the Seventh International Conference on World Wide Web 7, p.107-117, April 1998, Brisbane, Australia. [^]
- Literature/1997/Cameron [^]
- Garfield, Eugene (1964). "Science Citation Indexing -- A New Dimension in Indexing." Science 144 (3619): 649-654. [^]