Mar
Stanford University Libraries Launches Federated Search
Press Release
March 27, 2008
If you think everything you’re looking for can be found through Internet search engines, think again. The “crawl” technology of search engines like Google identifies and indexes only a fraction of all the information that is available on the Web. The “invisible web” is made up of thousands of databases and searchable sources that contain highly targeted and valuable information, and whose content is inaccessible and therefore neither seen nor indexed by traditional search engines.
These online resources are often the bread and butter for today’s scholars: news archives, e-journal aggregators, indexing services, proprietary databases, and subscription services. They are generally invisible to the casual Internet search. As an example, Stanford University Libraries and Academic Information Resources (SULAIR) subscribes to thousands of scholarly journals and hundreds of licensed databases. And while these represent a trove of valuable resources, scholars have traditionally had no easy mechanism to search across multiple sources simultaneously.
For Michael Keller, University Librarian, giving scholars effective tools to discover digital information is an over-riding priority: “The digital revolution and the explosion in the number of relevant online resources are fantastic opportunities, but also a great challenge. Stanford students and faculty need a way to reduce the amount of time they spend searching, and to make it easier for them to find precisely relevant information on multidisciplinary research topics.”
To address these needs, SULAIR has partnered with Deep Web Technologies to release a prototype federated search service tailored to the environment of an academic research library. A federated search simultaneously queries a number of online resources held across many different, isolated systems and returns the results in one merged list. This gives scholars a quick and broad view across a number of possible sources, letting them identify the most promising areas and resources in which to extend their search.
Three demonstrations of federated search within the Stanford environment are now online:
1. All Library Catalogs at Stanford (Socrates, Lane Medical, Jackson Business, Stanford Linear Accelerator, and the Health Library of the Stanford Hospitals and Clinics). (Openly accessible)
2. “Top 10” Databases at Stanford (ABI/Inform, Annual Reviews, Biosis, Dissertations & Theses-A&I, Engineering Village, Expanded Academic ASAP, Lexis Nexis Academic, Periodicals Archive Online, PsycINFO, and Web of Science. (Limited to current Stanford students, faculty, and staff)
3. Stanford Digitized Content available from http://collections.stanford.edu (Openly accessible)
Abe Lederman, president and founder of Deep Web Technologies, said that: “I’m very excited to be working with Stanford on this project. Deep Web Technologies has been very successful in building search tools for the science communities and national research communities, and the opportunity to work with SULAIR lets us explore how we can extend our services into one of the world’s leading academic research libraries.”
Unlike a standard search engine, Deep Web Technologies’ federated search product, Explorit Research Accelerator, does not rely upon a stored index built in advance. Instead, it operates in real time, replicating the query and broadcasting into multiple databases. Lederman explains: “The Deep Web search engine immediately reaches out to relevant databases at various sites, drilling down into these information centers all at once, organizing the info and merging the results in a ranked priority list—in real time. So, not only are you delving into databases, you’re getting the most precise and current results.”
Employing a familiar, intuitive search interface, Explorit quickly presents high-value results from a common, single point of access. Explorit is currently powering a number of science, technology and government search portals, including National Digital Library for Agriculture, Science.gov, Scitopia, and WorldWideScience.org. Explorit is also being used for federated search at Intel’s corporate library.
Links for More Information
RaPIDs Group, Stanford University Libraries & Academic Information Resources
http://library.stanford.edu/rapids/
Deep Web Technologies
http://deepwebtech.com/
Whitepaper: “How to Maximize Your Strategic Investment in Federated Search” http://www.deepwebtech.com/PDFs/Whitepaper.pdf
Federated Search Blog
http://federatedsearchblog.com/
Combined Search Prototypes Developed by Deep Web Technologies and SULAIR:
All Library Catalogs at Stanford
http://deepweb.stanford.edu/catalogs/search.html
”Top 10” Databases at Stanford
https://deepweb.stanford.edu/su/
Stanford Digitized Content
http://deepweb.stanford.edu/digcolls/search.html
Other Combined Search Sites Developed by Deep Web Technologies:
National Digital Library for Agriculture (NDLA)
http://www.nal.usda.gov/ndla
Science.gov (Gateway to U.S. Government Information, incl. R&D Results)
http://www.science.gov
Scitopia (Portal containing documents from 15 sci-tech societies plus patents and government data)
http://www.scitopia.org
WorldWideScience.org (Global Science portal developed by U.S. Dept of Energy and British Library)
http://worldwidescience.org/
Information Outlook: Federated Search at the Intel Library (article available courtesy of SLA) http://www.deepwebtech.com/PDFs/SLA-Hill-Intel-0710.pdf
Check out The Federated Search Blog to stay current with the industry..
Tags: federated search