[picture of a bar
..........
Searching the World Wide Web
Also see: Some Major Search Engines / Directories and How They Work
Boolean Search Logic  |  Evaluating Web Resources
 
 
Searching the WWW involves using a search engine, a directory, or some combination of these two. Because there is so much information on the Web, good and bad, finding what you want is not an exact science and can be time consuming. Though many search engines rank material according to their idea of what is relevant, that doesn't mean the material is relevant to want you want or is reliable. The information is only as good as the source of the material itself, not the search engine that found the material.

Search engines use specialized software to create their listings automatically. A "spider" (or crawler) "crawls" the web, for example, by visiting a web page periodically, reading it, and following the links. An index (or catalog) of the entire web is then compiled from the spider's search. The search software looks through the index to find matches to a user's search terms and ranks them in order of perceived relevance. The higher the number the more relevant that particular search engine thinks a "hit" is. The search service's idea of relevancy may or may not be relevant for your purposes. 

Meta "crawlers"  search a number of search engines at one time. 

WWW site creators may submit their sites for inclusion in a subject listing or a directory manager may find them. Companies can also pay to be included in search results. The search software looks for matches only from the chosen sites. The search software can only operate under the parameters set up by the programmer. They all work differently.

Majpr brpwsers allow you to search by just typing in keywords into the Web browser location "box."

Search tips:

Because there are so many URL's contained in a database, getting search results which are relevant may take some time and know-how. Because search engines work differently, using the same search words in different search services may get you different results. Each search service provides useful tips for searching its own database. Some search services are case sensitive; some let you use phrase; some rank relevancy of results (Search services or engines which have relevancy ranking means that the higher the number in the ranking, the more relevant the content).

Don't limit yourself to one search engine. Check the search engines for their specific ways to refine your search. Be prepared to revise your search. Compare results using different search services. Using synonyms sometimes helps. Search services and engines don't know what would be relevant to your particular search. Since companies can now pay to be included in searches, the "hits" might include entries that are there merely because the business paid to have their company appear in search results.  Sometimes you may get results that are relatively meaningless. Searching is by no means an exact science. But, like using a library database, the terms you plug in affect the results you get back.

Most major search engines also allow you to search for online forums, newsgroups, images, and other categories of information.

Don't just rely on the Internet for serious research. A significant amount of scholarly research has not reached the Internet or is protected from public access by "members only" subscriptions. Don't neglect your library databases for scholarly resources.

Google is a leader in search engines.  According to  John Markoff and G. Pascal Zachary, as of April 2003, Google "handles 200 million searches of the Web each day, a staggering one-third of the estimated daily total. Google, they say, handles " hundreds of thousands of queries simultaneously from all over the globe, each in less than half a second"( In Searching the Web, Google Finds Riches, N Y Times, 4/13/2003).

The Vivisimo search engine, developed by researchers at Carnegie Mellon University in Pittsburgh, uses an automatic document clustering technology.  According to  PR Newswire,  April 8, 2003, Vivisimo's "breakthrough clustering and meta-search products retrieve textual information from one or multiple sources and automatically organize[s] the combined results" into relevant folders." 

Most search engines allow you to use Boolean operators. 

What are Boolean Operators?

Boolean operators or words or symbols which help you create specific relationships among keywords or phrases "OR," "NOT," "AND," or "NEAR" commands. Often there is a separate box in the search service or engine in which to place your Boolean operators. Some search services do not use Boolean operators.

 
Some Boolean  Operators
KEYWORD METHOD
AND Finds only documents containing all of the specified words or phrases. "Computers AND viruses" finds documents with both the word computers and the word viruses.
OR Finds documents containing at least one of the specified words or phrases. "Computers OR viruses" finds documents containing either computers or viruses The found documents could contain both, but do not have to.
NEAR Finds documents containing both specified words or phrases within a certain number 10 words of each other. AltaVista searches for 10 word proximity; Lycos 20 words, for example.
NOT Finds only documents which exclude the term after the word "NOT." "Computers NOT viruses" finds documents with the word "computers" but not the word "viruses."
 
Other Search Symbols
SYMBOL METHOD
+ Terms preceeded with "+" indicate they have priority in the search.
" "  Phrases between quotation marks are search exactly as they are written. 
* This symbol is used for Wildcard searching. For example, comput* will search Computer, computing, etc.
[picture of a bar]
 Some Major Search Engines / Directories and How They Work
Boolean Search Logic  |  Evaluating Web Resources
virginia montecino[montecin@gmu.edu]
back to top