From Now On
|
|
Vol 7|No 4|January|1998 |
|
|
Speed | How quickly do pages open and how swiftly do results of searches appear? Some search engines are graphically (advertising) bloated. |
Boolean Capabilities | Learning how to use logical operators such as AND, OR and NOT greatly enhances your searching. If the search engine doesn't support Boolean searching, don't waste your time. Fuzzy Logic usually produces fuzzy search results. |
Power Sorting & Searching Features | The best search engines allow you to sort and sift findings by dates, type of page, type of domain (com vs. edu), etc. |
Interface Design | Good search engines keep the search box front and center where you can enter words without scrolling about. How well are the items and choices laid out on the page? Can you find what you need when and where you need it? Do ads get in the way of your searching? |
Browsing Capabilities | Can you look at more than 10 "hits" at a time? It is very helpful to quickly scan the first 100 hits. Is the description or annotation of each item sufficient to provide a basis for making a wise choice? Can you determine relevancy? |
Breadth & Depth of Database | Some search engines and their spiders do a better job of collecting from the world's vast array of Web sites. Is larger better? |
Digital Logic | Many search engines employ artificial intelligence and digital logic to determine relevancy, proximity and other issues which might help you find the best information. Do they explain the rules of their system or are they hidden? When Web sites and pages appear in the top 20, do they actually belong there? |
Reasonable Advertising | Some search engines accept advertising which might offend. Do you care? If so, which seem OK to you? |
Useful Results | The bottom line or essential question is "How useful are the results?" Do you find reliable and relevant information swiftly? |
No matter which search engine you select, do not settle for the "simple search" version of any. You are sacrificing power and accuracy for ease of use. It's a bit like training wheels on a bike. Go for the Power Search or the Advanced Search and encourage your students to do the same. They offer features which will vastly improve your results when combined with the suggestion in this article.
The more powerful the search engine, the more important the syntax - the rules governing how you enter your search query. Because few people stop to read and learn these rules, they end up with crude and clumsy searches.
For example, some search engines care about CAPITAL LETTERS and punctuation. Others ignore them both. If you search for Washington, D.C. with the following query, you may achieve no "hits" with one search engine and thousands with another . . .
WASHINGTON, DC
Another example . . . when you want an exact phrase such as "old growth forest," some search engines require quotation marks around the words which belong together, while others do not care.
Another example . . . when you are conducting Boolean searches, some search engines require that you capitalize AND, OR and NOT. Failure to do so may convince your search engine to ignore these critical words.
You can usually find the syntax for any search engine in the Help pages. If not, you may want to look for an engine that explains its rules.
As Info-Glut has grown to be more and more of a problem, the search engines have competed fiercely to offer the best tools to support you in your sorting and sifting, and yet I have witnessed hundreds of folks ignoring these powerful extra tools and features.
Altavista's Advanced Search offers important extra features:
HotBot's Super Search allows you to request particular domains, particular dates, particular levels of a Web site, particular countries of origin, particular types of files and as many as 100 results at a time. These can be very helpful search features.
The Power Searcher explores all of the features of a chosen search engine in advance of real searching in order to apply these extra tools with skill when they are needed.
Effective searching requires a balance between a broad reach and a careful aim. The searcher must cast a net far enough to capture the most important information, and then, once safely contained, must cull the results so that only the best information remains.
Our first search for "old growth forest" produces more than 5,000 hits, many of which are irrelevant. When we REFINE SEARCH as described above, we see that these 5600 hits fall into major categories such AS forest, habitat, ecosystem, lands, conservation, wildlife, etc. Since we are just beginning our exploration, we may want to conduct fresh searches for each of these concepts and use the REFINE SEARCH function once again to see what other ideas, issues and possibilities are associated with our original search phrase.
Too many searchers narrow their search prematurely, thereby condemning themselves to the boundaries and ideas of their prior knowledge.
Most of us do not understand the forest of our question well enough at this stage to start looking at individual trees and bark and leaves.
This is the time to map out the territory into conceptual zones, each of which becomes a neighborhood worthy of more carefully focused exploration.
If we have little prior knowledge of "sustainability" we could not know what we do not know and could not plan or pose the questions we ought to be exploring. We might learn some before we focus narrowly and sharply.
Our search on Altavista provides us with a much richer appreciation of the subtopics associated with sustainability and delivers a list of key words which will direct the more pointed search process.
Here are the results of our search for "sustainability."
About 46779 documents match your query.
When we REFINE the search, we find new ideas and categories . . .
Each of these is a rich vein worthy of further investigation.
Early search efforts are meant to provide an overview of the information landscape relevant to the investigation at hand, much like petroleum prospectors flying over a region and noting the terrain, seeking convergence (a combination of geological elements in one location which hints at the presence of oil).
While it is tempting to start right off opening pages and looking for information, it is more effective to wait until you have scanned the brief descriptions most search engines provide for the hits. Scanning the top 100 hits provides a basis for revising the original search to accomplish two goals:
Think of the first search as a pot-luck supper with a 400 foot long table. Would you step up to the very first dish and start heaping food onto your plate? Or would you browse and graze before making choices?
Some of the best sites on the Internet are not indexed by the search engines. It pays to go to such Web sites and search them directly if they would be the leading source of information on a particular topic.
One outstanding example would be the huge collection of educational research documents housed at ERIC (New AskERIC Homepage). If you perform a search for "middle school guidance" on the free Internet, you are unlikely to find much worth reading (a mere 195 hits with Altavista), while ERIC provides hundreds of abstracts of research articles from several decades.
It turns out that because many public agencies and news media Web sites do not permit access (perhaps for security reasons) to the spiders of search engines, their contents often elude the search engine's efforts as well as our own search attempts.
If you know that the New York Times, the Wall Street Journal, the Seattle Times and the Los Angeles Times have all been giving careful attention to a current lawsuit such as the Justice Department suit against Microsoft, you may achieve far better results by starting your search at their Web sites rather than wasting time with a global search engine which would overlook their offerings.
After several years of looking for a good source of crime statistics using global search engines, I came to realize that I would have done better if I had started with the source . . . the Federal Bureau of Justice, a site which never showed up when I would use global search strategies. I also discovered, almost accidentally, that Money Magazine often provides better statistics on leading cities than most of the cities themselves because it publishes an annual rating of the top cities . . . Money Online: Best Places: Money ranks the 300 biggest places.
What is the best way to know what source or Web site to visit?
After you have conducted several browsing searches, you may begin to focus your search more sharply by adding key words to your search in order to limit hits to pages distinctly relevant to your inquiry.
Careful selection and addition of key words which are discriminating, distinguishing and distinctive, puts the spotlight on just those discrete pages which match your interests. Your key words differentiate, separate and reserve only the best pages.
How different would your results be with each of the following words?
The more you particularize your search, the better your results. Adding particulars and specifics excludes all pages which do not contain those items. The advantage is sharp focus. The danger is bypassing, missing or overlooking key data.
Sometimes it pays to alternate between narrowing and broadening. After zeroing in with some particulars, zoom back out and try some different particulars.
This may also be a good time to use wild cards, truncation, the logical operator "OR" and "exact phrase" syntax.
The most powerful strategy for culling your original findings is the use of the logical operator "NOT" - which may be AND NOT in some search engines (Altavista).
As you browse the first 100 hits, look for patterns and groupings of irrelevant pages. In order to exclude all such irrelevant pages from your collection, add AND NOT with a series of key words contained within parentheses and divided by OR).
If you search for Mayflower do you really want all the motels, hotels, insurance companies and moving companies? If not, try the following search . . .
Power Searching is often more successful when you listen to your intuition and take chances. If you enjoy word play, you will have good luck with the trial-and-error searching process which leads to good results. Listen to your hunches. Be a Sherlock Holmes. Don't be so analytical and logical that you cannot make intuitive leaps. Digital Logic is the attempt to find order in an erratic, chaotic and disorganized information landscape.
Credits: The background is from Jay Boersma.
Other drawings, photographs and graphics are by Jamie McKenzie.
Copyright Policy: Materials published in From Now Onmay be duplicated in hard copy format if unchanged in format and content for educational, non-profit school district use only. All other uses, transmissions and duplications are prohibited unless permission is granted expressly. Showing these pages remotely through frames is not permitted.
FNO is applying for formal copyright registration for articles.
Network 609