The Other Side of the Search God's Abracadabra!

Thousands of servers ...billions of web pages.... thethe subject is unfamiliar. Similarly, the concept
possibility of individually sifting through the WWWbased search of Excite (instead of individual
is null. The search engine gods cull the informationwords, the words that you enter into a search
you need from the Internet...from tracking downare grouped and attempted to determine the
an elusive expert for communication to presentingmeaning) is a difficult task and yields inconsistent
the most unconventional views on the planet.results.
Name it and click it. Beyond all the hype created
about the web heavens they rule, let's attempt to
keep the argument balanced. From Google to
Voice of the Shuttle (for humanities research)Besides who reviews or evaluates these sites for
these ubiquitous gods that enrich the net, can bequality or authority? They are simply compiled by
unfair ...and do wear pitfalls. And considering thea computer program. These active search engines
rate at which the Internet continues to grow, therely on computerized retrieval mechanisms called
problems of these gods are only exacerbated"spiders", "crawlers", or "robots", to visit Web
further.sites, on a regular basis and retrieve relevant
keywords to index and store in a searchable
database. And from this huge database yields
often unmanageable and comprehensive
Primarily, what you need to digest is the fact thatresults....results whose relevance is determined by
search engines fall short of Mandrake's magictheir computers. The irrelevant sites (high
mechanism! They simply don't create URLs out ofpercentage of noise, as it's called), questionable
thin air but instead send their spiders crawlingranking mechanisms and poor quality control may
across those sites that have rendered prayersbe the result of less human involvement to weed
(and expensive offerings!) to them forout junk. Thought human intervention would solve
consideration. Even when sites like Google claim toall probes....read on.
have a massive 3 billion web pages in its
database, a large portion of the web nation is
invisible to these spiders. To think they are simply
ignorant of the Invisible Web. This invisible webFrom the very first search engine - Yahoo to
holds that content, normal search engines can'tabout.com, Snap.com, Magellan, NetGuide, Go
index because the information on many web sitesNetwork, LookSmart, NBCi and Starting Point, all
is in databases that are only searchable within thatsubject directories index and review documents
site. Sites like - The Internet Movie Database , -under categories - making them more
IncyWincy, the invisible web search engine and -manageable. Unlike active search engines, these
The Complete Planet that cover this area arepassive or human-selected search engines like
perhaps the only way you can access contentdon't roam the web directly and are human
from that portion of the Internet, invisible to thecontrolled, relying on individual submissions. Perhaps
search gods. Here, you don't perform a directthe easiest to use in town, but the indexing
content search but search for the resources thatstructure these search engines cover only a small
may access the content. (Meaning - be sure toportion of the actual number of WWW sites and
set aside considerable time for digging.)thus is certainly not your bet if you intend
specific, narrow or complex topics. Subject
designations may be arbitrary, confusing or wrong.
A search looks for matches only in the
None of the search engines indexes everything ondescriptions submitted. Never contains full text of
the Web (I mean none). Tried research literaturethe web they link to - you can only search what
on popular search engines? AltaVista to Yahoo, willyou see titles, descriptions, subject categories,
list thousands of sources on education, humanetc. Human-labor intensive process limits database
resource development, etc. etc. but mostly fromcurrency, size, rate of growth and timeliness. You
magazines, newspapers, and various organizations'may have to branch through the categories
own Web pages, rather than from researchrepeatedly before arriving at the right page. They
journals and dissertations- the main sources ofmay be several months behind the times because
research literature. That's because most of theof the need for human organization. Try looking
journals and dissertations are not yet availablefor some obscure topic....chances for the people
publicly on the Web. Thought they'll get you allthat maintain the directory to have excluded
that's hosted on the web? Think again.those pages. Obviously, machines can blindly count
keywords but they can't make common-sense
judgement as humans can. But then why does
human-edited directories respond with all this
The Web is huge and growing exponentially.junk?!
Simple searches, using a single word or phrase, will
often yield thousands of "hits", most of which will
be irrelevant. A layman going in for a piece of info
to the internet has to deal with a more severeAnd here's about those meta search engines. A
issue - too much information! And if you don'tcomprehensive search on the entire WWW using
learn how to control the information overloadThe Big Hub, Dogpile, Highway61, Internet Sleuth
from these websites, returned by a search result,or Savvysearch , covering as many documents
roll out the red carpet for some frustration. Aas possible may sound as good an idea as a one
very common problem results from sites thatstop shopping.Meta search engines do not create
have a lot of pages with similar content. For e.g., iftheir own databases. They rely on existing active
a discussion thread (in a forum) goes on for aand passive search engine indexes to retrieve
hundred posts there will be a hundred pages allsearch results. And the very fact that they
with similar titles, each containing a wee bit ofaccess multiple keyword indexes reduces their
information. Now instead of just one link, allresponse time. It sure does save your time by
hundred of those darn pages will crop up yoursearching several search engines at once but at
search result, crowding out other relevant site.the expense of redundant, unwanted and
Regardless of all the sophistication technology hasoverwhelming results....much more - important
brought in, many well thought-out search phrasesmisses. The default search mode differs from
produce list after list of irrelevant web pages. Thesearch site to search site, so the same search is
typical search still requires sifting through dirt tonot always appropriate in different search engine
find the gold. If you are not specific enough, yousoftware. The quality and size of the databases
may get too many irrelevant hits.vary widely.
As said, these search engines do not actuallyWeighted Search Engines like Ask Jeeves and
search the web directly but their centralizedRagingSearch allows the user to type queries in
server instead. And unless this database isplain English without advanced searching
updated continually to index modified, moved,knowledge, again at the expense of inaccurate
deleted or renamed documents, you will landand undetailed searching. Review or Ranking
yourself amidst broken links and stale copies ofSources like Argus Clearinghouse ( (eblast.com)
web pages. So if they inadequately handleand Librarian's Index to the Internet (lii.org). They
dynamic web pages whose content changesevaluate website quality from sources they find
frequently, chances are for the information theyor accept submissions from but cover a minimal
reference to quickly go out-of-date. After theynumber of sites.
wage their never ending war with over-zealous
promoters (spamdexers rather), where do they
have time to keep their databases current and
their search algorithms tuned? No surprise if aAs a webmaster, your site registration with the
perfectly worthwhile site may go unlisted!biggest billboards in Times Square can get you
closer to bingo! for the searcher. Those who didn't
even know you existed before are in your living
room in New York time!
Similarly, many of the Web search engines are
undergoing rapid development and are not well
documented. You will have only an approximate
idea of how they are working, and unknownYour URL registration is a no-brainer, considering
shortcomings may cause them to miss desiredthe generation of flocking traffic to your site.
information. Not to mention, amongst the firstCertainly a quick and inexpensive method, yet is
class information, the web also houses false,only a component of the overall marketing
misleading, deceptive and dressed up informationstrategy that in itself offers no guarantees, no
actually produced by charlatans. The Web itself isinstant results and demands continued effort for
unstable and tomorrow they may not find youthe webmaster. Commerce rules the web. Like
the site they found you today. Well if you couldhow a notable Internet caveman put it, "Web
predict them, they would not be god!...would they?!publishers also find dealing with search engines to
The syntax (word order and punctuation) forbe a frustrating pursuit. Everybody wants their
various types of complex searches varies somepages to be easy for the world to find, but
from search engine to search engine, and smallgetting your site listed can be tough. Search sites
errors in the syntax can seriously compromisemay take a long time to list your site, may never
the search. For instance, try the same phraselist it at all, and may drop it after a few months
search on different search engines and you'll knowfor no reason. If you resubmit often, as it is very
what I mean. Novices... read this line - using searchtempting to do, you may even be branded a
engines does involve a learning curve. Manyspamdexer and barred from a search site. And as
beginning Internet users, because of thesefor trying to get a good ranking, forget it! You
disadvantages, become discouraged andhave to keep up with all the arcane and
frustrated. Like a journalist put it, "Not showingever-changing rules of a dozen different search
favoritism to its business clients is certainly a rareengines, and adjust the keywords on your pages
virtue in these times." Search engines havejust so...all the while fighting against the very
increasingly turned to two significant revenueplausible theory that in fact none of this stuff
streams. Paid placement: In addition to the mainmatters, and the search sites assign rankings at
editorial-driven search results, the search enginesrandom or by whim.
display a second - and sometimes third - listing
that's usually commercial in nature. The more you
pay, the higher you'll appear in the search results.
Paid inclusion: An advertiser or content partner"To make the best use of Web search
pays the search engine to crawl its site andengines--to find what you need and avoid an
include the results in the main editorial listing.avalanche of irrelevant hits-- pick search engines
So?...more likely to be in the hit list but then againthat are well suited to your needs. And lest you'd
- no guarantees. Of course those refusing towant to cry "Ye immortal gods! where in the
favor certain devotees are industry leaders likeworld are we?", spend a few hours becoming
Google that publishes paid listings, but clearlymoderately proficient with each. Each works
marks them as 'Sponsored Links.'somewhat differently, most importantly in respect
to how you broaden or narrow a search.
The possibility of these 'for-profit' search gods
(which haven't yet made much profit) for takingFinding the appropriate search engine for your
fees to skew their searches, can't be ruled out.particular information need, can be frustrating. To
But as a searcher, the hit list you are providedeffectively use these search engines, it is
with by the engine should obviously rank in theimportant to understand what they are, how they
order of relevancy and interest. Search commandwork, and how they differ. For e.g. while using a
languages can often be complex and confusingmeta search engine, remember that each engine
and the ranking algorithm is unique to each godhas its own methods of displaying and ranking
based on the number of occurrences of theresults. Remember, search strategies affect the
search phrase in a page, if it appears in the pageresults. If the user is unaware of basic search
title, or in a heading, or the URL itself, or thestrategies, results may be spotty.
meta tag etc. or on a weighted average of a
number of these relevance scores. E.g. Google (
uses its patented PageRank TM and ranks the
importance of search results by examining theQuoting Charlie Morris (the former editor of The
links that lead to a specific site. The more linksWeb developer's journal) - "Search engines and
that lead to a site, the higher the site is ranked.directories survive, and indeed flourish, because
Pop on popularity!they're all we've got. If you want to use the
wealth of information that is the Web, you've got
to be able to find what you want, and search
engines and directories are the only way to do
Alta Vista, HotBot, Lycos, Infoseek and MSNthat. Getting good search results is a matter of
Search use keyword indexes - fast access tochance. Depending on what you're searching for,
millions of documents. The lack of an indexyou may get a meaty list of good resources, or
structure and poor accuracy of the size of theyou may get page after page of irrelevant drivel.
WWW, will not make searching any easier. LargeBy laboriously refining your search, and using
number of sites indexed. Keyword searching canseveral different search engines and directories
be difficult to get right.In reality, however, the(and especially by using appropriate specialty
prevalence of a certain keyword is not always indirectories), you can usually find what you need in
proportion to the relevance of a page. Take thisthe end."
example. A search on sari - the national costume
of India -in a popular search engine, returned
among it's top sites, the following links:
Search engines are very useful, no doubt. Right
? of the Scottish Crop research Institutefrom getting a quick view of a topic to finding
expert contact info...verily certain issues lie in their
? -a health resort in Indonesialap. Now the very reason we bother about these
search engines so much is because they're all
? - The South Asia Regional Initiative for Energywe've got! Though there sure is a lot of room for
Cooperation and Developmentimprovement, the hour's need is to not get
caught in the middle of the road. By simply
understanding what, how and where to seek,
you'd spare yourself the fate of chanting that old
Pretty useful sites for someone very muchJewish proverb "If God lived on earth, people
interested in knowing how to drape or thewould break his windows."
tradition of the sari?! (Well, no prayer goes
unanswered...whether you like the answer or not!)
By using keywords to determine how each page
will be ranked in search results and not simplyHappy searching!Liji is a PostGraduate in Software
counting the number of instances of a word on aScience, with a flair for writing on anything under
page, search engines are attempting to make thethe sun. She puts her dexterity to work, writing
rankings better by assigning more weight totechnical articles in her areas of interest which
things like titles, subheadings, and so on.Now,include Internet programming, web design and
unless you have a clear idea of what you'redevelopment, ecommerce and other related
looking for, it may be difficult or impossible to useissues.
a keyword search, especially if the vocabulary of