Indexing the WWW
The internet has been around for a long time, the Web is relatively new. However
since the birth of the Web even the average person has become interested in it.
Given the recent interest in it, the Web is starting to become a household thing.
But it the huge amount of information on the Web is to be on any use, it must
be easily accessed. This requires a good indexing technique, but there are many
- There is too much information. The WWW is a interconnected system of
sites and their pages - over 7 million at last count -- that you access with
a browser like Netscape Navigator.
- All this information, in the sites and pages, is utterly unorganised.
There's no central catalogue of its resources, no single place you can go to
find what you need.
In order to make an effective index to any big resource, we must first know
what we are indexing and how the information is being used. On the Internet
there are many types of index. Each one has it's intended use and is designed
for that purpose. The three main types of index are:
- The DNS/URL index, as a locational index to the
wealth of machines, users and resources on the internet.
- USEnet as a catagorisation of short lived,
text based messages.
- Search Engines provides a query system for users
to find sites/pages based on certain criteria.
Return to Project Title Page