|This material has been prepared to accompany the book "Searching and Researching on the Internet and the World Wide Web, Third Edition" (ISBN 1887902716) by Ernest Ackermann and Karen Hartman, and published by Franklin, Beedle and Associates, Incorporated, Wilsonville OR, ©2002. No part of this may be reproduced, stored in a retrieval system, or transcribed without permission of the publisher.|
|Searching and Researching on the Internet and the Web Glossary|
all-in-one search tool A tool that provides search forms for several search engines and directories all in one site. The tool also provides hyperlinks which allow you to go to the services directly.
annotated directories Often referred to as virtual libraries, these directories have brief summaries, descriptions, ratings, or reviews attached to Web pages and subject guides.
annotation A brief summary or description of a Web page or of any work listed in a database.
anonymous FTP A means of using FTP to make files readily available to the public. If you start an FTP session with a remote host, you give the log in or user name anonymous and enter your email address as a password. When you use a URL starting with "ftp://" and a domain name with a Web browser, an anonymous FTP session begins, and you don’t have to enter a user name or password.
article A message or file that is part of a Usenet newsgroup.
attachment A file that is sent as part of an email message but that is not part of the main message. Usually images, programs, or word processor files are sent as attachments, because most email programs allow only plain text in the body of a message.
bibliographic database An online database that lists sources of information and describes the information but that doesn’t include the text of the information itself. Some bibliographic databases are annotated, which means that there is a brief summary of each work listed.
bookmark A hyperlink that is saved in the bookmark list, a file in your browser. You can use bookmarks to keep track of favorite or important sites and to return there whenever you are on the World Wide Web.
bookmark list A list of hyperlinks to items on the World Wide Web.
Boolean searching Searching that uses Boolean operators (AND, OR, and NOT) in the search expression. Especially helpful in multifaceted or specific topics, Boolean operators help expand or narrow the scope of your search. A search for rivers OR lakes returns all documents with both words or either word in them. A search for rivers AND lakes returns documents with both words in them. A search for rivers AND lakes NOT swamps returns only documents that mention both rivers and lakes but omits those that also mention swamps.
browsing The process of clicking from one hyperlink to another on the World Wide Web. You can browse indiskriminately, or you can do structured browsing, using a hierarchical subject list in a directory.
case sensitivity The ability of a search tool to distinguish between uppercase and lowercase letters. Some search tools aren’t case sensitive; no matter what you type, the tool only picks up lowercase matches. Search engines that are case sensitive will strictly follow the search request; they’ll return documents containing the words in the case in which they were entered in the search expression.
client/server The interaction between a system that requests information (the client) and another system that provides it (the server). The browser is the client, and a computer at the site that provides the information is the server.
commercial database A database that requires you to pay a subscription cost before accessing it. See also proprietary database.
compressed file A file that has been processed by a program that applies an algorithm or scheme to compress or shrink a file. A compressed file must first be uncompressed or transformed before it can be read, displayed, or used. Files available through anonymous FTP are often stored in compressed form.
concept searching A feature enabling a search engine to find synonyms in its database. When you type in a word or phrase, the engine automatically searches for the word or phrase you want, plus words or phrases that may mean the same thing. For example, if the word teenage is in your search expression, the search engine would also look for the word adolescent.
content area The part of a Web browser window that contains the current Web page, including images, text, or hyperlinks.
copyright The right to copy or duplicate material such as images, music, and written works. Only the owners of the information can grant this right. Regardless of whether information on the Internet or a Web page is accompanied by a statement asserting copyright, it is still protected by the copyright laws of the United States, the Universal Copyright Convention, or the Berne Union.
cross-posting Posting an article to more than one Usenet newsgroup.
default setting The configuration a search engine uses unless you override the setting by specifying another configuration. For example, in most search engines, the Boolean operator OR is the assumed relationship between two words unless you type AND between the words.
delimited format A format often used to store tables of data. The data fields are separated by commas, tabs, semicolons, or some other delimiter. Spreadsheet programs usually include the facilities to import data that is in delimited format.
Dewey decimal classification system Originated by Melvil Dewey in the late nineteenth century, this classification system for library materials divides all knowledge into ten different classes, which are then subdivided into several sets of subclasses. Within these subclasses, decimals reflect still smaller subdivisions. The Dewey decimal classification system is most prevalent today in public libraries, whereas most other types of libraries, including academic ones, use the Library of Congress classification system.
directory Topical lists of Internet resources, arranged hierarchically. Directories are meant to be browsed, but they can also be searched. Directories differ from search engines in one major way—the human element involved in collecting and updating the information.
discussion group A group that discusses a single topic via email messages. An individual subscribes to or joins a discussion group electronically, and all messages sent to the group are distributed to the members by email.
domain name The Internet name for a network or computer system. The name consists of a sequence of characters separated by periods, such as www.mwc.edu. The domain name is often the first part of a URL that follows "://". For example, the domain name in the URL http://www.ckp.edu/technical/reference/wsftp.html is www.ckp.edu.
downloading Transferring or copying a file from another computer (the remote computer) to the computer you’re using (the local computer). This term is often applied to the process of retrieving a file from a software library or FTP archive.
duplicate detection An output feature of some search engines and meta-search tools that automatically filters out of your search results any URLs that are duplicated elsewhere in the results.
fair use A provision in most copyright conventions or statutes that makes it possible for individuals to copy portions of a document or other piece of work for short-term use.
FAQ (frequently asked questions) A list of commonly asked questions and answers on a specific topic. A FAQ is often associated with Usenet newsgroups, but several search tools also include a FAQ file. This, and online help, is usually the first place you should look to find answers.
favorite A hyperlink that is saved in the favorites list. You can use favorites to keep track of useful or important sites and to return there whenever you are using your browser. See bookmark.
favorites list The name the Internet Explorer browser gives to the collection of favorites. The browser includes menu bar and toolbar links to the favorites list.
field Part of a Web page or bibliographic record that is designated for a particular kind of data or text.
field searching A strategy in which you limit a search to a particular field. In a search engine, you might search only the URL field. In a library catalog, you could search for items by author, title, subject, call number, or any other data element that was designated as a field. By narrowing the scope of searchable items, field searching helps to eliminate the chance of retrieving irrelevant information.
frames A browser feature that displays several Web pages in the browser’s content area. Each frame has its own URL and controls.
freeware A software program that’s available for use without any charge. This doesn’t mean the program isn’t copyrighted. Usually, the originator retains the copyright. Anyone can use it, but the program can’t be legally sold or distributed without permission.
FTP (File Transfer Protocol) A means of transferring or sharing files across the Internet from one computer system to another.
FTP archive A collection of files available through anonymous FTP.
full-text database A database that contains the full text of the information it describes.
full-text indexing A search engine feature in which every word, significant or insignificant, is indexed and retrievable through a search.
Gopher A menu-oriented system that gives access to documents, files, and other Internet services, regardless of where they are on the Internet. The software for Gopher was created and developed at the University of Minnesota to allow users to browse and retrieve documents in a campus environment.
group address The address to use to send email to each member of a discussion group, interest group, listserv list, or mailing list.
GUI (graphical user interface) An interface that uses icons and images in addition to text to represent information, input, and output.
helper applications Programs used with a Web browser to display, view, or work with files that the browser cannot display. For example, Netscape Navigator can display graphic or image files in GIF or JPEG format. If you accessed an image of another type through a hyperlink, then your browser would need a helper application to display it. As another example, some Web browsers can work with several protocols but not with Telnet. If you activate a hyperlink that begins a Telnet session, a Telnet client that is separate from the Web browser must be used. Th!e Web browser can recognize when it’s necessary to use specific helper applications.
hierarchy A list of subjects in a directory. The subjects are organized in successive ranks with the broadest listed first and with more specific aspects or subdivisions listed below.
high precision/high recall A phenomenon that occurs during a search when you retrieve all the relevant documents in the database and retrieve no unwanted ones.
high precision/low recall A phenomenon that occurs when a search yields a small set of hits. Although each one may be very relevant to the search topic, some relevant documents will be missed.
history list A list of Internet sites, services, and resources that have been accessed through a Web browser during recent sessions.
hit list A list of results obtained from an online search.
home page The first screen or page of a site accessible through a Web browser.
HTML (Hypertext Markup Language) The format used for writing documents to be viewed with a Web browser. Items in the document can be text; images; sounds; or links to other HTML documents, sites, services, and resources on the Web.
HTTP (Hypertext Transfer Protocol) The standard protocol that World Wide Web servers and clients use to communicate.
hyperlink Words, phrases, images, or regions of an image that are often highlighted or colored differently and that can be selected as part of a WWW page. Each hyperlink represents another Web page; a location in the current Web page; an image, audio, video, or multimedia file; or some other resource on the World Wide Web. When the hyperlink is selected, it activates the resource that it represents.
hypermedia An extension to hypertext that includes graphics and audio.
hypertext A way of viewing or working with a document in text format that allows you to follow cross-references to other Web resources. By clicking on an embedded hyperlink, the user can choose her own path through the hypertext material.
implied Boolean operators Using the characters + and - to require or prohibit a word or phrase as part of a search expression. The + acts somewhat as AND, and the - acts as NOT would in a Boolean expression. For example, the Boolean expression rivers AND lakes NOT swamps may be expressed as +rivers +lakes -swamps.
interest group Group discussion and sharing of information about a single topic carried out via email.
Internet The collection of networks throughout the world that agree to communicate using specific telecommunication protocols, the most basic being the Internet Protocol (IP) and transmission control protocol (TCP), and the services supplied by those networks.
keyword indexing A feature in which each significant word in the entire document or record is indexed and retrievable by the search engine or computer program being used.
keyword searching A feature in which the search engine or computer program searches for every occurrence of a particular word in the database, regardless of where it may appear.
LCSH (Library of Congress subject headings) A list of standardized subject headings that are used to index materials by the Library of Congress. The subject headings are arranged in alphabetical order by the broadest headings, with more precise headings listed under them. Most academic library catalogs are searchable by subject heading as well as by keyword.
Library of Congress classification system Designed originally for the Library of Congress in the late nineteenth century, this classification system is used by most academic and special libraries throughout the United States and in many parts of the world. It consists of twenty-one classes, each designated by a letter of the alphabet. Subdivisions are created by the use of other letters and numbers.
limiting by date A search tool feature that allows you to limit search results to pages that were indexed after, before, or within certain dates.
list address See group address.
Listserv The type of software used to manage a listserv list.
listserv list A type of discussion group, interest group, or mailing list.
location field The pane on the browser window that holds the current document’s URL.
low precision/high recall A phenomenon that occurs during a search when you retrieve a large set of results, including many unwanted documents.
lurking Reading the email or articles in a discussion group or newsgroup without contributing or posting messages.
mailing list See discussion group.
menu bar The sequence of pulldown menus across the top of the Web browser window. All browser commands are embedded in the menu bar.
meta-search tools Tools that allow you to search either more than one search engine or directory simultaneously or a list of search tools that can be accessed from that site. These two major types of meta-search tools are called parallel search tools and all-in-one search tools.
meta-tags Keywords inserted in the meta-tag portion of the HTML source document by the Web page author. If Web pages don’t have much text, meta-tags help them come up in a keyword search.
moderator A person who manages or administers a discussion group, interest group, listserv list, mailing list, or Usenet newsgroup. In most cases, the moderator is a volunteer. Messages sent to the group are first read by the moderator, who then passes appropriate messages to the group.
modifying search results Changing an initial search expression to obtain more relevant results. This can involve narrowing the results by field, limiting by date, adding keywords, subtracting keywords, and so forth.
nested Boolean logic The use of parentheses in Boolean search expressions. For example, the nested expression ((rivers OR lakes) AND canoeing) NOT camping will first find resources that contain either the words rivers or lakes and the term canoeing but not resources that contain the term camping.
newsgroup A collection of Usenet articles arranged by topic. Some are specialized or technical groups (such as comp.protocols.tcp-ip.domains—topics related to Internet domain style names), some deal with recreational activities (such as rec.outdoors.fishing.saltwater—topics related to saltwater fishing), and one, news.newusers.questions, is dedicated to questions from new Usenet users.
newsreader The software you use to read, reply to, and manage Usenet news.
NNTP (Network News Transport Protocol) The standard protocol used to distribute Usenet news between computer systems on the Internet in a form that machines can read and computers can access.
OPAC (Online Public Access Catalog) An electronic catalog of a library’s holdings, usually searchable by author, title, subject, keyword, and call number. Thousands of OPACs from libraries all over the world are available on the World Wide Web.
parallel search tools A search tool or service that takes one search expression, submits it to several search services, and returns selected results from each. This is an example of a meta-search tool.
phrase searching The search feature supported by most search engines that allows you to search for words that usually appear next to each other. It is possibly the most important search feature.
plug-in An application software that’s used along with a Web browser to view or display certain types of files as part of a Web page. Shockwave from MacroMedia is a plug-in that allows the browser to display interactive multimedia.
PPP (Point-to-Point Protocol) A standard protocol that allows a computer with a modem to communicate using TCP/IP.
proprietary database A privately owned database that isn’t available to the public and that is usually password protected. Universities, businesses, and research institutes are the institutions most likely to subscribe to this kind of database and make it available to their employees and students.
protocol A set of rules for exchanging information between networks or computer systems. The rules specify the format and the content of the information, and the procedures to follow during the exchange.
proximity searching A search feature that makes it possible to search for words that are near each other in a document.
reference work A resource used to find quick answers to questions. Traditionally thought of as being in the form of books (such as dictionaries, encyclopedias, quotation directories, manuals, guides, atlases, bibliographies and indexes), a reference source on the World Wide Web closely resembles its print counterpart. A reference work doesn’t necessarily contain hyperlinks to other resources, although it will often have hyperlinks within the document itself.
relevance A measure of how closely a database entry matches a search request. Most search tools on the Web return results based on relevance. The specific algorithm for computing relevance varies from one service to another, but it’s often based on the number of times terms in the search expression appear in the document and whether they appear in the appropriate fields.
relevancy ranking A ranking of items retrieved from a database. The ranking is based on the relevancy score that a search engine has assigned.
results per page A feature of some search engines that allows you to designate the number of results listed per page. Search engines usually list ten results per page.
robot See spider.
scroll bar The rectangular area on the right side of the window that allows you to move up or down in the open document. You move by clicking and dragging it or clicking on the arrow at the bottom of the bar.
search engine A collection of programs that gather information from the Web (see spider), index it, and put it in a database so it can be searched. The search engine takes the keywords or phrases you enter, searches the database for words that match the search expression, and returns them to you. The results are hyperlinks to sources that have descriptions, titles, or contents matching the search expression.
search expression The keywords and syntax that you enter in a search form. With this expression, you ask a search tool to seek relevant documents in a particular way.
search form The rectangular pane or oblong box that appears on the home pages of most search engines and directories. In this space, you enter a search expression.
shareware Software that you are allowed to download and try for a specified period free of charge. If you continue to use the program after that time, you are expected to pay a usually modest fee to continue using the product legally.
SLIP (Serial Line Internet Protocol) Software allowing the use of the Internet Protocol (IP) over a serial line or through a serial port. Commonly used with a modem connection to a service providing Internet services.
sorting An option in some search engines that allows you to determine how you’d like your search results listed—by URL, relevance, domain or location, date, and so on.
spam Unwanted and unsolicited email. The electronic equivalent of paper junk mail.
special collection A collection of material, usually a separate collection of a large library, that focuses on a particular topic and covers it in great depth.
special library A library that focuses on the interests inherent in the institution it serves. Libraries in hospitals, corporations, associations, museums, and other types of institutions are all special libraries. In many cases, they are not open to the public. A special library’s collection may be narrow in scope, but it will have depth within the specialty it covers.
specialized database A self-contained index that is searchable and available on the Web. Items in specialized databases are often not accessible through a keyword search in a search engine.
spider A computer program that travels the Internet to locate such resources as Web documents, FTP archives, and Gopher documents. It indexes the documents in a database, which is then searched using a search engine (such as AltaVista or Excite). A spider can also be referred to as a robot or wanderer. Each search engine uses a spider to build its database.
status bar The bar or rectangular region at the bottom of the browser window that displays information regarding the transfer of a Web document to the browser. When the mouse moves over a hyperlink, the status bar shows the hyperlink’s URL. When a Web page is requested, the status bar gives information about contacting and receiving information from a server. During transmission, the status bar displays a percentage that reflects how much of the document has been transferred. The status bar also indicates whether transmissions are occurring in a secure manner.
stop words Words that an indexing program doesn’t index. Stop words usually include articles (a, an, the) and other common words.
structured browsing The process of moving through a hierarchically arranged Web directory from a broad subject heading to a more specific one until you find the desired information.
subcategory A subject category that is more narrowly focused than the broader subject category above it in a hierarchy.
subject catalog See directory.
subject category A division in a hierarchical subject classification system in a Web directory. You click on the subject category that is likely to contain either the Web pages you want or other subject categories that are more specific.
subject guide A collection of URLs on a particular topic. Most easily found listed in virtual libraries, they are also referred to as meta-pages.
subscribe To join a discussion group, interest group, listserv list, or mailing list. You use this term when writing commands to join such a group and to list a Usenet newsgroup on your newsreader.
syntax The rules governing the construction of search expressions in search engines and directories.
TCP/IP (transmission control protocol/Internet protocol) A collection of protocols used to provide the basis for Internet and World Wide Web services.
Telnet Allows for remote login capabilities on the Internet. One of the three basic Internet services, Telnet allows you to be on one computer and to access and log in to another.
thread A collection of articles that all deal with a single posting or email message.
toolbar The sequence of icons below the menu bar. Clicking on a toolbar icon executes a command or causes an action.
top-level category One of several main subjects in the top of a hierarchy in a directory’s list of subjects.
truncation The phenomenon in which you cut off the end of a word when creating a search expression. When given such a request, a search engine will look for all possible endings to the word, in addition to the root word itself.
unified search interfaces Meta-search tools that allow you to use several search engines simultaneously.
unsubscribe To leave, sign off from, or quit a discussion group, interest group, listserv list, or mailing list. You use the term when writing commands to end a relationship with a discussion group or to remove a Usenet newsgroup from the list of those you would regularly read.
uploading Transferring a file from a local computer (the one you’re using) to another, remote computer.
URL (Uniform Resource Locator) A way of describing the location of an item (document, service, or resource) on the Internet and specifying the means by which to access that item.
Usenet news A system for exchanging messages called articles arranged according to specific categories called newsgroups. The articles are passed from one system to another, not as email between individuals.
vendor A company that markets online catalog systems to libraries.
virtual libraries Directories that contain collections of resources that librarians or cybrarians have carefully chosen and organized in a logical way.
Web browser A program used to access the Internet services and resources available through the World Wide Web.
Web Page The information available and displayed by a Web browser as the result of opening a local file or opening a location (URL). The contents and format of the Web page are specified using HTML.
white pages service A search service available on the World Wide Web that finds the email or street address of an individual, business, or government agency.
wildcard A character that stands in for another character or group of characters. Most search tools use an asterisk for this function. Although the wildcard is most often used in truncation, it can also be used in the middle of words (for example, wom*n).
WWW (World Wide Web) The collection of different services and resources available on the Internet.
|Searching and Researching on the World Wide Web|
This is a Webliminal.com Production. Copyright 2002
Please send comments/questions to firstname.lastname@example.org
FROM the fortune list ...
Hide not your talents, for use they were made; what's a sun-dial in the shade? - Franklin