Lesson 3 | Information retrieval services |
Objective | Describe the different categories of information retrieval services. |
Information Retrieval Services
Automated Information Retrieval (IR)
Automated information retrieval (IR) systems were originally developed to help manage the huge scientific literature that has developed since the 1940s. Many university and public libraries now use IR systems to provide access to books, journals, and other documents. Commercial IR systems offer databases containing millions of documents in myriad subject areas. Dictionary and encyclopedia databases are now widely available for PCs. Information Retrieval has been found useful in such disparate areas as office automation and software engineering. Indeed, any discipline that relies on documents to do its work could potentially use and benefit from IR.
An IR system matches user queries to documents stored in a database. A document is a data object, usually textual, though it may also contain other types of data such as photographs, graphs. Often, the documents themselves are not stored directly in the IR system, but are represented in the system by document
surrogates. This web page is a document and could be stored in its entirety in an IR database. One might instead, however, choose to create a document surrogate for it consisting of the title, author, and abstract. This is typically done for efficiency to reduce the size of the database and searching time.
An IR system must support certain basic operations. There must be a way to enter documents into a database,
change the documents, and delete them. There must also be some way to search for documents, and present them to
a user. IR systems vary greatly in the ways they accomplish these tasks.
- Tour Search Engines
- Tour Subject pages and Link pages
Search Engine Marketing
Meta Search Engines
SEM strategy
Using PPC in parallel with SEO can be helpful. The benefits are multifold, especially if the site in question is brand new.
PPC can provide accurate forecasts for targeted keywords. For example, within the Google AdWords platform you can target the same
keywords in your ads that you are currently targeting on specific pages. You can then accurately forecast how your pages will convert for the same keywords once you start getting the equivalent SEO traffic.
You will have the opportunity to perform searches with metasearch engines in the next module. For now, take a quick look at the main page (or "home page") of startpage listed above. Note which, if any, allow you to choose the group of search engines and/or directories that will be searched before you begin the search. Also note other options that you can set to control your search. Clicking on any of these links will open the Web site in a separate browser window, so you can switch between this lesson and the Web site.
- Meta Search: Meta search engines are search engines that aggregate results from multiple search engines and present them to the user.
The best-known meta-search engine is Dogpile.com. However, its search volume is quite small, and these do not factor into SEO strategies.
- More specialized Vertical Search Engines: Vertical search can also come from third parties. Here are some examples:
- Comparison shopping engines, such as PriceGrabber, Shopzilla, and NexTag
- Travel search engines, such as Expedia, Travelocity, Kayak, and Utake
- Real estate search engines, such as Trulia and Zillow
- People search engines, such as Spock and Wink
- Job search engines, such as Indeed, CareerBuilder, and SimplyHired
- Music search engines, such as iTunes Music Store
- B2B search engines, such as Business.com, KnowledgeStorm, Kellysearch, and ThomasNet
The terms search engine and engine to refer to any service that allows you to compose your own search query. Any service that provides a compiled directory or allows you to perform searches is called a search service or information retrieval service. As you now are aware, very often the question you will ask before beginning a search is not "Where do I find a search site?" but, rather, "Which one of all these services do I start with?" This is not a trivial question.
A directory, with its smaller number of hand-selected sites, may be more immediately useful than a search engine if you are searching for beginning-level information on a popular topic. A search engine, by its continual automated Web-roaming, may be more useful if you are looking for very specific information or an obscure topic.
Information Retrieval Models
Modeling in Information Retrieval is a complex process aimed at producing a ranking function A
ranking function is a function that assigns scores to documents with regard to a given query. This process consists of two main tasks:
- The conception of a logical framework for representing documents and queries
- The definition of a ranking function that allows quantifying the similarities among documents and queries
- Modeling and Ranking: Information Retrieval systems usually adopt index terms to index and retrieve documents
- In a restricted sense: it is a keyword that has some meaning on its own; usually plays the role of a noun
- In a more general form: it is any word that appears in a document
- Retrieval based on index terms can be implemented efficiently
- Also, index terms are simple to refer to in a query
- Simplicity is important because it reduces the effort of query formulation
Retrieval Service Categories
[1] Search engine verticals: Examples of different verticals include local search, image search, video search, product search and realtime search for news.