COMP249: Web Search

Steve Cassidy and Rolf Schwitter

The Web as Data Store

How does a Search Engine work?

Have you ever wondered ...

Search Engines

A search engine basically consists of:

Collecting and Storing the Data

Web Crawlers

Term Indexing

What Information to Keep?

Building the Index
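
A term index is commonly built as an inverted index: a map from each term to the pages that contain it. The sketch below is only an illustration under stated assumptions. The `pages` dictionary is a hypothetical toy collection standing in for crawled pages, and `tokenize` and `build_index` are names invented for this example; real engines also store positions, frequencies and other data.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(pages):
    """Map each term to the set of page ids that contain it."""
    index = defaultdict(set)
    for page_id, text in pages.items():
        for term in tokenize(text):
            index[term].add(page_id)
    return index


# Hypothetical toy collection (assumption, not from the slides).
pages = {
    "p1": "Beef sales in Chile rose last year",
    "p2": "Chile exports copper and wine",
    "p3": "Beef recipes for the weekend",
}

index = build_index(pages)
print(sorted(index["chile"]))
```

Looking up a term is then a single dictionary access, which is why the index is built once, ahead of query time.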

Relevance - Internal Factors

Factors based on a Web page's content:

Relevance - External Factors

Factors external to the page:

How Google PageRank Works
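
The basic idea behind PageRank can be sketched as a power iteration over the link graph: each page repeatedly shares its current score among the pages it links to. This is a simplified textbook version, not Google's production algorithm. The damping factor of 0.85, the iteration count and the three-page `links` graph are assumptions chosen for illustration.

```python
def pagerank(links, damping=0.85, iterations=50):
    """Simplified PageRank by power iteration.

    links maps each page to the list of pages it links to.
    Every link target must also appear as a key of links.
    """
    pages = list(links)
    n = len(pages)
    rank = {p: 1.0 / n for p in pages}
    for _ in range(iterations):
        # Every page receives a small baseline score.
        new_rank = {p: (1.0 - damping) / n for p in pages}
        for page, outlinks in links.items():
            if outlinks:
                # Split this page's score evenly over its out-links.
                share = damping * rank[page] / len(outlinks)
                for target in outlinks:
                    new_rank[target] += share
            else:
                # A page with no out-links spreads its score over all pages.
                for target in pages:
                    new_rank[target] += damping * rank[page] / n
        rank = new_rank
    return rank


# Hypothetical three-page web (assumption, not from the slides).
links = {"A": ["B", "C"], "B": ["C"], "C": ["A"]}
ranks = pagerank(links)
for page in sorted(ranks, key=ranks.get, reverse=True):
    print(page, round(ranks[page], 3))
```

In this toy graph page C is linked to by both A and B, so it ends up with a higher score than B, which has only one incoming link.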

Boolean Search

Boolean Search - Example

Example (cont'd)

Implicit Boolean Queries
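
Boolean queries map naturally onto set operations over an inverted index: AND is intersection, OR is union, NOT is set difference. The sketch below assumes a small hand-written `index` (hypothetical data) and treats a plain space-separated query as an implicit AND of its terms, which is one common convention rather than a rule every engine follows.

```python
# Hypothetical inverted index: term -> set of page ids (assumption).
index = {
    "beef": {"p1", "p3"},
    "chile": {"p1", "p2"},
    "sales": {"p1"},
    "recipes": {"p3"},
}


def postings(term):
    """Return the set of pages containing the term (empty if unknown)."""
    return index.get(term.lower(), set())


def implicit_and(query):
    """Treat a space-separated query as an AND of all its terms."""
    terms = query.split()
    if not terms:
        return set()
    result = postings(terms[0])
    for term in terms[1:]:
        result = result & postings(term)
    return result


beef_and_chile = postings("beef") & postings("chile")        # AND
beef_or_chile = postings("beef") | postings("chile")         # OR
beef_not_recipes = postings("beef") - postings("recipes")    # AND NOT

print(sorted(beef_and_chile))
print(sorted(beef_or_chile))
print(sorted(implicit_and("beef sales Chile")))
```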

Making use of Meta-Data

What are Meta Tags?

Description Tag

Keywords Tag

Robots Tag
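
A crawler that honours the robots meta tag has to read it from the page before deciding whether to index the page or follow its links. The following sketch uses Python's standard `html.parser` module. The class name `RobotsMetaParser` and the sample HTML are invented for this example, and only the `noindex` and `nofollow` directives are checked.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives of any <meta name="robots"> tag."""

    def __init__(self):
        super().__init__()
        self.directives = set()

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            for directive in content.split(","):
                directive = directive.strip().lower()
                if directive:
                    self.directives.add(directive)


# Hypothetical page source (assumption, not from the slides).
html = (
    '<html><head><meta name="robots" content="noindex, nofollow">'
    "</head><body>Private page</body></html>"
)

parser = RobotsMetaParser()
parser.feed(html)
may_index = "noindex" not in parser.directives
may_follow = "nofollow" not in parser.directives
print(may_index, may_follow)
```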

Alternatives to Search Engines

Directories

Hybrid Search Engines

Metasearch

Limitations of Search Engines

The "Invisible" Web

Recall

Precision
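
Recall and precision can both be computed from two sets: the pages a query retrieved and the pages that are actually relevant. Recall is the fraction of relevant pages that were retrieved; precision is the fraction of retrieved pages that are relevant. The page ids below are made-up values for illustration only.

```python
def precision(retrieved, relevant):
    """Fraction of retrieved pages that are relevant."""
    if not retrieved:
        return 0.0
    return len(retrieved & relevant) / len(retrieved)


def recall(retrieved, relevant):
    """Fraction of relevant pages that were retrieved."""
    if not relevant:
        return 0.0
    return len(retrieved & relevant) / len(relevant)


# Hypothetical result set and relevance judgements (assumption).
retrieved = {"p1", "p2", "p3", "p4"}
relevant = {"p1", "p3", "p5"}

p = precision(retrieved, relevant)
r = recall(retrieved, relevant)
print(f"precision={p:.2f} recall={r:.2f}")
```

Here two of the four retrieved pages are relevant (precision 0.5), and two of the three relevant pages were found (recall about 0.67). Broadening a query tends to raise recall at the cost of precision, and narrowing it tends to do the opposite.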

Example

To find web pages about sales of beef in Chile, try the following queries:

Future Developments

Concept Based Search

Natural Language-based Search

Web-based Question Answering