13MCA553 Information Retrieval & Search Engines syllabus for MCA


Unit-1 INTRODUCTION 4 hours

Information Retrieval, Search Engines, Search Engineers.

Unit-2 ARCHITECTURE OF A SEARCH ENGINE 5 hours

Architecture, Basic Building Blocks, Text Acquisition, Text Transformation IndexCreation, User Interaction, Ranking and Evaluation

Unit-3 CRAWLS AND FEEDS 6 hours

Deciding what to search, Crawling the Web, Directory Crawling, Document Feeds,Conversion Problem, Storing the Documents, Detecting Duplicates, removes noise.

Unit-4 PROCESSING TEXT 8 hours

Text Statistics, Document Parsing, Document Structure and Markup, Link Analysis,Information Extraction, Internationalization

Unit-5 RANKING WITH INDEXES 6 hours

Abstract Model of Ranking, Inverted indexes, Compression, Entropy and Ambiguity, DeltaEncoding, Bit-aligned codes, Auxiliary Structures, Index Construction, Query Processing.

Unit-6 QUERIES AND INTERFACES 5 hours

Information Needs and Queries ,Query Transformation and Refinement , Showing theResults Cross-Language Search.

Unit-7 RETRIEVAL MODELS 12 hours

Overview of Retrieval Models , Boolean Retrieval , The Vector Space Model, ProbabilisticModels, Information Retrieval as Classification, BM25 Ranking Algorithm, ComplexQueries and Combining Evidence, Web Search, Machine Learning and InformationRetrieval ,.

Unit-8 EVALUATING SEARCH ENGINES 6 hours

The Evaluation Corpus , Logging , Effectiveness Metrics, Recall and Precision Averaging andInterpolation , Efficiency Metrics, Training, Testing, and Statistics

Last Updated: Tuesday, January 24, 2023