This repository is for the course Information Retrieval and Web Search Engines (CSCI-572), which I took at the University of Southern California. The course covered topics such as Crawling, Building an Inverted Index using a Hadoop Cluster, Google Query Formulation, the PageRank Algorithm, Map/Reduce, Query Processing, Inverting the Web using Solr, Rich Text Snippets, and Spell Correction. The course included five assignments, each kept in a separate folder in this repository.
If you are a student, please don't use this code: I have modified it, and you may be penalised for using it.
If you are a recruiter, please reach me at [email protected] for the working code. I am a Course Producer for this course and am not allowed to disclose my assignments.