Skip to content
This repository has been archived by the owner on May 20, 2021. It is now read-only.

A university-related project, Build an Inverted Index search according to the documentation specifications

License

Notifications You must be signed in to change notification settings

rahnemoon/Information-Retrieval-university-project

Repository files navigation

Information-Retrieval-university-project

A university-related project, Build an Inverted Index search according to the documentation specifications

Implemented sections

  • normal search
  • wildcard search
  • spell checking

Project description

This project uses some of easiest NLP techniques for preprocessing level. First of all, read all data-set files and parsing them to the JSON file according to a year that comment left. Next use some NLP technique in preprocessing JSON files. Finally, make an inverted index from preprocessed files. An inverted index read and used for normal search, wildcard search, and spell checking.

NOTICE:This project use some of easiest NLP techniques for preprocessing level like: lemmatizing, stemming, word position detection from NLTK

How to use

First make Virtualenv with python 2.7, next run following command

pip install -r requirment.txt
 python search.py -h

Data-set description

OpinRank Dataset

LICENSE

MIT

About

A university-related project, Build an Inverted Index search according to the documentation specifications

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages