Performs tokenization, stemming, lemmatization, index creation, index compression and ranked retrieval of Cranfield documents
python information-retrieval nltk tf-idf tokenization information-retrieval-engine stemming okapi lemmatization porter-stemmer delta-encoding boolean-model wordnetlemmatizer cranfield-collection text-preprocessing ranked-retrieval relevant-documents document-vector gamma-encoding index-compression
-
Updated
May 10, 2020 - Python