Generalized Vector Space Model using Karl Pearson correlation coefficients
Updated Dec 5, 2016 - Python
A scalable, efficient search engine written in Python that queries a ~75 GB Wikipedia corpus with a response time of 1 s and returns the top 10 most relevant documents for a search query.
Web Retrieval and Mining, Spring 2020
Information Retrieval, Natural Language Processing, Machine Learning
IR and text mining project that calculates a candidate's job profile score from factors such as Education, Discipline, Required Skills, Desired Skills, and Years of Experience. It uses an inverted index to filter jobs and the vector space model to rank the documents (jobs).
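The filtering step mentioned above can be sketched roughly as follows. This is a minimal illustration of Boolean AND filtering over an inverted index, not the repository's actual code; the function names `build_inverted_index` and `filter_all`, the job IDs, and the skill tokens are all hypothetical.

```python
from collections import defaultdict


def build_inverted_index(docs):
    """Map each term to the set of document IDs that contain it."""
    index = defaultdict(set)
    for doc_id, tokens in docs.items():
        for token in tokens:
            index[token].add(doc_id)
    return index


def filter_all(index, required_terms):
    """Return IDs of documents containing every required term (Boolean AND)."""
    postings = [index.get(term, set()) for term in required_terms]
    if not postings:
        return set()
    return set.intersection(*postings)


# Hypothetical job postings, already tokenized into skill keywords.
jobs = {
    "job1": ["python", "sql", "ml"],
    "job2": ["java", "sql"],
    "job3": ["python", "ml", "nlp"],
}
index = build_inverted_index(jobs)
matches = filter_all(index, ["python", "ml"])
```

Only the jobs that survive this filter would then need to be scored and ranked, which keeps the more expensive ranking step small.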
Vector space model for information retrieval
Hello folks! Looking for a fully modular, open-source Pygame 3D engine concept? Well, this may be a good start. GridMod can visualise self-made three-dimensional shapes by grouping nodes, vectors and matrices; by applying common matrix operations to those vertices, it displays those predefined 3D objects, along with a scal…
Knowledge processing technologies: information retrieval and text classification
Information Retrieval System
Using Apache Lucene to index documents in the AP89 corpus, perform retrieval on TREC topics, and evaluate the retrieval algorithms with different evaluation metrics
Domain specific information retrieval system based on boolean retrieval and vector space models
Search engine based on the vector space model
Documents and queries are represented as vectors. Each dimension corresponds to a separate term. If a term occurs in the document, its value in the vector is non-zero. Several different ways of computing these values, also known as (term) weights, have been developed. One of the best known schemes is tf-idf weighting (see the example below). The…
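As a rough illustration of the weighting described above, here is a minimal sketch of tf-idf vectors ranked by cosine similarity. It assumes raw term counts for tf and `log(N / df)` for idf, which is only one of several common variants; the helper names (`tfidf_vectors`, `query_vector`, `cosine`) and the toy documents are made up for this example.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Return one sparse {term: weight} vector per tokenized document, plus df."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({t: tf[t] * math.log(n_docs / df[t]) for t in tf})
    return vectors, df


def query_vector(tokens, df, n_docs):
    """Weight a query with the collection's idf; unseen terms are dropped."""
    tf = Counter(t for t in tokens if t in df)
    return {t: tf[t] * math.log(n_docs / df[t]) for t in tf}


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


docs = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "vector space models rank documents".split(),
]
vectors, df = tfidf_vectors(docs)
q = query_vector("cat mat".split(), df, len(docs))
# Document indices ordered from most to least similar to the query.
ranking = sorted(range(len(docs)), key=lambda i: cosine(q, vectors[i]), reverse=True)
```

With this idf variant, a term that occurs in every document gets weight zero, so it contributes nothing to the ranking.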
Vector Space Model Experiments
Vector-Space Model (VSM) for Information Retrieval (IR) implemented for Assignment 1 in COL764 | Used d-gap encoding to store the index files efficiently (top 5% of the class)
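For readers unfamiliar with d-gap encoding, the idea can be sketched as follows: a sorted postings list is stored as the differences between consecutive document IDs, which are small numbers that compress well. The sketch pairs the gaps with variable-byte coding, which is an assumption made for illustration; the description above does not say which byte-level code the assignment used, and all function names are hypothetical.

```python
def to_gaps(doc_ids):
    """Convert a strictly increasing list of doc IDs into d-gaps."""
    gaps, prev = [], 0
    for doc_id in doc_ids:
        gaps.append(doc_id - prev)
        prev = doc_id
    return gaps


def from_gaps(gaps):
    """Rebuild the original doc IDs by taking a running sum of the gaps."""
    doc_ids, total = [], 0
    for gap in gaps:
        total += gap
        doc_ids.append(total)
    return doc_ids


def vbyte_encode(numbers):
    """Variable-byte encode non-negative ints, 7 payload bits per byte.

    The high bit is set only on the last byte of each number.
    """
    out = bytearray()
    for n in numbers:
        chunk = [n & 0x7F]
        n >>= 7
        while n:
            chunk.append(n & 0x7F)
            n >>= 7
        chunk.reverse()      # most significant 7-bit group first
        chunk[-1] |= 0x80    # mark the final byte of this number
        out.extend(chunk)
    return bytes(out)


def vbyte_decode(data):
    """Inverse of vbyte_encode."""
    numbers, n = [], 0
    for byte in data:
        if byte & 0x80:
            numbers.append((n << 7) | (byte & 0x7F))
            n = 0
        else:
            n = (n << 7) | byte
    return numbers


postings = [824, 829, 215406]
gaps = to_gaps(postings)
encoded = vbyte_encode(gaps)
decoded = from_gaps(vbyte_decode(encoded))
```

Here the three postings take 6 bytes instead of the 12 that fixed 32-bit integers would need; the saving grows as postings lists get denser and the gaps get smaller.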
Source code for my team's project in the Information Retrieval course. The project is a text summarizer application that uses the vector space model.
Implementation of a vector space-based information retrieval system.