Knowledge based Question Answering System

Knowledge based Question Answering System Anurag Gautam, Harshit Maheshwari

Introduction For computers to interact with users more naturally, a computer must understand and automatically infer what the user means from what he or she actually says. Our project is an attempt to achieve this. We call the program Sarvagya.

Our project is a question answering system based on a statistical learning approach. We use the Probabilistic Latent Semantic Analysis (PLSA) algorithm to guess the most probable answer from a given set of documents.

Usefulness As a user, you will probably not want to read a whole document to find a short answer. Time is a big factor in today's life, and you will want to save it. A question answering system (QAS) also makes document and knowledge management more effective.

Our Approach Extraction of important concepts from the query. Query Processing …. Based on these concepts, we search Wikipedia for relevant documents.
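The slides do not say how concepts are extracted from the query. As a rough illustration only, the sketch below assumes the simplest possible stand-in: lowercase the query, drop common stop words, and keep the remaining terms as "concepts". The function name `extract_concepts` and the `STOP_WORDS` list are hypothetical and are not taken from Sarvagya.

```python
import re

# Hypothetical stop-word list (an assumption, not Sarvagya's actual list).
STOP_WORDS = {
    "a", "an", "the", "is", "are", "was", "were", "of", "in", "on", "to",
    "for", "and", "or", "by", "did", "do", "does",
    "who", "what", "when", "where", "which", "why", "how",
}

def extract_concepts(query):
    """Return the non-stop-word terms of the query, in order, without repeats."""
    tokens = re.findall(r"[a-z0-9]+", query.lower())
    concepts = []
    for token in tokens:
        if token not in STOP_WORDS and token not in concepts:
            concepts.append(token)
    return concepts

concepts = extract_concepts("Who is the author of the Ramayana?")
```

The resulting terms could then be used as search keywords for document retrieval.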

Formation of Term-Document Matrix (T-D matrix) • Tokenize the documents into sentences. • Pass the T-D matrix to PLSA. • Calculate the cosine similarity among the document vectors returned by PLSA.
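The matrix construction and the similarity measure named above can be sketched as follows. This is a minimal illustration, assuming each sentence is treated as one "document" (one column), that terms are lowercase alphanumeric tokens, and that raw counts are used as weights. The helper names are hypothetical. For brevity, the cosine similarity is computed here on raw count columns, whereas the slides apply it to the vectors returned by PLSA.

```python
import math
import re
from collections import Counter

def split_sentences(text):
    """Split text into sentences at '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

def term_document_matrix(units):
    """Return (vocab, matrix) where matrix[i][j] counts term i in unit j."""
    counts = [Counter(re.findall(r"[a-z0-9]+", u.lower())) for u in units]
    vocab = sorted(set().union(*counts))
    matrix = [[c[term] for c in counts] for term in vocab]
    return vocab, matrix

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

text = "PLSA finds latent topics. Topics group related terms. Cats sleep a lot."
sentences = split_sentences(text)
vocab, matrix = term_document_matrix(sentences)

# Each column of the T-D matrix is the vector of one sentence.
columns = list(zip(*matrix))
sim_01 = cosine_similarity(columns[0], columns[1])  # share the term "topics"
sim_02 = cosine_similarity(columns[0], columns[2])  # share no terms
```

Sentences that share terms get a positive similarity, while sentences with no terms in common get zero.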

PLSA: what does it do? T-D matrix → PLSA → document clusters according to concepts
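To make the step from T-D matrix to concept clusters concrete, here is a small pure-Python sketch of PLSA fitted with the standard EM updates. It is not the project's code. It assumes the asymmetric formulation P(w|d) = Σ_z P(w|z) P(z|d), random initialization, and a fixed number of iterations. The function name `plsa` and its parameters are hypothetical.

```python
import random

def plsa(td, n_topics, n_iter=50, seed=0):
    """Fit PLSA by EM on a term-document count matrix td[w][d].

    Returns (p_w_z, p_z_d) with p_w_z[z][w] = P(w|z) and p_z_d[d][z] = P(z|d).
    """
    rng = random.Random(seed)
    n_words, n_docs = len(td), len(td[0])

    def normalize(row):
        total = sum(row)
        return [x / total for x in row] if total else [1.0 / len(row)] * len(row)

    # Random starting distributions.
    p_w_z = [normalize([rng.random() for _ in range(n_words)]) for _ in range(n_topics)]
    p_z_d = [normalize([rng.random() for _ in range(n_topics)]) for _ in range(n_docs)]

    for _ in range(n_iter):
        new_w_z = [[0.0] * n_words for _ in range(n_topics)]
        new_z_d = [[0.0] * n_topics for _ in range(n_docs)]
        for d in range(n_docs):
            for w in range(n_words):
                n = td[w][d]
                if n == 0:
                    continue
                # E-step: P(z | d, w) is proportional to P(w|z) * P(z|d).
                post = normalize([p_w_z[z][w] * p_z_d[d][z] for z in range(n_topics)])
                # Accumulate expected counts for the M-step.
                for z in range(n_topics):
                    new_w_z[z][w] += n * post[z]
                    new_z_d[d][z] += n * post[z]
        # M-step: renormalize expected counts into probabilities.
        p_w_z = [normalize(row) for row in new_w_z]
        p_z_d = [normalize(row) for row in new_z_d]
    return p_w_z, p_z_d

# Toy matrix: documents 0-1 use only terms 0-1, documents 2-3 use only terms 2-3.
td = [[3, 2, 0, 0],
      [2, 3, 0, 0],
      [0, 0, 3, 2],
      [0, 0, 2, 3]]
p_w_z, p_z_d = plsa(td, n_topics=2)

# Cluster each document by its most probable latent concept.
clusters = [max(range(2), key=lambda z: row[z]) for row in p_z_d]
```

Each row of `p_z_d` is a document vector in concept space. Vectors of this kind are what a cosine similarity would compare when ranking candidate sentences against a query.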

Some Results… We tested Sarvagya on various questions and got answers close to the expected ones. Some of them are shown:

Scope of Improvements • The technique for extracting concepts from the query can be improved. • Data can also be taken from other sources. • We can use a built-in knowledge database, which will speed up answer retrieval.

Thank you
