130 likes | 143 Views
Our project, Sarvagya, is a question answering system that uses the Probabilistic Latent Semantic Analysis (PLSA) algorithm to infer the most probable answer from a given set of documents. The system aims to save users' time by quickly providing relevant answers to their queries. Our approach involves extracting important concepts from the query, processing the query, and searching documents from Wikipedia based on these concepts. We have tested Sarvagya on various questions and obtained accurate answers. Possible improvements include refining the concept extraction technique, incorporating data from additional sources, and utilizing an inbuilt knowledge database to enhance answer retrieval speed.
E N D
Knowledge based Question Answering System Anurag Gautam Harshit Maheshwari
Introduction In order for the computers to interact with the users more naturally the computer must understand and automatically infer what the user wants to say from what he actually says . Our project is an attempt to achieve the same. We call the program : Sarvagya.
Our project is a question answering system based on statistical learning approach. We have used Probabilistic Latent Semantic Analysis (PLSA) algorithm to guess the most probable answer from the given set of documents
Usefulness As a user, you will probably not want to read the whole document to search for small answers. Time is a big factor in today’s life. You will no doubt want to save it. Document and knowledge management increase with QAS solution.
Our Approach Extraction of important concepts from the query. Query Processing …. Based on the concepts we search for the documents from wikipedia.
Formation of Term – Document Matrix (TD matrix) Tokenize the documents into sentences Pass the T-D matrix to PLSA Calculating cosine similarity among the document vectors returned by PLSA
PLSA – what it does ? T-D Matrix PLSA Document Clusters according to concepts
Some Results… We tested Sarvagya on various questions and got really close answers, some of them are shown:
Scope of Improvements • Technique of extraction of concept from query can be improved. • Data can be taken from some other sources also. • We can use inbuilt knowledge database which will speed up the answer retreival.