140 likes | 156 Views
Explore the use of Web mining techniques to extract data from Web documents with BSDS visualization for effective resource discovery. Learn about Web Content Mining, Structure Mining, and Usage Mining to enhance search efficiency and relevance.
E N D
Facilitating Efficient Web Search Using BSDS for Visualization of ResultsS.Prema
Web mining is the use of data mining techniques to automatically discover and extract information from Web documents/services. • The Web—an immense and dynamic collection of pages that includes countless hyperlinks and huge volumes of access and usage information. Web Mining
Web Content Mining • Web Structure Mining • Web Usage Mining Categories of Web Mining
Web page complexity far exceeds the complexity of any traditional text document collection. • The Web constitutes a highly dynamic information source. • The Web serves a broad spectrum of user communities. • Only a small portion of the Web pages contain truly relevant or useful information. Several challenges in effective resource and knowledge discovery
The only way to collect URLs is to scan collected pages for hyperlinks to other pages that have not been collected yet. • They start from a given set of URLs, progressively fetch and scan them for new URLs (out links), and then fetch these pages in turn. Basic principle of crawlers
In the current scenario, web page result personalization is playing a vital role. • The entire meta search engines are competing with each other • to provide relevant and efficient content in response to his or her queries. Research Domain
The user is often confronted by a great number of results, generally displayed in a list. • It is really a challenging task to satisfy an end-user’s request who is not aware of the search. Existing study
To enhance the learner's learning efficiency. • By providing not only the exact results but also suggesting other possible documents related to the query. • New type of interactions with the results make the exploration more perceptive and efficient. • Search engine query logs. This research work proposes
Semantic Web Search Semantic Web Search through Online Academic Domain Based Search through Offline Scholar Community Non-Scholar Community • Search engine is the starting point for locating new information on the Web. • Semantic web search designed in this thesis focuses on two main themes: • Semantic web search through online • Domain based search through offline Main Contribution
Keyword based search is emphasised in this research work. • The objective is to determine two-category partition of the data. • Information Retrieval is a problem of selecting the relevant information from a document database. Methodology
WebIR is a discipline for developing innovative models of information access. • They crawl the web pages according to the given query and store the pages in a local page repository. • Indexable representation of both documents and link structures. Contd .,
Domain 1 OS Linux Windows UNIX Dos Domain 2 Hardware Main Domain Bookshelf Sets Sub –Domain Sets Memory CPU RAM Chips Domain 3 Database Data Storage SQL Oracle DOMAIN-CENTRIC SEARCH
Providing a practicable architecture for a Semantic Web search engine using BSDS, this research work aims to help open the floodgates of the emerging Semantic Web. • Semantic web search designed in this thesis focused on two main themes: • Semantic web search through online • Domain based search through offline. • Since the domain is designed only for scholar community, information technology related documents are discussed. Conclusion