370 likes | 392 Views
MindGate is a complete software system that controls web access, filters content, and categorizes requested web pages. It is designed for schools, business organizations, and governmental organizations.
E N D
Kerim KORKMAZ A. Tolga KILINÇ H. Özgür BATUR Berkan KURTOĞLU
Outline • What is MindGate? • Motivation for MindGate Project • Features of System • Parts of The System • User scenerios • Review
What is MindGate? MindGate: • Is an expertsystem • Is a content filteringgateway • Is a complete software. • Categorizes requested web pages
What is MindGate? MindGate: • Controls web access of an organization • Blocking harmful content • Caching for fast access • Keeping user actions’ statistics
Motivation for The MindGate Project Demand for content filtering: • Schools • Business Organizations • Governmental Organizations
MindGateISNOT • A security tool • About server side filtering • A script based admin tool • A bandwith limiter • A P2P blocker
Features of MindGate • Inspecting all requests and responses on the web traffic • Filtering connections according to predefined policies. • User groups, web site categories and filter thresholds • Smart filtering techniques for preventing users entering sites according to policies. • Tracking of the useractions and generating satistical data
Key Features of MindGate Expert systemfor categorization • Improves accuracy in time, with feedback mechanisms and statistic accumulated during a time period. • Using simple but effective heuristics to categorize the web pages. • Highly modularized categorization unit.
Key Features of MindGate Expandingwebsite checklist • Build by categorizer in time • Corrected by user feedbacks • Faster, because all lists and tables are not stored on an SQL server but stored as XML documents with a completely indexed manner on the RAM.
Key Features of MindGate Special Indexing Techniques For Fast Access • Using Apache Lucene API • Ram Index for Checklist • Disk Indexing for Log Archive • Complex queries can easily be applied to all these indexed files for searching or listing purposes.
Key Features of MindGate Threaded Structure • Multi user environment • Concurrency • Synchronisation
Parts Of The System • Core System • Embedded proxy and web server: Connection Control & Admin Interface • Archive File System • SQL Server: User Account Data
Modules of The System • Connection Control:requests and responses • Database Control:static index checking, user authentication, and log archiving. • Categorizer:dynamiccategorization • Interface Control:user interaction,page generation
Connection Control • Handles requests and responses • Interface to proxy • Handles only HTTP Protocol by
Database Control Holds Data: (All indexed) • Checklist (URL - category...) • SessionTable (user - ip number...) • GroupInfo Table (user - group...)
Database Control Interface to: • SQL Database • Archive File System(Logs)
Categorizer Invoked when a page not found in checklist • Evaluation by a heuristic equation • Structural Criteria (Colors,Scripts,Links) • Content Criteria (Meta Data)
Interface Control • User interaction • Administrator interaction • Page generation • Interface to embedded web server
What is next? • More sophisticated categorization, new heuristics • Categorized Data Pool System, to share categorized data • Extending MindGate to handle other protocols (ie. ftp,pop3,…) Please! Stay Tuned :)