Secure Search Engine Ivan Zhou Xinyi Dong
Project Overview • The Secure Search Engine project is a search engine that uses dedicated modules to test the validity of a web page. • These tests verify the web page's certificates and determine whether the page is a phishing site. • Our goal is to set up a working Secure Search Engine (SSE) environment in a cloud architecture.
Detailed architecture • Components of SSE: • Browser plug-in • Web crawler • Certificate verification module • Phishing status verification module • Database abstraction module • Android plug-in (in progress)
Detailed architecture (diagram) • Components shown: Android Plug-in, Browser Plug-in, Internet / Other Websites, Main, Certificate Verification, Phishing Verification, Web Crawler, Data Abstraction, Database
Project Description • Migrate the existing SSE project from a local environment to Mobicloud. • Test SSE in this new environment and modify it if necessary. • Develop a background process to frequently update the bank database for the crawler. • Create an Android component to integrate SSE into a mobile device (tentative).
Task Allocations Ivan: • Test whether the migration was set up successfully (100% done). • Test the crawler and the certificate verification module (50% done). • Develop and test the background bank database updater (100% done). • Research the Android SDK and develop an Android SSE component (0% done).
Task Allocations Xinyi: • Set up the MySQL server on the SSE server (migration) (100% done). • Test the crawler and the certificate verification module (50% done). • Develop and test the background bank database updater (100% done). • Research the Android SDK and develop an Android SSE component (0% done).
Technical Detail for task 1 • Task 1: Migrate the existing SSE project from a local environment to Mobicloud. • Software installed: Apache Tomcat, MySQL, NetBeans, SVN, Java JDK, Jython. • Configuration: the VM's Internet connection, VNC, PATH entries for Java/Tomcat/SVN, and the connection to the MySQL server. • Publish the website to Apache Tomcat.
Technical Detail for task 2 • Two parts need careful testing: • Phishing Filter • Crawler • Phishing Filter: • Checks the database to see whether the site is a known phishing site. • Checks whether a third-party site (PhishTank) has reported it as a phishing site. • Computes a confidence score ourselves.
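The three phishing-filter checks can be sketched as a small pipeline. This is a minimal illustration, not the SSE module itself: the name `phishing_status`, the lookup callables standing in for the database and the PhishTank query, and the 0.5 threshold are all assumptions made for this example.

```python
from typing import Callable, Optional

# Hypothetical lookup type: returns True/False when the source has a
# verdict for the URL, or None when it has no record of it.
Lookup = Callable[[str], Optional[bool]]


def phishing_status(
    url: str,
    db_lookup: Lookup,
    third_party_lookup: Lookup,
    confidence: Callable[[str], float],
    threshold: float = 0.5,  # assumed cut-off, not taken from SSE
) -> bool:
    """Return True if the URL is judged to be a phishing site."""
    # 1. Check our own database first.
    known = db_lookup(url)
    if known is not None:
        return known
    # 2. See whether a third party (PhishTank in the slides) reported it.
    if third_party_lookup(url):
        return True
    # 3. Fall back to a locally computed confidence score.
    return confidence(url) >= threshold


# Usage with in-memory stand-ins for the database and the third-party feed.
database = {"http://bank.example/login": False}
reported = {"http://phish.example/": True}
verdict = phishing_status(
    "http://phish.example/", database.get, reported.get, lambda u: 0.0
)
```

Passing the lookups in as callables keeps the decision order testable without a live database or network access.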
Technical Detail for task 2 • crawler.py: a Python implementation of the Java code that crawls a web page's information. • Seeds in database • Crawl domain • Crawl domain path • Crawl child links • Difficulties encountered: • Page-specific quirks (localhost) (solved) • The crawler connected only on port 443; what about port 80? (solved) • Questionable logic in crawler.py (e.g. depth handling) (exploring) • Other problems (exploring)
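The crawl order above (seeds, then the domain, then child links, with a depth limit) can be sketched as a breadth-first traversal. This is not the project's crawler.py: the function `crawl`, the caller-supplied `fetch_links` callable, and the default `max_depth` are hypothetical names chosen for this example. Accepting both `http` and `https` links reflects the port 80 / port 443 point in the slide.

```python
from collections import deque
from typing import Callable, Iterable, List
from urllib.parse import urljoin, urlsplit


def crawl(
    seeds: Iterable[str],
    fetch_links: Callable[[str], Iterable[str]],
    max_depth: int = 2,  # assumed default, not taken from crawler.py
) -> List[str]:
    """Breadth-first crawl from the seed URLs, staying on each seed's host.

    fetch_links(url) is a caller-supplied function that returns the href
    values found on the page at `url`.
    """
    visited: List[str] = []
    seen = set()
    queue = deque((seed, 0) for seed in seeds)
    while queue:
        url, depth = queue.popleft()
        if url in seen:
            continue
        seen.add(url)
        visited.append(url)
        if depth >= max_depth:
            continue  # pages at the depth limit are recorded but not expanded
        host = urlsplit(url).netloc
        for href in fetch_links(url):
            child = urljoin(url, href)  # resolve relative links
            parts = urlsplit(child)
            # Follow both https (port 443) and http (port 80) links,
            # but only when they stay on the same host.
            if parts.scheme in ("http", "https") and parts.netloc == host:
                queue.append((child, depth + 1))
    return visited
```

Tracking the depth alongside each queued URL makes the stopping rule explicit, which is the kind of logic the "depth" difficulty refers to.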
Technical Detail for task 3 • Develop a background process to frequently update the bank database for the crawler. • Edit the schedule with crontab -e • Syntax (space-separated fields): min hour day-of-month month weekday command • 0 0 * * * /sse/crawler.py
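To show how the five schedule fields of a crontab entry are read, here is a minimal sketch that checks whether an entry would fire at a given time. It assumes the slide's entry is the daily-at-midnight schedule `0 0 * * * /sse/crawler.py`, and it supports only `*` and plain integers (no ranges, lists, steps, or `7` for Sunday). The function names are hypothetical.

```python
from datetime import datetime


def _field_matches(field: str, value: int) -> bool:
    """Match one cron field; this sketch supports only '*' and integers."""
    return field == "*" or int(field) == value


def cron_matches(entry: str, when: datetime) -> bool:
    """Return True if the schedule part of a crontab entry matches `when`."""
    minute, hour, dom, month, dow = entry.split()[:5]
    # cron numbers weekdays from 0 = Sunday; Python uses 0 = Monday.
    cron_dow = (when.weekday() + 1) % 7
    time_ok = (
        _field_matches(minute, when.minute)
        and _field_matches(hour, when.hour)
        and _field_matches(month, when.month)
    )
    dom_ok = _field_matches(dom, when.day)
    dow_ok = _field_matches(dow, cron_dow)
    # When both day fields are restricted, cron runs the job if either matches.
    if dom != "*" and dow != "*":
        return time_ok and (dom_ok or dow_ok)
    return time_ok and dom_ok and dow_ok


# The assumed entry fires at 00:00 every day, and at no other minute.
midnight = cron_matches("0 0 * * * /sse/crawler.py", datetime(2011, 1, 1, 0, 0))
one_past = cron_matches("0 0 * * * /sse/crawler.py", datetime(2011, 1, 1, 0, 1))
```

For cron to run the script directly, crawler.py would also need to be executable and start with an interpreter line; otherwise the command field would name the interpreter explicitly.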
Technical Detail for task 4 • Create an Android component to integrate SSE into a mobile device (tentative). • Android applications are written in the Java programming language. • Tools: Android SDK; Eclipse with the ADT Plugin. • Firmware currently on the Droid: v2.1 update 1. • Newest firmware available: v2.2.1.
Problems Encountered & Solution • Problem: the two of us cannot work on the VM at the same time. • Solution: replicate the setup on one of our own Windows machines, using WAMP and NetBeans with the Python and SVN plug-ins. • Problem: the Mobicloud system was frequently down. • Solution: replicate the same Windows setup on an additional machine.
Conclusion • Task 2 is still ongoing, but we moved on to task 3 and completed it. • crawler.py may need to be restructured because of logic errors in the original code. • The Android implementation will also be time-consuming, so we need to start on it soon.
Thank you! • Comments & Questions.