Celebrity Timeliner is an application that generates year-based timelines for celebrities from freely available Wikipedia datasets. A SAX parser reads each article, and a regex parser extracts timeline entries from it and stores them in a database for display. The software stack comprises the Java SDK, JSP, MySQL, the MySQL JDBC connector, EatJ hosting, Eclipse, and Oxygen XML Editor.
Celebrity Timeliner
Karan Seth, Rahul Dureja, Salim Ali Khan
Objective
• Generate year-based timelines for celebrities.
• Use freely available Wikipedia datasets as the source.
• A SAX parser produces the data for each article; a regex parser then extracts timeline entries from that data and writes them to the database.
• Process the data to build a category ontology and display a timeline.
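The SAX-then-regex step could look roughly like the sketch below. It is only an illustration under stated assumptions, not the project's actual parser: the class `TimelineSketch`, the `Event` type, the `extract` method, and the "In <year> ... ." sentence pattern are all hypothetical, and the input is assumed to follow the `<page>`/`<title>`/`<text>` layout of Wikipedia XML dumps. The database write is left out.

```java
import java.io.IOException;
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;
import javax.xml.parsers.ParserConfigurationException;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

class TimelineSketch {
    // Assumed heuristic: a sentence that starts with "In <4-digit year>" and runs to the next period.
    private static final Pattern YEAR_SENTENCE =
            Pattern.compile("\\b[Ii]n\\s+(\\d{4})\\b[^.]*\\.");

    /** One timeline entry: article title, year, and the matching sentence. */
    static final class Event {
        final String title;
        final int year;
        final String text;

        Event(String title, int year, String text) {
            this.title = title;
            this.year = year;
            this.text = text;
        }

        @Override
        public String toString() {
            return year + " | " + title + " | " + text;
        }
    }

    /** Streams the XML with SAX and applies the regex to each article's <text> body. */
    static List<Event> extract(String xml) {
        final List<Event> events = new ArrayList<>();
        DefaultHandler handler = new DefaultHandler() {
            private final StringBuilder buffer = new StringBuilder();
            private String title = "";

            @Override
            public void startElement(String uri, String localName, String qName, Attributes atts) {
                buffer.setLength(0); // start collecting text for the new element
            }

            @Override
            public void characters(char[] ch, int start, int length) {
                buffer.append(ch, start, length); // may be called several times per element
            }

            @Override
            public void endElement(String uri, String localName, String qName) {
                if ("title".equals(qName)) {
                    title = buffer.toString().trim();
                } else if ("text".equals(qName)) {
                    Matcher m = YEAR_SENTENCE.matcher(buffer);
                    while (m.find()) {
                        events.add(new Event(title, Integer.parseInt(m.group(1)), m.group().trim()));
                    }
                }
            }
        };
        try {
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xml)), handler);
        } catch (ParserConfigurationException | SAXException | IOException e) {
            throw new IllegalStateException("Could not parse the XML input", e);
        }
        return events;
    }

    public static void main(String[] args) {
        String xml = "<mediawiki><page><title>Jane Doe</title><revision><text>"
                + "Jane Doe is an actor. In 1994, she debuted on stage. In 2001 she won an award."
                + "</text></revision></page></mediawiki>";
        for (Event e : extract(xml)) {
            System.out.println(e);
        }
    }
}
```

SAX suits this job because the Wikipedia dumps are far too large to load as a DOM tree; the handler only ever holds one element's text in memory. In a fuller version, each `Event` would be inserted into the MySQL database through JDBC instead of being collected in a list.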
List of Software
• Java SDK (building the SAX parser)
• JSP (displaying the front-end UI)
• MySQL (database)
• MySQL Connector/J (to establish a JDBC link)
• EatJ Hosting (the hosting service running the application)
• IDE
  • Eclipse
  • Oxygen XML Editor
Work Distribution
• Karan Seth
  • Environment Setup
  • SAX Parser to parse XML files provided by Wikipedia
  • Regular Expression
• Rahul Dureja
  • Database Setup and Creation
  • Linking SAX Parser to the Database
  • Handling Search module in UI
• Salim Ali Khan
  • User Interface Setup
  • Regular Expressions
  • Handling Data display and complete integration
• Common Tasks
  • Environment Cleanup
  • Modifying the Parser to include specific cases
  • Brainstorming on deciding the architecture of the application