Web IR / NLP Group (WING) Architecture

Min-Yen Kan, School of Computing, National University of Singapore.


Presentation Transcript


  1. Web IR / NLP Group (WING) Architecture
     Min-Yen Kan, School of Computing, National University of Singapore

  2. Projects
     Funded
     • CSIDM (CAS, China): Aobo, CSIDM Interns
     • ForeCite (Expires Oct 2010): Kaz, Emma, Thang
     Proposed
     • Data Cleaning in the Cloud (UCI)
     • Text Mining Clinical Articles (Duke-NUS / UCI)
       • Shreyasee, Justin
     • Text Mining Scientific Articles (Global Asia Institute)
     • ForeCite2
     WING, NUS

  3. Research Topics (areas: DL, IR/MM/HCI, NLP)
     • Yee Fan Tan – Record Matching in Digital Libraries
     • Jin Zhao – Math Equation IR
     • Jesse Gozali – Photo-taking Behavior
     • Ziheng Lin – Rhetorical Discourse Analysis
     • Cong Duy Vu Hoang – Related Work Summarization
     • Jun Ping Ng – Logic in Question Answering
     • Aobo Wang – Crowdsourcing for Machine Translation
     • Shihong Huang, Wai Hong Loh – Tooltip translator for Firefox
     • Kazunari Sugiyama – Recommender Systems in Digital Libraries
     • Minh Thang Luong – ForeCite
     • Emma Thuy Dung Nguyen – ForeCite
     Incoming Staff (4 UROP, 1 Intern):
     • Shomir Wilson (Intern) – Mention Detection in Scientific Articles, w/ Jin
     • Shawn Tan (UROP) – Continuing PARCELS, w/ Jesse
     • Tamisa Huangsiri, Low Wee Hung (UROP) – CSIDM Firefox, w/ Aobo, Jun Ping
     • Yipeng Huang (UROP) – Cloud Data Cleaning, w/ Yee Fan, Jin

  4. Responsibilities (to be discussed)
     • Kaz: Non-CSIDM UROP guidance
     • Yee Fan: None (Thesis Writing!!)
     • Jin: RPNLPIR / Meeting and Room Bookings
     • Ziheng: Publication Page / Joomla / Social
     • Jesse: RoR / FC / CSX
     • Aobo: RoR / Web System Admin
     • Jun Ping: System Admin Lead

  5. Cluster Architecture
     • Systems named after Singapore’s highways
     Fixed IP
     • CTE – RAID drive host, LDAP host, source code repository
     • AYE – webserver, mailserver, mailman, virtual host on ECP
     DHCP (.ddns.)
     • ECP – LDAP backup
     • PIE – compute server
     • Windows Server (.ddns)
     • KPE, KJE, BKE, SLE

  6. OS support
     All *nix group machines run CentOS 5
     • stable enterprise Linux distribution
     • all mount cte’s RAID drive, plus other automounts
     Future
     • use rsync to sync all binaries across machines
     • expand RAID to encompass disks over different machines for more space (more SAN-like)
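
The planned rsync step could be sketched roughly as follows. This is only an illustration in Python, not the group's actual script: the host list reuses machine names from the cluster slide, and the destination path `/usr/local/rpnlpir` is a hypothetical placeholder. The commands are built but not executed, and `--dry-run` is on by default.

```python
import shlex

# Hosts to receive the binaries. Names come from the cluster slide;
# which machines should actually be synced is an assumption.
HOSTS = ["aye", "ecp", "pie"]


def build_rsync_command(source_dir, host, dest_dir, dry_run=True):
    """Return an rsync argument list mirroring source_dir to host:dest_dir."""
    cmd = ["rsync", "-a", "--delete"]  # archive mode, remove stale files
    if dry_run:
        cmd.append("--dry-run")  # report changes without applying them
    # A trailing slash on the source copies the directory's contents
    # rather than the directory itself.
    cmd.append(source_dir.rstrip("/") + "/")
    cmd.append(f"{host}:{dest_dir.rstrip('/')}/")
    return cmd


def plan_sync(source_dir, dest_dir, hosts=HOSTS, dry_run=True):
    """Build one rsync command per target host."""
    return [build_rsync_command(source_dir, h, dest_dir, dry_run) for h in hosts]


if __name__ == "__main__":
    # /usr/local/rpnlpir is a hypothetical destination path.
    for cmd in plan_sync("/mnt/rpnlpir-Linux", "/usr/local/rpnlpir"):
        print(shlex.join(cmd))
```

Each printed line can be reviewed and then run by hand, or passed to `subprocess.run` once the dry-run output looks right.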

  7. RAID setup
     • Currently 5.0 TB in RAID 5?
     • ext3, mounted on cte
     • /mnt/homes – home directories
     • /mnt/rpnlpir-indep – machine-independent data (datasets)
     • /mnt/rpnlpir-Linux – binaries
     • /mnt/rpnlpir-Windows – binaries
     Future
     • DB server coming online for Rails applications
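
As a quick check on the RAID 5 figure: RAID 5 spends one disk's worth of space on parity, so usable capacity is (number of disks - 1) × disk size. The slide does not give the disk count or size, so the numbers below are purely hypothetical examples that happen to yield 5.0 TB.

```python
def raid5_usable_tb(num_disks, disk_size_tb):
    """Usable capacity of a RAID 5 array with equally sized disks."""
    if num_disks < 3:
        raise ValueError("RAID 5 needs at least three disks")
    # One disk's worth of capacity is consumed by distributed parity.
    return (num_disks - 1) * disk_size_tb


if __name__ == "__main__":
    # Hypothetical layouts; the real array's geometry is not stated.
    print(raid5_usable_tb(6, 1.0))
    print(raid5_usable_tb(3, 2.5))
```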

  8. Webserver (aye.comp.nus.edu.sg)
     • Apache
     • Virtual hosts (wing.comp, linc.comp, opac.comp)
     • Hosts Tomcat for Java servlets
     • Hosts gmond (Ganglia monitor)
     • Runs webalizer for stats
     • Hosts Ruby on Rails apps (Trung’s myror script; to be deprecated soon)
     • Hosts web service server (router for web service calls)

  9. Web Services
     • Our infrastructure is tuned to expose many services and demos as web services.
     • External calls go to port 4000
     • List of web services at http://wing.comp.nus.edu.sg/~forecite/
     • Calls are handled by the WebServiceServer (WSS) Ruby code.
     • Directory for web services currently at /home/forecite/services/
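
The routing role described above (one server receiving calls and handing them to individual services) might look something like the sketch below. This is not the WSS code, which is written in Ruby and not shown in the slides; the class name `ServiceRouter`, the `wordcount` service, and the response format are all invented for illustration.

```python
from typing import Callable, Dict


class ServiceRouter:
    """Maps service names to handler functions and dispatches calls."""

    def __init__(self):
        self._handlers: Dict[str, Callable[[dict], dict]] = {}

    def register(self, name, handler):
        """Make a handler available under the given service name."""
        self._handlers[name] = handler

    def dispatch(self, name, params):
        """Run the named service, wrapping the outcome in a status dict."""
        handler = self._handlers.get(name)
        if handler is None:
            return {"status": "error", "message": f"unknown service: {name}"}
        try:
            return {"status": "ok", "result": handler(params)}
        except Exception as exc:  # keep the router alive if a service fails
            return {"status": "error", "message": str(exc)}


def word_count(params):
    """Toy service (hypothetical): count whitespace-separated tokens."""
    return {"words": len(params.get("text", "").split())}


router = ServiceRouter()
router.register("wordcount", word_count)

if __name__ == "__main__":
    print(router.dispatch("wordcount", {"text": "web ir nlp"}))
    print(router.dispatch("missing", {}))
```

Keeping the registry separate from the network listener means a new demo only needs a `register` call, which fits the goal of making many services available through one port.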

  10. Joomla
     • For our website
     • Administration by admin@wing, PhD students
     Customizations
     • Forum integration (phpBB)
     • Forum has contact information for all staff
     • Forum user DB not yet synced with shadow passwords in LDAP
     • RPNLPIR (resource list)
     • Blog

  11. Mailing List
     • mailman runs on aye
     • lists also run on wing (alias for aye)
     • both local and international mailing lists are hosted here

  12. LDAP
     • Keeps logins/UIDs/GIDs synced
     • Main server on cte
     • Backup on aye
     • Needs to be robust in case the LDAP server fails
     • Local root for all machines must be maintained
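
The main/backup arrangement above implies a simple failover order: try cte first, then aye, and fall back to local accounts if neither answers. A minimal sketch, assuming the servers listen on the standard LDAP port 389 (the slides do not state the port) and using a plain TCP connection as the health check:

```python
import socket

# Main server first, then the backup. Port 389 is an assumption.
LDAP_SERVERS = [("cte", 389), ("aye", 389)]


def tcp_probe(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


def pick_ldap_server(servers=LDAP_SERVERS, probe=tcp_probe):
    """Return the first reachable server's host name, or None.

    None signals that callers should rely on local accounts
    (for example, the local root login kept on every machine).
    """
    for host, port in servers:
        if probe(host, port):
            return host
    return None


if __name__ == "__main__":
    # Simulate the main server being down with a fake probe.
    print(pick_ldap_server(probe=lambda host, port: host == "aye"))
```

Passing the probe in as a parameter keeps the selection logic testable without touching the network.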

  13. RPNLPIR (Research Project for NLP / IR)
     • Common team account
     • Keeps software repository, mirrored by a web page listing
     • Keeps CVS repo in ~/CVSDir
     • Keeps git repo in ~/repo
     • Accessible to all group members
