Enhancing Transliterated Search: Insights and Feedback on Multilingual Scenarios

Join us for a closing session on the results of yesterday's experiments in transliterated search scenarios involving Hindi and English. We will discuss the challenges and findings from our tasks on query word labeling and cross-script ad hoc retrieval. Participants can share their feedback and insights on mixed-language queries and document types. Explore how different scripts, like Devanagari and Roman, impact search results in a multilingual context. Collaborate on enhancing our approach to handle more languages. Your contributions are valuable!

quon-bell

Presentation Transcript


  1. Closing Session • FIRE shared task • Results of yesterday’s experiments • Open discussion and Your Feedback

  2. FIRE Shared Task on Transliterated Search

  3. Scenarios • Two languages L1 and L2 (say Hindi and English) • Two scripts S1 and S2 (say Devanagari and Roman) • Texts can be of four types: • L1 in S1: यह हिन्दी में लिखा है • L2 in S2: This is in English • L1 in S2: yah hindi mein likha hai • L2 in S1: दिस इज़ इन इंग्लिश • Documents can be a mixture of these four types. • Queries will contain text in one script, but can be a mixture of languages
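
The script half of the four text types above can be separated mechanically, since Devanagari and Roman occupy different Unicode ranges. The following is a minimal sketch, not part of the shared task itself; the function names `script_of` and `scripts_in` are hypothetical. It only identifies the script of a token. Deciding the language (for example, Hindi written in Roman script) needs more than this.

```python
def script_of(token: str) -> str:
    """Classify a token by script using Unicode ranges.

    Assumption: Devanagari is detected via the U+0900..U+097F block,
    and Roman via ASCII letters only (accented Latin letters are ignored).
    """
    has_devanagari = any("\u0900" <= ch <= "\u097F" for ch in token)
    has_roman = any(ch.isascii() and ch.isalpha() for ch in token)
    if has_devanagari and has_roman:
        return "Mixed"
    if has_devanagari:
        return "Devanagari"
    if has_roman:
        return "Roman"
    return "Other"


def scripts_in(text: str):
    """Return (token, script) pairs for a whitespace-tokenized text."""
    return [(tok, script_of(tok)) for tok in text.split()]


if __name__ == "__main__":
    for sample in ["यह हिन्दी में लिखा है", "This is in English"]:
        print(scripts_in(sample))
```

Note that the Roman-script sample could be English or transliterated Hindi; the script check alone cannot tell them apart, which is what motivates word-level language labeling.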

  4. Task 1: Query Word Labeling • Example query: jab thak hai jan movie review • jab thak hai jan: Transliterated Hindi • movie review: English
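
To make the labeling task concrete, here is a deliberately simple sketch that tags each query word by dictionary lookup. It is an illustration under stated assumptions, not the method used by any participant: the two word lists and the function `label_query` are hypothetical, and a realistic system would need much larger resources and a way to handle spelling variation in transliterated words.

```python
# Hypothetical toy word lists, for illustration only.
ENGLISH_WORDS = {"movie", "review", "song", "lyrics"}
HINDI_ROMAN_WORDS = {"jab", "hai", "yah", "mein", "likha"}


def label_query(query: str):
    """Label each word of a Roman-script query by language.

    Assumption: English is checked first, so a word present in both
    lists is labeled English. Words in neither list are 'Unknown'.
    """
    labels = []
    for word in query.lower().split():
        if word in ENGLISH_WORDS:
            labels.append((word, "English"))
        elif word in HINDI_ROMAN_WORDS:
            labels.append((word, "Transliterated Hindi"))
        else:
            labels.append((word, "Unknown"))
    return labels


if __name__ == "__main__":
    print(label_query("yah movie review hai"))
```

The "Unknown" bucket is where the real difficulty lies: transliterated Hindi has no fixed spelling, so exact lookup misses many valid words.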

  5. Task 2: Cross-script Ad hoc Retrieval • Query: Mixed language, one script • Document: In multiple scripts • Specific Scenario: Bollywood Song lyrics search • Document corpus will be provided early.
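
One way to picture cross-script retrieval is to map every token to a common script before matching. The sketch below assumes a tiny, hypothetical word-level table `TRANSLIT` from Devanagari to Roman and ranks documents by simple token overlap; the names `to_roman_tokens` and `search` are also hypothetical. It is not the task's evaluation setup or any participant's system, and it ignores the spelling variation that makes the real problem hard.

```python
# Hypothetical word-level Devanagari -> Roman table, for illustration only.
TRANSLIT = {"जब": "jab", "तक": "tak", "है": "hai", "जान": "jaan"}


def to_roman_tokens(text: str):
    """Tokenize on whitespace and map known Devanagari words to Roman."""
    return [TRANSLIT.get(tok, tok).lower() for tok in text.split()]


def search(query: str, docs: dict):
    """Rank document ids by the number of distinct query tokens they share.

    Ties are broken by document id so the ordering is deterministic.
    Documents with no overlap are omitted.
    """
    query_tokens = set(to_roman_tokens(query))
    scored = []
    for doc_id, text in docs.items():
        overlap = len(query_tokens & set(to_roman_tokens(text)))
        if overlap:
            scored.append((overlap, doc_id))
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [doc_id for _, doc_id in scored]


if __name__ == "__main__":
    docs = {
        "d1": "जब तक है जान",
        "d2": "jab tak hai jaan lyrics",
        "d3": "this is in english",
    }
    print(search("jab tak hai jaan", docs))
```

With this normalization, a Roman-script query can match a document written in Devanagari, which is the core requirement of the scenario described on this slide.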

  6. Collaborative Data creation • Query labeling • Relevance Judgement • More languages • Kannada??

  7. Thank You!
