Volume 27, Issue 2 (3-2012)                   ... 2012, 27(2): 798-813 | Back to browse issues page

XML Persian Abstract Print


Download citation:
BibTeX | RIS | EndNote | Medlars | ProCite | Reference Manager | RefWorks
Send citation to:

Nezarat A, Mosavi Miangah T. Designing and Implementing a Cross-Language Information Retrieval System Using Linguistic Corpora. .... 2012; 27 (2) :798-813
URL: http://jipm.irandoc.ac.ir/article-1-1869-en.html
Islamic Azad University, Yazd Branch
Abstract:   (11601 Views)
Information retrieval (IR) is a crucial area of natural language processing (NLP) and can be defined as finding documents whose content is relevant to the query need of a user. Cross-language information retrieval (CLIR) refers to a kind of information retrieval in which the language of the query and that of searched document are different. In fact, it is a retrieval process where the user presents queries in one language to retrieve documents in another language. This paper tried to construct a bilingual lexicon of parallel chunks of English and Persian from two very large monolingual corpora an English-Persian parallel corpus which could be directly applied to cross-language information retrieval tasks. For this purpose, a statistical measure known as Association Score (AS) was used to compute the association value between every two corresponding chunks in the corpus using a couple of complicated algorithms. Once the CLIR system was developed using this bilingual lexicon, an experiment was performed on a set of one hundred English and Persian phrases and collocations to see to what extend this system was effective in assisting the users find the most relevant and suitable equivalents of their queries in either language.
Full-Text [PDF 397 kb]   (1734 Downloads)    
Type of Study: Research | Subject: Library and Information Science
Received: 2012/03/17

Add your comments about this article : Your username or Email:
CAPTCHA code

Send email to the article author


© 2019 All Rights Reserved | Iranian Journal of Information processing and Management

Designed & Developed by : Yektaweb