[Home ] [Archive]   [ فارسی ]  
:: ::
Back to the articles list Back to browse issues page
An automatic Persian text summarization system based on linguistic features and regression
Mahmood Soltani , Jalal Nasiri , Ehsan Asgarian
Assistant Professor Iranian Research Institute for Information Science and Technology (IRANDOC)
Abstract:   (77 Views)
Considering the vast amount of existing written information and the shortage of time, optimal summarization of books, articles, news reports, etc. on the Web is a major concern of researchers. In this paper, we propose a new approach for Persian single-document Summarization based on several linguistic features of text. In our approach after extracting the linguistic features for each sentence, the weight of features is learned by a linear regression method. We select one sentence with maximum score at each step of algorithm. The score of each sentence is calculated based on two factors: first, sum of the weighted features and second, the amount of its similarity to the sentences that are selected for final summary previously. We use an automatic evaluation tool to compare our approach with other existing approaches. The result indicates that our method improves the performance of summarization.
Keywords: Single- Document summarization, Linguistic Feature, linear regression
Full-Text [PDF 789 kb]   (17 Downloads)    
Type of Study: Research | Subject: Information Technology
Received: 2017/02/8 | Accepted: 2017/09/19
Send email to the article author

Add your comments about this article
Your username or Email:


XML   Persian Abstract   Print

Back to the articles list Back to browse issues page
پژوهشنامه پردازش و مدیریت اطلاعات Journal of Information processing and Management
Persian site map - English site map - Created in 0.26 seconds with 30 queries by YEKTAWEB 3735