Lexical scoring system of lexical chain for Quranic document retrieval

An Information Retrieval (IR) system aims to extract information on a particular subject from an extensive collection of text, based on a query made by a user. The user submits the query in the form of keywords or words to match. In the A...


Bibliographic Details
Main Authors: Hamed Zakeri Rad, Sabrina Tiun, Saidah Saad
Format: Article
Language: English
Published: Penerbit Universiti Kebangsaan Malaysia 2018
Online Access: http://journalarticle.ukm.my/13770/
http://journalarticle.ukm.my/13770/1/25370-76265-1-PB.pdf
id ukm-13770
recordtype eprints
spelling ukm-13770 2019-12-05T12:46:30Z http://journalarticle.ukm.my/13770/ Lexical scoring system of lexical chain for Quranic document retrieval Hamed Zakeri Rad, Sabrina Tiun, Saidah Saad
Penerbit Universiti Kebangsaan Malaysia 2018-05 Article PeerReviewed application/pdf en http://journalarticle.ukm.my/13770/1/25370-76265-1-PB.pdf Hamed Zakeri Rad, Sabrina Tiun and Saidah Saad (2018) Lexical scoring system of lexical chain for Quranic document retrieval. GEMA: Online Journal of Language Studies, 18 (2). pp. 59-79. ISSN 1675-8021 http://ejournal.ukm.my/gema/issue/view/1087
repository_type Digital Repository
institution_category Local University
institution Universiti Kebangsaan Malaysia
building UKM Institutional Repository
collection Online Access
language English
description An Information Retrieval (IR) system aims to extract information on a particular subject from an extensive collection of text, based on a query made by a user. The user submits the query in the form of keywords or words to match. In the Al-Quran, verses on the same or comparable topics are scattered across different chapters, so it is difficult for users to remember the many keywords of the verses. In such situations, retrieving information using semantically related words is useful. In well-composed documents, semantic integrity (coherence) exists between the words of the text. Lexical cohesion results from chains of related words that contribute to the continuity of lexical meaning within the text; these chains are a direct result of the text being about the same thing (e.g. a topic). This indicates that lexical chains are a useful and appropriate way for an IR system to represent documents with concepts rather than terms, so that retrieval can succeed on the basis of semantic relations. Therefore, this study proposes a new Lexical Scoring System, in addition to determining the semantic relations that exist between words, with WordNet used as the semantic knowledge base. Using the lexical chain approach, the proposed scoring system helped to retrieve 86.58% of the total relevant documents in the Al-Quran, based on the relevance judgment. Based on these findings, the study concludes that representing verses using lexical chains is appropriate and suitable for a Quranic IR system.
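The description above names three ideas: grouping related words into lexical chains, scoring verses against a query, and reporting the share of relevant documents retrieved. The following is a minimal sketch of those ideas and not the authors' Lexical Scoring System, whose details are not given in this record. The `RELATED` table is a hypothetical toy stand-in for WordNet lookups, and `build_chains`, `chain_score` and `recall` are illustrative names with an assumed, simplified scoring rule.

```python
# Hypothetical toy relation table standing in for WordNet lookups (assumption).
RELATED = {
    frozenset({"charity", "alms"}),
    frozenset({"alms", "poor"}),
    frozenset({"prayer", "worship"}),
}


def related(a, b):
    """Two words are related if identical or listed as a pair in RELATED."""
    return a == b or frozenset({a, b}) in RELATED


def build_chains(words):
    """Greedily add each word to the first chain that holds a related word."""
    chains = []
    for word in words:
        for chain in chains:
            if any(related(word, member) for member in chain):
                chain.append(word)
                break
        else:
            # No existing chain fits, so the word starts a new chain.
            chains.append([word])
    return chains


def chain_score(query_terms, chains):
    """Illustrative score: count chain members related to any query term."""
    return sum(
        1
        for chain in chains
        for member in chain
        if any(related(q, member) for q in query_terms)
    )


def recall(retrieved, relevant):
    """Fraction of the relevant documents that were retrieved."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(set(retrieved) & relevant) / len(relevant)


chains = build_chains(["charity", "prayer", "alms", "poor", "worship"])
print(chains)
print(chain_score(["alms"], chains))
print(recall(["v1", "v2", "v3"], ["v1", "v3", "v4", "v5"]))
```

Under this reading, the 86.58% figure reported in the description corresponds to a recall-style measure: relevant documents retrieved divided by all relevant documents in the relevance judgment.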
format Article
author Hamed Zakeri Rad
Sabrina Tiun
Saidah Saad
title Lexical scoring system of lexical chain for Quranic document retrieval
publisher Penerbit Universiti Kebangsaan Malaysia
publishDate 2018
url http://journalarticle.ukm.my/13770/
http://journalarticle.ukm.my/13770/1/25370-76265-1-PB.pdf
first_indexed 2023-09-18T20:05:36Z
last_indexed 2023-09-18T20:05:36Z
_version_ 1777407136933871616