Lexical scoring system of lexical chain for Quranic document retrieval
An Information Retrieval (IR) system aims to extract information on a particular subject from an extensive collection of text, based on a query made by a user. In IR, the user retrieves information by submitting a query in the form of keywords or words to match. In the A...
Main Authors: | Hamed Zakeri Rad, Sabrina Tiun, Saidah Saad |
---|---|
Format: | Article |
Language: | English |
Published: | Penerbit Universiti Kebangsaan Malaysia, 2018 |
Online Access: | http://journalarticle.ukm.my/13770/ http://journalarticle.ukm.my/13770/1/25370-76265-1-PB.pdf |
id |
ukm-13770 |
---|---|
recordtype |
eprints |
spelling |
ukm-13770 2019-12-05T12:46:30Z http://journalarticle.ukm.my/13770/ Lexical scoring system of lexical chain for Quranic document retrieval. Hamed Zakeri Rad, Sabrina Tiun, Saidah Saad.
Penerbit Universiti Kebangsaan Malaysia, 2018-05. Article, PeerReviewed, application/pdf, en. http://journalarticle.ukm.my/13770/1/25370-76265-1-PB.pdf
Hamed Zakeri Rad, Sabrina Tiun and Saidah Saad (2018) Lexical scoring system of lexical chain for Quranic document retrieval. GEMA: Online Journal of Language Studies, 18 (2). pp. 59-79. ISSN 1675-8021 http://ejournal.ukm.my/gema/issue/view/1087 |
repository_type |
Digital Repository |
institution_category |
Local University |
institution |
Universiti Kebangsaan Malaysia |
building |
UKM Institutional Repository |
collection |
Online Access |
language |
English |
description |
An Information Retrieval (IR) system aims to extract information on a particular subject from an
extensive collection of text, based on a query made by a user. In IR, the user retrieves
information by submitting a query in the form of keywords or words to match. In the Al-Quran,
verses on the same or comparable topics are scattered throughout the text in different chapters,
and it is therefore difficult for users to remember the many keywords of the verses. In such
situations, retrieving information using semantically related words is useful. In well-composed
documents, semantic integrity (coherence) exists between the words of the text. Lexical cohesion
results from chains of related words that contribute to the continuity of lexical meaning within
the text; these chains are a direct result of the text being about the same thing (i.e., the same
topic). This indicates that combining an IR system with lexical chains is a useful and appropriate
method for representing documents by concepts rather than by terms, so that retrieval can succeed
on the basis of semantic relations. Therefore, this study proposes a new Lexical Scoring System
and determines the semantic relations that exist between words, using WordNet as the semantic
knowledge base. Using the lexical chain approach, the proposed scoring system helped to retrieve
86.58% of the total relevant documents in the Al-Quran, based on the relevance judgment. Based on
the findings, the study concludes that the proposed approach of representing verses using lexical
chains is appropriate and suitable for a Quranic IR system. |
format |
Article |
author |
Hamed Zakeri Rad, Sabrina Tiun, Saidah Saad |
title |
Lexical scoring system of lexical chain for Quranic document retrieval |
publisher |
Penerbit Universiti Kebangsaan Malaysia |
publishDate |
2018 |
url |
http://journalarticle.ukm.my/13770/ http://journalarticle.ukm.my/13770/1/25370-76265-1-PB.pdf |
first_indexed |
2023-09-18T20:05:36Z |
last_indexed |
2023-09-18T20:05:36Z |
_version_ |
1777407136933871616 |
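
The description above mentions grouping semantically related words into lexical chains, scoring verses against a query, and reporting the share of relevant documents retrieved. The following is a minimal sketch of those three ideas, not the authors' Lexical Scoring System. The `RELATED` table is a hypothetical hand-made stand-in for WordNet lookups, and the greedy chain builder, the `score` function, and the sample verse identifiers are all assumptions made for illustration.

```python
# Hypothetical toy relation table standing in for WordNet lookups.
# Each key maps to the words treated as semantically related to it.
RELATED = {
    "charity": {"alms", "giving"},
    "alms": {"charity", "giving"},
    "giving": {"charity", "alms"},
    "prayer": {"worship"},
    "worship": {"prayer"},
}


def related(a, b):
    """Return True if two words are identical or linked in the toy table."""
    return a == b or b in RELATED.get(a, set())


def build_chains(words):
    """Greedily group words into chains: a word joins the first chain
    that already contains a related word, otherwise it starts a new one."""
    chains = []
    for word in words:
        for chain in chains:
            if any(related(word, member) for member in chain):
                chain.append(word)
                break
        else:
            chains.append([word])
    return chains


def score(query_words, doc_words):
    """Illustrative score (an assumption, not the paper's formula):
    sum the lengths of chains that contain a word related to the query."""
    total = 0
    for chain in build_chains(doc_words):
        if any(related(q, m) for q in query_words for m in chain):
            total += len(chain)
    return total


def recall(retrieved, relevant):
    """Fraction of the relevant documents that were retrieved."""
    relevant = set(relevant)
    if not relevant:
        return 0.0
    return len(relevant & set(retrieved)) / len(relevant)


# Hypothetical sample data: verse identifiers mapped to content words.
docs = {
    "v1": ["alms", "giving", "prayer"],
    "v2": ["worship", "prayer"],
    "v3": ["journey"],
}
query = ["charity"]
retrieved = [doc_id for doc_id, words in docs.items() if score(query, words) > 0]
print(retrieved, recall(retrieved, ["v1", "v2"]))
```

In this sketch the query word "charity" never appears in any verse, yet "v1" is still retrieved because its chain of "alms" and "giving" is related to the query. A real system would replace `RELATED` with lookups against a semantic knowledge base such as WordNet.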