Evaluation of XML documents queries based on native XML database

As the amount of data available on the Internet grows rapidly, more and more of the data becomes semi structured. The Extensible Markup Language (XML), as a format for semi structured data, has become a standard for the representation and exchange of data over the Internet. Early in the XML history...

Full description

Bibliographic Details
Main Author:	Lazim, Raghad Yaseen
Format:	Undergraduates Project Papers
Language:	English English English English
Published:	2016
Subjects:	QA75 Electronic computers. Computer science
Online Access:	http://umpir.ump.edu.my/id/eprint/18104/ http://umpir.ump.edu.my/id/eprint/18104/ http://umpir.ump.edu.my/id/eprint/18104/1/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Table%20of%20contents.pdf http://umpir.ump.edu.my/id/eprint/18104/7/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Abstract.pdf http://umpir.ump.edu.my/id/eprint/18104/8/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Chapter%201.pdf http://umpir.ump.edu.my/id/eprint/18104/17/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-References.pdf

id	ump-18104
recordtype	eprints
repository_type	Digital Repository
institution_category	Local University
institution	Universiti Malaysia Pahang
building	UMP Institutional Repository
collection	Online Access
language	English English English English
topic	QA75 Electronic computers. Computer science
spellingShingle	QA75 Electronic computers. Computer science Lazim, Raghad Yaseen Evaluation of XML documents queries based on native XML database
description	As the amount of data available on the Internet grows rapidly, more and more of the data becomes semi structured. The Extensible Markup Language (XML), as a format for semi structured data, has become a standard for the representation and exchange of data over the Internet. Early in the XML history there were thoughts about whether XML is different from other data formats that require a database of its own. The popularity and wide-spread use of XML among a diverse set of organizations has engendered a rethinking of the storage and retrieval practices for data. Most early XML storage practices relied on mappings and transformations between XML data trees and relational database tables within a Relational Database. Though relational databases can represent nested data structures by using tables with foreign keys, it is still difficult to search these structures for objects at an unknown depth of nesting; by contrary, it is a potential advantage in XML. Also, the nested and repeating elements in XML documents can quite easily result in an unmanageable number of tables. Furthermore, it is usually very difficult after insertion to change the relational schema due to XML schema changes. The limitations of relational approaches are now well known. Moreover, local update to the document should not cause drastic changes to the whole storage system. Therefore, the design of the storage system should trade-off between the query performance and update costs. This study is to evaluate the Native XML database (NXD) performance in a comparison with XML_Enabled Database (XED), and then to ellhance· Entity Relationship (ER) algorithm of the relational schema for the improvement of Insert, Delete, Update and Search XML document (XML files with a large number of elements) and finally, to validate the algorithm in NXD and compare the performance ofXED and NXD, by implementing the same command and control data model. Five different sizes of datasets have been used (65.8, 101, 117, 127, 183 MB). Benchmark techniques is used to measure the performance. XMark and XMark-1 are two main tools of Benchmarks in the research field, and they have used for the dataset. The performance of a system can be measured by using datasets of. varying sizes, different documents with different features. The size of XML documents and the number of elements have been determined by the factor of the main driver of generation. The result of this study shown that XED has better performance for the datasets <= 117 MB. The performance of XED begins to decline with the increase in the size of XML data(> 127 MB), while NXD shown better performance in for the data(=> 127 MB). NXD produced better results in the reporting section, which implies that the Nf{D X-Query has performance gains from query optimization. Most of the figures show that the XED starts better, but becomes worse as data size grows. The difference becomes obvious as the query becomes more complicated.
format	Undergraduates Project Papers
author	Lazim, Raghad Yaseen
author_facet	Lazim, Raghad Yaseen
author_sort	Lazim, Raghad Yaseen
title	Evaluation of XML documents queries based on native XML database
title_short	Evaluation of XML documents queries based on native XML database
title_full	Evaluation of XML documents queries based on native XML database
title_fullStr	Evaluation of XML documents queries based on native XML database
title_full_unstemmed	Evaluation of XML documents queries based on native XML database
title_sort	evaluation of xml documents queries based on native xml database
publishDate	2016
url	http://umpir.ump.edu.my/id/eprint/18104/ http://umpir.ump.edu.my/id/eprint/18104/ http://umpir.ump.edu.my/id/eprint/18104/1/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Table%20of%20contents.pdf http://umpir.ump.edu.my/id/eprint/18104/7/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Abstract.pdf http://umpir.ump.edu.my/id/eprint/18104/8/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Chapter%201.pdf http://umpir.ump.edu.my/id/eprint/18104/17/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-References.pdf
first_indexed	2023-09-18T22:25:27Z
last_indexed	2023-09-18T22:25:27Z
_version_	1777415935374655488
spelling	ump-181042017-07-07T02:26:07Z http://umpir.ump.edu.my/id/eprint/18104/ Evaluation of XML documents queries based on native XML database Lazim, Raghad Yaseen QA75 Electronic computers. Computer science As the amount of data available on the Internet grows rapidly, more and more of the data becomes semi structured. The Extensible Markup Language (XML), as a format for semi structured data, has become a standard for the representation and exchange of data over the Internet. Early in the XML history there were thoughts about whether XML is different from other data formats that require a database of its own. The popularity and wide-spread use of XML among a diverse set of organizations has engendered a rethinking of the storage and retrieval practices for data. Most early XML storage practices relied on mappings and transformations between XML data trees and relational database tables within a Relational Database. Though relational databases can represent nested data structures by using tables with foreign keys, it is still difficult to search these structures for objects at an unknown depth of nesting; by contrary, it is a potential advantage in XML. Also, the nested and repeating elements in XML documents can quite easily result in an unmanageable number of tables. Furthermore, it is usually very difficult after insertion to change the relational schema due to XML schema changes. The limitations of relational approaches are now well known. Moreover, local update to the document should not cause drastic changes to the whole storage system. Therefore, the design of the storage system should trade-off between the query performance and update costs. This study is to evaluate the Native XML database (NXD) performance in a comparison with XML_Enabled Database (XED), and then to ellhance· Entity Relationship (ER) algorithm of the relational schema for the improvement of Insert, Delete, Update and Search XML document (XML files with a large number of elements) and finally, to validate the algorithm in NXD and compare the performance ofXED and NXD, by implementing the same command and control data model. Five different sizes of datasets have been used (65.8, 101, 117, 127, 183 MB). Benchmark techniques is used to measure the performance. XMark and XMark-1 are two main tools of Benchmarks in the research field, and they have used for the dataset. The performance of a system can be measured by using datasets of. varying sizes, different documents with different features. The size of XML documents and the number of elements have been determined by the factor of the main driver of generation. The result of this study shown that XED has better performance for the datasets <= 117 MB. The performance of XED begins to decline with the increase in the size of XML data(> 127 MB), while NXD shown better performance in for the data(=> 127 MB). NXD produced better results in the reporting section, which implies that the Nf{D X-Query has performance gains from query optimization. Most of the figures show that the XED starts better, but becomes worse as data size grows. The difference becomes obvious as the query becomes more complicated. 2016-11 Undergraduates Project Papers NonPeerReviewed application/pdf en http://umpir.ump.edu.my/id/eprint/18104/1/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Table%20of%20contents.pdf application/pdf en http://umpir.ump.edu.my/id/eprint/18104/7/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Abstract.pdf application/pdf en http://umpir.ump.edu.my/id/eprint/18104/8/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-Chapter%201.pdf application/pdf en http://umpir.ump.edu.my/id/eprint/18104/17/Evaluation%20of%20XML%20documents%20queries%20based%20on%20native%20XML%20database-References.pdf Lazim, Raghad Yaseen (2016) Evaluation of XML documents queries based on native XML database. Faculty of Computer Systems & Software Engineering, Universiti Malaysia Pahang. http://iportal.ump.edu.my/lib/item?id=chamo:100041&theme=UMP2

Evaluation of XML documents queries based on native XML database

Similar Items