Using regular expressions for mining data in large software repositories

The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this...

Full description

Bibliographic Details
Main Author: Awang Abu Bakar, Normi Sham
Format: Conference or Workshop Item
Language:English
English
Published: IEEE 2014
Subjects:
Online Access:http://irep.iium.edu.my/42896/
http://irep.iium.edu.my/42896/
http://irep.iium.edu.my/42896/
http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
id iium-42896
recordtype eprints
spelling iium-428962017-09-20T01:07:10Z http://irep.iium.edu.my/42896/ Using regular expressions for mining data in large software repositories Awang Abu Bakar, Normi Sham T Technology (General) The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool, known as OSSGrab. We developed the mining tool using Python scripting, in combination with Regex, and as a result, the time spent on data collection can be saved significantly. IEEE 2014 Conference or Workshop Item PeerReviewed application/pdf en http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf application/pdf en http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf Awang Abu Bakar, Normi Sham (2014) Using regular expressions for mining data in large software repositories. In: 2014 The 5th International Conference on Information and Communication Technology for The Muslim World (ICT4M), 17th-18th November 2014, Kuching, Sarawak, Malaysia. http://ieeexplore.ieee.org/document/7020649/ 10.1109/ICT4M.2014.7020649
repository_type Digital Repository
institution_category Local University
institution International Islamic University Malaysia
building IIUM Repository
collection Online Access
language English
English
topic T Technology (General)
spellingShingle T Technology (General)
Awang Abu Bakar, Normi Sham
Using regular expressions for mining data in large software repositories
description The usage of data mining technique in collecting data from software repositories involves the extraction of both basic and value-added information from existing software repositories. Regular Expressions (Regex) provide a mechanism to select specific strings from a set of character strings. In this paper, we discuss how regular expressions are used to create a data mining tool, known as OSSGrab. We developed the mining tool using Python scripting, in combination with Regex, and as a result, the time spent on data collection can be saved significantly.
format Conference or Workshop Item
author Awang Abu Bakar, Normi Sham
author_facet Awang Abu Bakar, Normi Sham
author_sort Awang Abu Bakar, Normi Sham
title Using regular expressions for mining data in large software repositories
title_short Using regular expressions for mining data in large software repositories
title_full Using regular expressions for mining data in large software repositories
title_fullStr Using regular expressions for mining data in large software repositories
title_full_unstemmed Using regular expressions for mining data in large software repositories
title_sort using regular expressions for mining data in large software repositories
publisher IEEE
publishDate 2014
url http://irep.iium.edu.my/42896/
http://irep.iium.edu.my/42896/
http://irep.iium.edu.my/42896/
http://irep.iium.edu.my/42896/6/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
http://irep.iium.edu.my/42896/7/42896-Using%20Regular%20Expressions%20for%20Mining%20Data%20in%20Large.pdf
first_indexed 2023-09-18T21:01:07Z
last_indexed 2023-09-18T21:01:07Z
_version_ 1777410629989040128