Single-linkage method to detect multiple outliers with different outlier scenarios in circular regression model

Single-linkage is one of the algorithms in agglomerative clustering technique that can be used to detect outliers. The single-linkage algorithm combines two clusters with the closest pair of observations. Then, the clusters are combined into larger clusters, until all the observations are formed in...

Full description

Bibliographic Details
Main Authors: Siti Zanariah, Satari, Nur Faraidah, Muhammad Di, Roslinazairimah, Zakaria
Format: Conference or Workshop Item
Language:English
English
Published: Universiti Malaysia Pahang 2018
Subjects:
Online Access:http://umpir.ump.edu.my/id/eprint/21971/
http://umpir.ump.edu.my/id/eprint/21971/1/22.%20Single-linkage%20method%20to%20detect%20multiple%20outliers%20with%20different.pdf
http://umpir.ump.edu.my/id/eprint/21971/2/22.1%20Single-linkage%20method%20to%20detect%20multiple%20outliers%20with%20different.pdf
Description
Summary:Single-linkage is one of the algorithms in agglomerative clustering technique that can be used to detect outliers. The single-linkage algorithm combines two clusters with the closest pair of observations. Then, the clusters are combined into larger clusters, until all the observations are formed in the same cluster. In this study, a single-linkage algorithm method that utilised a circular distance based on the City-block distance as the similarity distance is used. The performance of the method in detecting multiple outliers for a circular regression model is tested via simulation studies with three different outlier scenarios which are outliers in u-space only, v-space only and both uv-space. The performance is measured by calculating the “success” probability (pout), masking error (pmask) and swamping error (pswamp) for both outlier scenarios. It is found that the single linkage method performed well in detecting outliers for both outlier scenarios and applicable for circular regression model.