Please use this identifier to cite or link to this item:
http://ir.lib.seu.ac.lk/handle/123456789/3279
Title: | An efficient way of using wrappers in big data classification |
Authors: | Fajila, M.N.F. |
Keywords: | Big data, Classification, Dimensionality reduction, Microarray, Wrapper |
Issue Date: | 28-Nov-2017 |
Publisher: | Faculty of Applied Science, South Eastern University of Sri Lanka |
Abstract: | Data volumes grow dramatically over time, and the value of that data pushes scientists to find patterns that make efficient use of high-dimensional data. Dimensionality reduction is an essential technique in data science when handling big data. Although new techniques are continually being introduced, applying the right technique at the right stage remains challenging. One such technique is the wrapper method for machine learning. Feature selection plays a major role in the classification of big data. A feature can be more informative in the presence of another feature, so no feature should be removed without being assessed. Wrappers evaluate the possible combinations of features and return the most informative subset, the one that classifies the data with the highest accuracy. Compared to filters, however, wrappers are much slower and consume a huge amount of time when applied to big data. Therefore, in the proposed approach, the wrapper is applied after a filter in order to reduce the computational cost. The approach uses a gain ratio filter followed by the classifier subset evaluator, a wrapper for feature subset selection. The proposed technique is validated and evaluated on two high-dimensional microarray data sets: a lung cancer data set and a breast cancer data set. It achieved 97.10% accuracy (with only two misclassifications) on the lung cancer data set and 78.78% accuracy on the breast cancer data set. Thus, the results indicate that the proposed approach is efficient in terms of both accuracy and computational time. |
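The filter-then-wrapper idea in the abstract can be sketched in a few lines of Python. This is only an illustrative sketch, not the author's implementation: the abstract does not name the classifier or the search strategy, so the leave-one-out 1-nearest-neighbour scorer and the greedy forward search below are assumptions, as are all function names, the `keep` parameter, the requirement that features are already discretized, and the toy data.

```python
import math
from collections import Counter

def entropy(values):
    """Shannon entropy (in bits) of a list of discrete values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())

def gain_ratio(column, labels):
    """Information gain of a discrete feature divided by its split information."""
    n = len(labels)
    groups = {}
    for value, label in zip(column, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / n * entropy(g) for g in groups.values())
    split_info = entropy(column)
    if split_info == 0:  # constant feature carries no information
        return 0.0
    return (entropy(labels) - conditional) / split_info

def loo_accuracy(rows, labels, subset):
    """Leave-one-out accuracy of a 1-NN classifier (Hamming distance) on `subset`.
    Assumed stand-in for whatever classifier the wrapper actually uses."""
    correct = 0
    for i, row in enumerate(rows):
        nearest = min((j for j in range(len(rows)) if j != i),
                      key=lambda j: sum(row[f] != rows[j][f] for f in subset))
        correct += labels[nearest] == labels[i]
    return correct / len(rows)

def filter_then_wrap(rows, labels, keep):
    """Filter step: keep the `keep` features with the highest gain ratio.
    Wrapper step: greedy forward search over the survivors only."""
    ranked = sorted(range(len(rows[0])),
                    key=lambda f: gain_ratio([r[f] for r in rows], labels),
                    reverse=True)[:keep]
    selected, best = [], 0.0
    improved = True
    while improved:
        improved = False
        for f in ranked:
            if f in selected:
                continue
            score = loo_accuracy(rows, labels, selected + [f])
            if score > best:  # remember the best single addition this round
                best, candidate, improved = score, f, True
        if improved:
            selected.append(candidate)
    return selected, best

# Hypothetical toy data: feature 0 tracks the label, features 1 and 2 are noise.
ROWS = [[0, 0, 1], [0, 1, 0], [0, 0, 0], [1, 1, 1], [1, 0, 0], [1, 1, 0]]
LABELS = [0, 0, 0, 1, 1, 1]

if __name__ == "__main__":
    print(filter_then_wrap(ROWS, LABELS, keep=2))
```

The point of the design is that the expensive step, which retrains and scores a classifier for every candidate subset, only ever sees the `keep` features that survive the cheap ranking. One caveat of any such pipeline is that a feature which is informative only in combination with another may score poorly on its own and be dropped by the filter before the wrapper can assess it.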
URI: | http://ir.lib.seu.ac.lk/handle/123456789/3279 |
ISBN: | 9789556271232 |
Appears in Collections: | ASRS - FAS 2017 |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
ASRS 2017 03....pdf | | 9.35 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.