Parallel associative classification data mining frameworks based mapreduce

dc.contributor.authorThabtah, Fadi
dc.contributor.authorHammoud, Suhel
dc.contributor.authorAbdel-Jaber, Hussein
dc.date.accessioned2020-02-13T09:13:02Z
dc.date.available2020-02-13T09:13:02Z
dc.date.copyright2015en_US
dc.date.issued2015-06
dc.descriptionThis article is not available at CUD collection. The version of scholarly record of this Article is published in Parallel Processing Letters (2015), available online at: https://doi.org/10.1142/S0129626415500024.en_US
dc.description.abstractAssociative classification (AC) is a research topic that integrates association rules with classification in data mining to build classifiers. After dissemination of the Classification-based Association Rule algorithm (CBA), the majority of its successors have been developed to improve either CBA's prediction accuracy or the search for frequent ruleitems in the rule discovery step. Both of these steps require high demands in processing time and memory especially in cases of large training data sets or a low minimum support threshold value. In this paper, we overcome the problem of mining large training data sets by proposing a new learning method that repeatedly transforms data between line and item spaces to quickly discover frequent ruleitems, generate rules, subsequently rank and prune rules. This new learning method has been implemented in a parallel Map-Reduce (MR) algorithm called MRMCAR which can be considered the first parallel AC algorithm in the literature. The new learning method can be utilised in the different steps within any AC or association rule mining algorithms which scales well if contrasted with current horizontal or vertical methods. Two versions of the learning method (Weka, Hadoop) have been implemented and a number of experiments against different data sets have been conducted. The ground bases of the comparisons are classification accuracy and time required by the algorithm for data initialization, frequent ruleitems discovery, rule generation and rule pruning. The results reveal that MRMCAR is superior to both current AC mining algorithms and rule based classification algorithms in improving the classification performance with respect to accuracy. © 2015 World Scientific Publishing Company.en_US
dc.identifier.citationThabtah, F., Hammoud, S., & Abdel-Jaber, H. (2015). Parallel associative classification data mining frameworks based mapreduce. Parallel Processing Letters, 25(2). https://doi.org/10.1142/S0129626415500024en_US
dc.identifier.issn01296264
dc.identifier.urihttp://dx.doi.org/10.1142/S0129626415500024
dc.identifier.urihttp://hdl.handle.net/20.500.12519/139
dc.language.isoenen_US
dc.publisherWorld Scientific Publishing Co. Pte Ltden_US
dc.relationAuthors Affiliations: Thabtah, F., E-Business Dept, Canadian University of Dubai, Dubai, United Arab Emirates; Hammoud, S., Electronic and Computing Dept, Brunel University, Uxbridge, United Kingdom; Abdel-Jaber, H., Faculty of Computer Studies, Arab Open University, Saudi Arabia
dc.relationAuthors Affiliations: Thabtah, F., E-Business Dept, Canadian University of Dubai, Dubai, United Arab Emirates; Hammoud, S., Electronic and Computing Dept, Brunel University, Uxbridge, United Kingdom; Abdel-Jaber, H., Faculty of Computer Studies, Arab Open University, Saudi Arabia
dc.relation.ispartofseriesParallel Processing Letters;Vol. 25, no. 2
dc.rightsPermission to reuse the abstract has been secured from World Scientific Publishing Co. Pte Ltd.
dc.rightsPermission to reuse the abstract has been secured from World Scientific Publishing Co. Pte Ltd.
dc.rights.holderCopyroght : 2015 World Scientific Publishing Company
dc.rights.holderCopyright : 2015 World Scientific Publishing Company
dc.subjectAlgorithmsen_US
dc.subjectAssociation rulesen_US
dc.subjectClassification (of information)en_US
dc.subjectLearning algorithmsen_US
dc.subjectLearning systemsen_US
dc.subjectAssociative classificationen_US
dc.subjectDistributed tasksen_US
dc.subjectHadoopen_US
dc.subjectMap-reduceen_US
dc.subjectParallel miningsen_US
dc.subjectData miningen_US
dc.titleParallel associative classification data mining frameworks based mapreduceen_US
dc.typeArticleen_US
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Access Instruction 139.pdf
Size:
55.8 KB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description:
Collections