Hadoop+: A Big-data Programming Framework for Heterogeneous Computing Environments

doi:10.12146/j.issn.2095-3135.201603008

Home > Archive>Volume 5, Issue 3, 2016 >60-71. DOI:10.12146/j.issn.2095-3135.201603008

Hadoop+: A Big-data Programming Framework for Heterogeneous Computing Environments
DOI:
                        10.12146/j.issn.2095-3135.201603008
                    
CSTR:
                        
Author:
                        
Affiliation:
Clc Number:
Fund Project:

Article

Figures

Metrics

Reference

Cited by

Materials

Comments

Abstract:

The rapid development of Internet and Internet of Things opens the era of big data. Currently, heterogeneous architectures are being widely adopted in large-scale datacenters, for the sake of performance improvement and reduction of energy consumption. This paper presents the design and implementation of Hadoop+, a programming framework that implements MapReduce and enables invocation of parallelized CUDA/OpenCL within a map/reduce task, and helps the user by taking advantage of a heterogeneous task model. Experimental result shows that Hadoop+ attains 1.4× to 16.1× speedups over Hadoop for five commonly used machine learning algorithms. Coupled with a heterogeneous task model that helps allocate computing resouces, Hadoop+ brings a 36.0% improvement in data processing speed for single-application workloads, and for mixed workloads of multiple applications, the execution time is reduced by up to 36.9% with an average 17.6%.

Reference

Cited by

Get Citation

HE Wenting, CUI Huimin, FENG Xiaobing. Hadoop+: A Big-data Programming Framework for Heterogeneous Computing Environments[J]. Journal of Integration Technology,2016,5(3):60-71

Copy

Article Metrics

Abstract:
PDF:
HTML:
Cited by:

History

Received:
Revised:
Adopted:
Online: May 31,2016
Published:

Home

About Journal

Editorial Team

Author Center

Peer Review

Reader Center

Ethics

Contact us

中文

Get Citation

Share

Article Metrics

History

Article QR Code