Skip to main content

Table 3 Application characteristics

From: TMaR: a two-stage MapReduce scheduler for heterogeneous environments

Job

Description

CPU/IO-intensive

Shuffle-light/heavy

Wordcount

Counts the occurrence of each word in the input data

CPU-intensive

Shuffle-heavy

K-means

A clustering analysis algorithm for multi-dimensional numerical samples in data mining

CPU-intensive

Shuffle-light

TeraSort

A popular benchmark to sort one terabyte of randomly distributed data

IO-intensive

Shuffle-heavy

Grep

Counts the number of occurrences of strings matching the target in a text file

IO-intensive

Shuffle-light