<?xml version="1.0" encoding="ISO-8859-1" ?><?xml-stylesheet type="text/xsl" href="results.xsl" ?><root><item>
  <title>MapReduce Optimization Using Dynamic Regulated Prioritization</title>
  <link>http://www.hpl.hp.com/personal/Thomas_Sandholm/sandholm2009a.pdf</link>
  <minidescription>How to run parallel jobs more efficiently in a multi-user cluster</minidescription>
  <description>We present a system for allocating resources in shared data and
compute clusters that improves MapReduce job scheduling in three
ways. First, the system uses regulated and user-assigned priorities
to offer different service levels to jobs and users over time. Second,
the system dynamically adjusts resource allocations to fit the
requirements of different job stages. Finally, the system automatically
detects and eliminates bottlenecks within a job. We show
experimentally using real applications that users can optimize not
only job execution time but also the cost-benefit ratio or prioritization
efficiency of a job using these three strategies. Our approach
relies on a proportional share mechanism that continuously allocates
virtual machine resources. Our experimental results show a
11-31% improvement in completion time and 4-187% improvement
in prioritization efficiency for different classes of MapReduce
jobs. We further show that delay intolerant users gain even more
from our system.</description>
  <author>Thomas Sandholm and Kevin Lai</author>
  <pubDate>2009-06-23 09:26:00</pubDate>
  <tags>
    <tag>tycoon</tag>
    <tag>mapreduce</tag>
  </tags>
</item>
</root>