14 سال پیش · 0dc3968345
--- a/CHANGES.txt
+++ b/CHANGES.txt
@@ -2,10 +2,10 @@ Hadoop Change Log
 
				 
			
 
				 Release 0.20.203.0 - unreleased
			
 
				 
			
 
				+    MAPREDUCE-2316. Updated CapacityScheduler documentation. (acmurthy) 
			
 
				+
			
 
				     HADOOP-7243. Fix contrib unit tests missing dependencies. (omalley)
			
 
				 
			
 
				-    MAPREDUCE-2355. Add a dampner to out-of-band heartbeats. (acmurthy) 
			
 
				- 
			
 
				     HADOOP-7190. Add metrics v1 back for backwards compatibility. (omalley)
			
 
				 
			
 
				     MAPREDUCE-2360. Remove stripping of scheme, authority from submit dir in 
			
--- a/src/docs/src/documentation/content/xdocs/capacity_scheduler.xml
+++ b/src/docs/src/documentation/content/xdocs/capacity_scheduler.xml
@@ -20,7 +20,7 @@
 
				 <document>
			
 
				   
			
 
				   <header>
			
 
				-    <title>Capacity Scheduler Guide</title>
			
 
				+    <title>CapacityScheduler Guide</title>
			
 
				   </header>
			
 
				   
			
 
				   <body>
			
@@ -28,91 +28,125 @@
 
				     <section>
			
 
				       <title>Purpose</title>
			
 
				       
			
 
				-      <p>This document describes the Capacity Scheduler, a pluggable 
			
 
				-      MapReduce scheduler for Hadoop which provides a way to share 
			
 
				-      large clusters.</p>
			
 
				+      <p>This document describes the CapacityScheduler, a pluggable 
			
 
				+      MapReduce scheduler for Hadoop which allows for multiple-tenants to 
			
 
				+      securely share a large cluster such that their applications are allocated
			
 
				+      resources in a timely manner under constraints of allocated capacities.
			
 
				+      </p>
			
 
				+    </section>
			
 
				+    
			
 
				+    <section>
			
 
				+      <title>Overview</title>
			
 
				+     
			
 
				+      <p>The CapacityScheduler is designed to run Hadoop Map-Reduce as a 
			
 
				+      shared, multi-tenant cluster in an operator-friendly manner while 
			
 
				+      maximizing the throughput and the utilization of the cluster while
			
 
				+      running Map-Reduce applications. </p>
			
 
				+     
			
 
				+      <p>Traditionally each organization has it own private set of compute 
			
 
				+      resources that have sufficient capacity to meet the organization's SLA 
			
 
				+      under peak or near peak conditions. This generally leads to poor average 
			
 
				+      utilization and the overhead of managing multiple independent clusters, 
			
 
				+      one per each organization. Sharing clusters between organizations is a 
			
 
				+      cost-effective manner of running large Hadoop installations since this 
			
 
				+      allows them to reap benefits of economies of scale without creating 
			
 
				+      private clusters.  However, organizations are concerned about sharing a 
			
 
				+      cluster because they are worried about others using the resources that 
			
 
				+      are critical for their SLAs.</p> 
			
 
				+
			
 
				+      <p>The CapacityScheduler is designed to allow sharing a large cluster 
			
 
				+      while giving  each organization a minimum capacity guarantee. The central 
			
 
				+      idea is that the available resources in the Hadoop Map-Reduce cluster are 
			
 
				+      partitioned among multiple organizations who collectively fund the 
			
 
				+      cluster based on computing needs. There is an added benefit that an 
			
 
				+      organization can access any excess capacity no being used by others. This 
			
 
				+      provides elasticity for the organizations in a cost-effective manner.</p> 
			
 
				+
			
 
				+      <p>Sharing clusters across organizations necessitates strong support for
			
 
				+      multi-tenancy since each organization must be guaranteed capacity and 
			
 
				+      safe-guards to ensure the shared cluster is impervious to single rouge 
			
 
				+      job or user. The CapacityScheduler provides a stringent set of limits to 
			
 
				+      ensure that a single job or user or queue cannot consume dispropotionate 
			
 
				+      amount of resources in the cluster. Also, the JobTracker of the cluster,  
			
 
				+      in particular, is a precious resource and the CapacityScheduler provides 
			
 
				+      limits on initialized/pending tasks and jobs from a single user and queue 
			
 
				+      to ensure fairness and stability of the cluster.</p> 
			
 
				+
			
 
				+      <p>The primary abstraction provided by the CapacityScheduler is the 
			
 
				+      concept of <em>queues</em>. These queues are typically setup by administrators
			
 
				+      to reflect the economics of the shared cluster.</p>
			
 
				     </section>
			
 
				     
			
 
				     <section>
			
 
				       <title>Features</title>
			
 
				       
			
 
				-      <p>The Capacity Scheduler supports the following features:</p> 
			
 
				+      <p>The CapacityScheduler supports the following features:</p> 
			
 
				       <ul>
			
 
				         <li>
			
 
				-          Support for multiple queues, where a job is submitted to a queue.
			
 
				+          Capacity Guarantees - Support for multiple queues, where a job is 
			
 
				+          submitted to a queue.Queues are allocated a fraction of the capacity 
			
 
				+          of the grid in the sense that a certain capacity of resources will be 
			
 
				+          at their disposal. All jobs submitted to a queue will have access to 
			
 
				+          the capacity allocated to the queue. Adminstrators can configure soft 
			
 
				+          limits and optional hard limits on the capacity allocated to each queue. 
			
 
				         </li>
			
 
				         <li>
			
 
				-          Queues are allocated a fraction of the capacity of the grid in the 
			
 
				-          sense that a certain capacity of resources will be at their 
			
 
				-          disposal. All jobs submitted to a queue will have access to the 
			
 
				-          capacity allocated to the queue.
			
 
				+          Security - Each queue has strict ACLs which controls which users can 
			
 
				+          submit jobs to individual queues. Also, there are safe-guards to 
			
 
				+          ensure that users cannot view and/or modify jobs from other users if
			
 
				+          so desired. Also, per-queue and system administrator roles are 
			
 
				+          supported.
			
 
				         </li>
			
 
				         <li>
			
 
				-          Free resources can be allocated to any queue beyond it's capacity. 
			
 
				-          When there is demand for these resources from queues running below 
			
 
				-          capacity at a future point in time, as tasks scheduled on these 
			
 
				+          Elasticity - Free resources can be allocated to any queue beyond it's 
			
 
				+          capacity. When there is demand for these resources from queues running 
			
 
				+          below capacity at a future point in time, as tasks scheduled on these 
			
 
				           resources complete, they will be assigned to jobs on queues 
			
 
				-          running below the capacity.
			
 
				+          running below the capacity. This ensures that resources are available 
			
 
				+          in a predictable and elastic manner to queues, thus preventing 
			
 
				+          artifical silos of resources in the cluster which helps utilization.
			
 
				         </li>
			
 
				         <li>
			
 
				-          Queues optionally support job priorities (disabled by default).
			
 
				+          Multi-tenancy - Comprehensive set of limits are provided to prevent 
			
 
				+          a single job, user and queue from monpolizing resources of the queue 
			
 
				+          or the cluster as a whole to ensure that the system, particularly the 
			
 
				+          JobTracker, isn't overwhelmed by too many tasks or jobs. 
			
 
				         </li>
			
 
				         <li>
			
 
				-          Within a queue, jobs with higher priority will have access to the 
			
 
				-          queue's resources before jobs with lower priority. However, once a 
			
 
				-          job is running, it will not be preempted for a higher priority job,
			
 
				-          though new tasks from the higher priority job will be 
			
 
				-          preferentially scheduled.
			
 
				+          Operability - The queue definitions and properties can be changed, 
			
 
				+          at runtime, by administrators in a secure manner to minimize 
			
 
				+          disruption to users. Also, a console is provided for users and 
			
 
				+          administrators to view current allocation of resources to various 
			
 
				+          queues in the system.
			
 
				         </li>
			
 
				         <li>
			
 
				-          In order to prevent one or more users from monopolizing its 
			
 
				-          resources, each queue enforces a limit on the percentage of 
			
 
				-          resources allocated to a user at any given time, if there is 
			
 
				-          competition for them.  
			
 
				+          Resource-based Scheduling - Support for resource-intensive jobs, 
			
 
				+          wherein a job can optionally specify higher resource-requirements than 
			
 
				+          the default, there-by accomodating applications with differing resource
			
 
				+          requirements. Currently, memory is the the resource requirement 
			
 
				+          supported.
			
 
				         </li>
			
 
				         <li>
			
 
				-          Support for memory-intensive jobs, wherein a job can optionally 
			
 
				-          specify higher memory-requirements than the default, and the tasks 
			
 
				-          of the job will only be run on TaskTrackers that have enough memory 
			
 
				-          to spare.
			
 
				+          Job Priorities - Queues optionally support job priorities (disabled 
			
 
				+          by default). Within a queue, jobs with higher priority will have 
			
 
				+          access to the queue's resources before jobs with lower priority. 
			
 
				+          However, once a job is running, it will not be preempted for a higher 
			
 
				+          priority job, <em>premption</em> is on the roadmap is currently not 
			
 
				+          supported.
			
 
				         </li>
			
 
				       </ul>
			
 
				     </section>
			
 
				     
			
 
				-    <section>
			
 
				-      <title>Picking a task to run</title>
			
 
				-      
			
 
				-      <p>Note that many of these steps can be, and will be, enhanced over time
			
 
				-      to provide better algorithms.</p>
			
 
				-      
			
 
				-      <p>Whenever a TaskTracker is free, the Capacity Scheduler picks 
			
 
				-      a queue which has most free space (whose ratio of # of running slots to 
			
 
				-      capacity is the lowest).</p>
			
 
				-      
			
 
				-      <p>Once a queue is selected, the Scheduler picks a job in the queue. Jobs
			
 
				-      are sorted based on when they're submitted and their priorities (if the 
			
 
				-      queue supports priorities). Jobs are considered in order, and a job is 
			
 
				-      selected if its user is within the user-quota for the queue, i.e., the 
			
 
				-      user is not already using queue resources above his/her limit. The 
			
 
				-      Scheduler also makes sure that there is enough free memory in the 
			
 
				-      TaskTracker to tun the job's task, in case the job has special memory
			
 
				-      requirements.</p>
			
 
				-      
			
 
				-      <p>Once a job is selected, the Scheduler picks a task to run. This logic 
			
 
				-      to pick a task remains unchanged from earlier versions.</p> 
			
 
				-      
			
 
				-    </section>
			
 
				-    
			
 
				     <section>
			
 
				       <title>Installation</title>
			
 
				       
			
 
				-        <p>The Capacity Scheduler is available as a JAR file in the Hadoop
			
 
				+        <p>The CapacityScheduler is available as a JAR file in the Hadoop
			
 
				         tarball under the <em>contrib/capacity-scheduler</em> directory. The name of 
			
 
				         the JAR file would be on the lines of hadoop-*-capacity-scheduler.jar.</p>
			
 
				         <p>You can also build the Scheduler from source by executing
			
 
				         <em>ant package</em>, in which case it would be available under
			
 
				         <em>build/contrib/capacity-scheduler</em>.</p>
			
 
				-        <p>To run the Capacity Scheduler in your Hadoop installation, you need 
			
 
				+        <p>To run the CapacityScheduler in your Hadoop installation, you need 
			
 
				         to put it on the <em>CLASSPATH</em>. The easiest way is to copy the 
			
 
				         <code>hadoop-*-capacity-scheduler.jar</code> from 
			
 
				         to <code>HADOOP_HOME/lib</code>. Alternatively, you can modify 
			
@@ -124,9 +158,9 @@
 
				       <title>Configuration</title>
			
 
				 
			
 
				       <section>
			
 
				-        <title>Using the Capacity Scheduler</title>
			
 
				+        <title>Using the CapacityScheduler</title>
			
 
				         <p>
			
 
				-          To make the Hadoop framework use the Capacity Scheduler, set up
			
 
				+          To make the Hadoop framework use the CapacityScheduler, set up
			
 
				           the following property in the site configuration:</p>
			
 
				           <table>
			
 
				             <tr>
			
@@ -144,14 +178,22 @@
 
				         <title>Setting up queues</title>
			
 
				         <p>
			
 
				           You can define multiple queues to which users can submit jobs with
			
 
				-          the Capacity Scheduler. To define multiple queues, you should edit
			
 
				-          the site configuration for Hadoop and modify the
			
 
				-          <em>mapred.queue.names</em> property.
			
 
				+          the CapacityScheduler. To define multiple queues, you should use the  
			
 
				+          <em>mapred.queue.names</em> property in 
			
 
				+          <code>conf/hadoop-site.xml</code>.
			
 
				         </p>
			
 
				+        
			
 
				+        <p>
			
 
				+          The CapacityScheduler can be configured with several properties
			
 
				+          for each queue that control the behavior of the Scheduler. This
			
 
				+          configuration is in the <em>conf/capacity-scheduler.xml</em>.
			
 
				+        </p>
			
 
				+        
			
 
				         <p>
			
 
				           You can also configure ACLs for controlling which users or groups
			
 
				-          have access to the queues.
			
 
				+          have access to the queues in <code>conf/mapred-queue-acls.xml</code>.
			
 
				         </p>
			
 
				+        
			
 
				         <p>
			
 
				           For more details, refer to
			
 
				           <a href="cluster_setup.html#Configuring+the+Hadoop+Daemons">Cluster 
			
@@ -160,25 +202,12 @@
 
				       </section>
			
 
				   
			
 
				       <section>
			
 
				-        <title>Configuring properties for queues</title>
			
 
				+        <title>Queue properties</title>
			
 
				 
			
 
				-        <p>The Capacity Scheduler can be configured with several properties
			
 
				-        for each queue that control the behavior of the Scheduler. This
			
 
				-        configuration is in the <em>conf/capacity-scheduler.xml</em>. By
			
 
				-        default, the configuration is set up for one queue, named 
			
 
				-        <em>default</em>.</p>
			
 
				-        <p>To specify a property for a queue that is defined in the site
			
 
				-        configuration, you should use the property name as
			
 
				-        <em>mapred.capacity-scheduler.queue.&lt;queue-name&gt;.&lt;property-name&gt;</em>.
			
 
				-        </p>
			
 
				-        <p>For example, to define the property <em>capacity</em>
			
 
				-        for queue named <em>research</em>, you should specify the property
			
 
				-        name as 
			
 
				-        <em>mapred.capacity-scheduler.queue.research.capacity</em>.
			
 
				-        </p>
			
 
				-
			
 
				-        <p>The properties defined for queues and their descriptions are
			
 
				-        listed in the table below:</p>
			
 
				+        <section>
			
 
				+        <title>Resource allocation</title>
			
 
				+        <p>The properties defined for resource allocations to queues and their 
			
 
				+        descriptions are listed in below:</p>
			
 
				 
			
 
				         <table>
			
 
				           <tr><th>Name</th><th>Description</th></tr>
			
@@ -187,25 +216,8 @@
 
				             to be available for jobs in this queue. The sum of capacities 
			
 
				             for all queues should be less than or equal 100.</td>
			
 
				           </tr>
			
 
				-          <tr><td>mapred.capacity-scheduler.queue.&lt;queue-name&gt;.supports-priority</td>
			
 
				-          	<td>If true, priorities of jobs will be taken into account in scheduling 
			
 
				-          	decisions.</td>
			
 
				-          </tr>
			
 
				-          <tr><td>mapred.capacity-scheduler.queue.&lt;queue-name&gt;.minimum-user-limit-percent</td>
			
 
				-          	<td>Each queue enforces a limit on the percentage of resources 
			
 
				-          	allocated to a user at any given time, if there is competition 
			
 
				-          	for them. This user limit can vary between a minimum and maximum 
			
 
				-          	value. The former depends on the number of users who have submitted
			
 
				-          	jobs, and the latter is set to this property value. For example, 
			
 
				-          	suppose the value of this property is 25. If two users have 
			
 
				-          	submitted jobs to a queue, no single user can use more than 50% 
			
 
				-          	of the queue resources. If a third user submits a job, no single 
			
 
				-          	user can use more than 33% of the queue resources. With 4 or more 
			
 
				-          	users, no user can use more than 25% of the queue's resources. A 
			
 
				-          	value of 100 implies no user limits are imposed.</td>
			
 
				-          </tr>
			
 
				           <tr><td>mapred.capacity-scheduler.queue.&lt;queue-name&gt;.maximum-capacity</td>
			
 
				-          	<td>
			
 
				+            <td>
			
 
				                   maximum-capacity defines a limit beyond which a queue cannot
			
 
				                   use the capacity of the cluster.This provides a means to limit
			
 
				                   how much excess capacity a queue can use. By default, there
			
@@ -228,137 +240,175 @@
 
				                   absolute terms would increase accordingly.
			
 
				                 </td>
			
 
				           </tr>
			
 
				-        </table>
			
 
				-      </section>
			
 
				-      
			
 
				-      <section>
			
 
				-        <title>Memory management</title>
			
 
				-      
			
 
				-        <p>The Capacity Scheduler supports scheduling of tasks on a
			
 
				-        <code>TaskTracker</code>(TT) based on a job's memory requirements
			
 
				-        and the availability of RAM and Virtual Memory (VMEM) on the TT node.
			
 
				-        See the <a href="mapred_tutorial.html#Memory+monitoring"> 
			
 
				-        MapReduce Tutorial</a> for details on how the TT monitors
			
 
				-        memory usage.</p>
			
 
				-        <p>Currently the memory based scheduling is only supported
			
 
				-        in Linux platform.</p>
			
 
				-        <p>Memory-based scheduling works as follows:</p>
			
 
				-        <ol>
			
 
				-          <li>The absence of any one or more of three config parameters 
			
 
				-          or -1 being set as value of any of the parameters, 
			
 
				-          <code>mapred.tasktracker.vmem.reserved</code>, 
			
 
				-          <code>mapred.task.default.maxvmem</code>, or
			
 
				-          <code>mapred.task.limit.maxvmem</code>, disables memory-based
			
 
				-          scheduling, just as it disables memory monitoring for a TT. These
			
 
				-          config parameters are described in the 
			
 
				-          <a href="mapred_tutorial.html#Memory+monitoring">MapReduce 
			
 
				-          Tutorial</a>. The value of  
			
 
				-          <code>mapred.tasktracker.vmem.reserved</code> is 
			
 
				-          obtained from the TT via its heartbeat. 
			
 
				-          </li>
			
 
				-          <li>If all the three mandatory parameters are set, the Scheduler 
			
 
				-          enables VMEM-based scheduling. First, the Scheduler computes the free
			
 
				-          VMEM on the TT. This is the difference between the available VMEM on the
			
 
				-          TT (the node's total VMEM minus the offset, both of which are sent by 
			
 
				-          the TT on each heartbeat)and the sum of VMs already allocated to 
			
 
				-          running tasks (i.e., sum of the VMEM task-limits). Next, the Scheduler
			
 
				-          looks at the VMEM requirements for the job that's first in line to 
			
 
				-          run. If the job's VMEM requirements are less than the available VMEM on 
			
 
				-          the node, the job's task can be scheduled. If not, the Scheduler 
			
 
				-          ensures that the TT does not get a task to run (provided the job 
			
 
				-          has tasks to run). This way, the Scheduler ensures that jobs with 
			
 
				-          high memory requirements are not starved, as eventually, the TT 
			
 
				-          will have enough VMEM available. If the high-mem job does not have 
			
 
				-          any task to run, the Scheduler moves on to the next job. 
			
 
				-          </li>
			
 
				-          <li>In addition to VMEM, the Capacity Scheduler can also consider 
			
 
				-          RAM on the TT node. RAM is considered the same way as VMEM. TTs report
			
 
				-          the total RAM available on their node, and an offset. If both are
			
 
				-          set, the Scheduler computes the available RAM on the node. Next, 
			
 
				-          the Scheduler figures out the RAM requirements of the job, if any. 
			
 
				-          As with VMEM, users can optionally specify a RAM limit for their job
			
 
				-          (<code>mapred.task.maxpmem</code>, described in the MapReduce 
			
 
				-          Tutorial). The Scheduler also maintains a limit for this value 
			
 
				-          (<code>mapred.capacity-scheduler.task.default-pmem-percentage-in-vmem</code>, 
			
 
				-          described below). All these three values must be set for the 
			
 
				-          Scheduler to schedule tasks based on RAM constraints.
			
 
				-          </li>
			
 
				-          <li>The Scheduler ensures that jobs cannot ask for RAM or VMEM higher
			
 
				-          than configured limits. If this happens, the job is failed when it
			
 
				-          is submitted. 
			
 
				-          </li>
			
 
				-        </ol>
			
 
				-        
			
 
				-        <p>As described above, the additional scheduler-based config 
			
 
				-        parameters are as follows:</p>
			
 
				-
			
 
				-        <table>
			
 
				-          <tr><th>Name</th><th>Description</th></tr>
			
 
				-          <tr><td>mapred.capacity-scheduler.task.default-pmem-percentage-in-vmem</td>
			
 
				-          	<td>A percentage of the default VMEM limit for jobs
			
 
				-          	(<code>mapred.task.default.maxvmem</code>). This is the default 
			
 
				-          	RAM task-limit associated with a task. Unless overridden by a 
			
 
				-          	job's setting, this number defines the RAM task-limit.</td>
			
 
				+          <tr><td>mapred.capacity-scheduler.queue.&lt;queue-name&gt;.minimum-user-limit-percent</td>
			
 
				+          	<td>Each queue enforces a limit on the percentage of resources 
			
 
				+          	allocated to a user at any given time, if there is competition 
			
 
				+          	for them. This user limit can vary between a minimum and maximum 
			
 
				+          	value. The former depends on the number of users who have submitted
			
 
				+          	jobs, and the latter is set to this property value. For example, 
			
 
				+          	suppose the value of this property is 25. If two users have 
			
 
				+          	submitted jobs to a queue, no single user can use more than 50% 
			
 
				+          	of the queue resources. If a third user submits a job, no single 
			
 
				+          	user can use more than 33% of the queue resources. With 4 or more 
			
 
				+          	users, no user can use more than 25% of the queue's resources. A 
			
 
				+          	value of 100 implies no user limits are imposed.</td>
			
 
				+          </tr>
			
 
				+          <tr><td>mapred.capacity-scheduler.queue.&lt;queue-name&gt;.user-limit-factor</td>
			
 
				+            <td>The multiple of the queue capacity which can be configured to 
			
 
				+              allow a single user to acquire more slots. By default this is set 
			
 
				+              to 1 which ensure that a single user can never take more than the 
			
 
				+              queue's configured capacity irrespective of how idle th cluster 
			
 
				+              is.</td>
			
 
				           </tr>
			
 
				-          <tr><td>mapred.capacity-scheduler.task.limit.maxpmem</td>
			
 
				-          <td>Configuration which provides an upper limit to maximum physical
			
 
				-           memory which can be specified by a job. If a job requires more 
			
 
				-           physical memory than what is specified in this limit then the same
			
 
				-           is rejected.</td>
			
 
				+          <tr><td>mapred.capacity-scheduler.queue.&lt;queue-name&gt;.supports-priority</td>
			
 
				+            <td>If true, priorities of jobs will be taken into account in scheduling 
			
 
				+            decisions.</td>
			
 
				           </tr>
			
 
				         </table>
			
 
				-      </section>
			
 
				+   </section>
			
 
				    <section>
			
 
				-        <title>Job Initialization Parameters</title>
			
 
				+        <title>Job initialization</title>
			
 
				         <p>Capacity scheduler lazily initializes the jobs before they are
			
 
				         scheduled, for reducing the memory footprint on jobtracker. 
			
 
				-        Following are the parameters, by which you can control the laziness
			
 
				-        of the job initialization. The following parameters can be 
			
 
				-        configured in capacity-scheduler.xml
			
 
				+        Following are the parameters, by which you can control the
			
 
				+        initialization of jobs per-queue.
			
 
				         </p>
			
 
				         
			
 
				         <table>
			
 
				           <tr><th>Name</th><th>Description</th></tr>
			
 
				           <tr>
			
 
				             <td>
			
 
				-              mapred.capacity-scheduler.queue.&lt;queue-name&gt;.maximum-initialized-jobs-per-user
			
 
				+              mapred.capacity-scheduler.maximum-system-jobs
			
 
				             </td>
			
 
				             <td>
			
 
				-              Maximum number of jobs which are allowed to be pre-initialized for
			
 
				-              a particular user in the queue. Once a job is scheduled, i.e. 
			
 
				-              it starts running, then that job is not considered
			
 
				-              while scheduler computes the maximum job a user is allowed to
			
 
				-              initialize. 
			
 
				+              Maximum number of jobs in the system which can be initialized,
			
 
				+              concurrently, by the CapacityScheduler.
			
 
				+              
			
 
				+              Individual queue limits on initialized jobs are directly 
			
 
				+              proportional to their queue capacities.
			
 
				             </td>
			
 
				           </tr>
			
 
				           <tr>
			
 
				             <td>
			
 
				-              mapred.capacity-scheduler.init-poll-interval
			
 
				+              mapred.capacity-scheduler.queue.&lt;queue-name&gt;.maximum-initialized-active-tasks
			
 
				             </td>
			
 
				             <td>
			
 
				-              Amount of time in miliseconds which is used to poll the scheduler
			
 
				-              job queue to look for jobs to be initialized.
			
 
				+              The maximum number of tasks, across all jobs in the queue, 
			
 
				+              which can be initialized concurrently. Once the queue's jobs 
			
 
				+              exceed this limit they will be queued on disk.             
			
 
				             </td>
			
 
				           </tr>
			
 
				           <tr>
			
 
				             <td>
			
 
				-              mapred.capacity-scheduler.init-worker-threads
			
 
				+              mapred.capacity-scheduler.queue.&lt;queue-name&gt;.maximum-initialized-active-tasks-per-user
			
 
				             </td>
			
 
				             <td>
			
 
				-              Number of worker threads which would be used by Initialization
			
 
				-              poller to initialize jobs in a set of queue. If number mentioned 
			
 
				-              in property is equal to number of job queues then a thread is 
			
 
				-              assigned jobs from one queue. If the number configured is lesser than
			
 
				-              number of queues, then a thread can get jobs from more than one queue
			
 
				-              which it initializes in a round robin fashion. If the number configured
			
 
				-              is greater than number of queues, then number of threads spawned
			
 
				-              would be equal to number of job queues.
			
 
				+              The maximum number of tasks per-user, across all the of the
			
 
				+              user's jobs in the queue, which can be initialized concurrently. 
			
 
				+              Once the user's jobs exceed this limit they will be queued on disk.
			
 
				             </td>
			
 
				           </tr>
			
 
				+          <tr>
			
 
				+            <td> 
			
 
				+              mapred.capacity-scheduler.queue.&lt;queue-name&gt;.init-accept-jobs-factor
			
 
				+            </td>
			
 
				+            <td>
			
 
				+              The multipe of (maximum-system-jobs * queue-capacity) used to
			
 
				+              determine the number of jobs which are accepted by the scheduler. 
			
 
				+              The default value is 10. If number of jobs submitted to the queue
			
 
				+              exceeds this limit, job submission are rejected. 
			
 
				+            </td>
			
 
				+          </tr> 
			
 
				         </table>
			
 
				       </section>   
			
 
				+      </section>
			
 
				+      
			
 
				       <section>
			
 
				-        <title>Reviewing the configuration of the Capacity Scheduler</title>
			
 
				+        <title>Resource based scheduling</title>
			
 
				+      
			
 
				+        <p>The CapacityScheduler supports scheduling of tasks on a
			
 
				+        <code>TaskTracker</code>(TT) based on a job's memory requirements
			
 
				+        in terms of RAM and Virtual Memory (VMEM) on the TT node.
			
 
				+        A TT is conceptually composed of a fixed number of map and reduce
			
 
				+        slots with fixed slot size across the cluster. A job can ask for one
			
 
				+        or more slots for each of its component map and/or reduce slots. If a
			
 
				+        task consumes more memory than configured the TT forcibly kills the task.
			
 
				+        </p>
			
 
				+
			
 
				+        <p>Currently the memory based scheduling is only supported
			
 
				+        in Linux platform.</p>
			
 
				+        
			
 
				+        <p>Additional scheduler-based config 
			
 
				+        parameters are as follows:</p>
			
 
				+
			
 
				+        <table>
			
 
				+          <tr><th>Name</th><th>Description</th></tr>
			
 
				+          <tr>
			
 
				+            <td>mapred.cluster.map.memory.mb</td>
			
 
				+          	 <td>The size, in terms of virtual memory, of a single map slot
			
 
				+             in the Map-Reduce framework, used by the scheduler.
			
 
				+             A job can ask for multiple slots for a single map task via
			
 
				+             <code>mapred.job.map.memory.mb</code>, upto the limit specified by
			
 
				+             <code>mapred.cluster.max.map.memory.mb</code>, if the scheduler 
			
 
				+             supports the feature.
			
 
				+             The value of -1 indicates that this feature is turned off.
			
 
				+          	 </td>
			
 
				+          </tr>
			
 
				+          <tr>
			
 
				+            <td>mapred.cluster.reduce.memory.mb</td>
			
 
				+             <td>The size, in terms of virtual memory, of a single reduce slot
			
 
				+             in the Map-Reduce framework, used by the scheduler.
			
 
				+             A job can ask for multiple slots for a single reduce task via
			
 
				+             <code>mapred.job.reduce.memory.mb</code>, upto the limit specified by
			
 
				+             <code>mapred.cluster.max.reduce.memory.mb</code>, if the scheduler supports the 
			
 
				+             feature.The value of -1 indicates that this feature is turned off.
			
 
				+             </td>
			
 
				+          </tr>
			
 
				+          <tr>
			
 
				+            <td>mapred.cluster.max.map.memory.mb</td>
			
 
				+            <td>The maximum size, in terms of virtual memory, of a single map
			
 
				+            task launched by the Map-Reduce framework, used by the scheduler.
			
 
				+            A job can ask for multiple slots for a single map task via
			
 
				+            <code>mapred.job.map.memory.mb</code>, upto the limit specified by
			
 
				+            <code>mapred.cluster.max.map.memory.mb</code>, if the scheduler supports the 
			
 
				+            feature. The value of -1 indicates that this feature is turned off.
			
 
				+            </td>
			
 
				+          </tr>
			
 
				+          <tr>
			
 
				+            <td>mapred.cluster.max.reduce.memory.mb</td>
			
 
				+            <td>The maximum size, in terms of virtual memory, of a single reduce
			
 
				+            task launched by the Map-Reduce framework, used by the scheduler.
			
 
				+            A job can ask for multiple slots for a single reduce task via
			
 
				+            <code>mapred.job.reduce.memory.mb</code>, upto the limit specified by
			
 
				+            <code>mapred.cluster.max.reduce.memory.mb</code>, if the scheduler supports the 
			
 
				+            feature. The value of -1 indicates that this feature is turned off.
			
 
				+            </td>
			
 
				+          </tr>
			
 
				+          <tr>
			
 
				+            <td>mapred.job.map.memory.mb</td>
			
 
				+            <td>The size, in terms of virtual memory, of a single map task
			
 
				+            for the job. A job can ask for multiple slots for a single map task, 
			
 
				+            rounded up to the next multiple of <code>mapred.cluster.map.memory.mb</code> and 
			
 
				+            upto the limit specified by <code>mapred.cluster.max.map.memory.mb</code>, 
			
 
				+            if the scheduler supports the feature. The value of -1 indicates 
			
 
				+            that this feature is turned off iff <code>mapred.cluster.map.memory.mb</code> is 
			
 
				+            also turned off (-1).</td>
			
 
				+          </tr>
			
 
				+          <tr>
			
 
				+            <td>mapred.job.reduce.memory.mb</td>
			
 
				+            <td>The size, in terms of virtual memory, of a single reduce task
			
 
				+            for the job. A job can ask for multiple slots for a single reduce task, 
			
 
				+            rounded up to the next multiple of <code>mapred.cluster.reduce.memory.mb</code> and 
			
 
				+            upto the limit specified by <code>mapred.cluster.max.reduce.memory.mb</code>, 
			
 
				+            if the scheduler supports the feature. The value of -1 indicates 
			
 
				+            that this feature is turned off iff <code>mapred.cluster.reduce.memory.mb</code> is 
			
 
				+            also turned off (-1).</td>
			
 
				+          </tr>
			
 
				+        </table>
			
 
				+      </section>
			
 
				+      
			
 
				+      <section>
			
 
				+        <title>Reviewing the configuration of the CapacityScheduler</title>
			
 
				         <p>
			
 
				           Once the installation and configuration is completed, you can review
			
 
				           it after starting the MapReduce cluster from the admin UI.
			
@@ -370,10 +420,218 @@
 
				               Information</em> section of the page.</li>
			
 
				           <li>The properties for the queues should be visible in the <em>Scheduling
			
 
				               Information</em> column against each queue.</li>
			
 
				+          <li>The /scheduler web-page should show the resource usages of 
			
 
				+              individual queues.</li>
			
 
				         </ul>
			
 
				       </section>
			
 
				       
			
 
				    </section>
			
 
				+
			
 
				+  <section>
			
 
				+    <title>Example</title>
			
 
				+    <p>Here is a practical example for using CapacityScheduler:</p>
			
 
				+    <table>
			
 
				+    <tr>
			
 
				+    <td>
			
 
				+<code>&lt;?xml version="1.0"?&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>&lt;configuration&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>  &lt;!-- system limit, across all queues --&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.maximum-system-jobs&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;3000&lt;/value&gt;</code><br/>
			
 
				+<code>    &lt;description&gt;Maximum number of jobs in the system which can be initialized,</code><br/>
			
 
				+<code>     concurrently, by the CapacityScheduler.</code><br/>
			
 
				+<code>    &lt;/description&gt;    </code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code> </code><br/>
			
 
				+<code>&lt;!-- queue: queueA --&gt;</code><br/>
			
 
				+<code> &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueA.capacity&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;8&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueA.supports-priority&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;false&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueA.minimum-user-limit-percent&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;20&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueA.user-limit-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;10&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueA.maximum-initialized-active-tasks&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;200000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueA.maximum-initialized-active-tasks-per-user&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;100000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueA.init-accept-jobs-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;100&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>&lt;!-- queue: queueB --&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueB.capacity&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;2&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueB.supports-priority&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;false&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueB.minimum-user-limit-percent&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;20&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueB.user-limit-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;1&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueB.maximum-initialized-active-tasks&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;200000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueB.maximum-initialized-active-tasks-per-user&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;100000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueB.init-accept-jobs-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;10&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>&lt;!-- queue: queueC --&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueC.capacity&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;30&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueC.supports-priority&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;false&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueC.minimum-user-limit-percent&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;20&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueC.user-limit-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;1&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueC.maximum-initialized-active-tasks&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;200000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueC.maximum-initialized-active-tasks-per-user&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;100000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueC.init-accept-jobs-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;10&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>&lt;!-- queue: queueD --&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueD.capacity&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;1&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueD.supports-priority&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;false&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueD.minimum-user-limit-percent&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;20&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueD.user-limit-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;20&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueD.maximum-initialized-active-tasks&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;200000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueD.maximum-initialized-active-tasks-per-user&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;100000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueD.init-accept-jobs-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;10&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>&lt;!-- queue: queueE --&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueE.capacity&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;31&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueE.supports-priority&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;false&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueE.minimum-user-limit-percent&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;20&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueE.user-limit-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;1&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueE.maximum-initialized-active-tasks&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;200000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueE.maximum-initialized-active-tasks-per-user&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;100000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueE.init-accept-jobs-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;10&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>&lt;!-- queue: queueF --&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueF.capacity&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;28&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueF.supports-priority&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;false&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueF.minimum-user-limit-percent&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;20&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueF.user-limit-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;1&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueF.maximum-initialized-active-tasks&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;200000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueF.maximum-initialized-active-tasks-per-user&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;100000&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<code>  &lt;property&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;name&gt;mapred.capacity-scheduler.queue.queueF.init-accept-jobs-factor&lt;/name&gt;</code><br/>
			
 
				+<code>    &nbsp;&nbsp;&lt;value&gt;10&lt;/value&gt;</code><br/>
			
 
				+<code>  &lt;/property&gt;</code><br/>
			
 
				+<br/>
			
 
				+<code>&lt;/configuration&gt;</code><br/>
			
 
				+    </td>
			
 
				+    </tr>
			
 
				+    </table>
			
 
				+  </section>
			
 
				   </body>
			
 
				   
			
 
				 </document>