
HADOOP-4687. Updating CHANGES.txt

git-svn-id: https://svn.apache.org/repos/asf/hadoop/core/branches/HADOOP-4687/core@785393 13f79535-47bb-0310-9956-ffa450edef68
Owen O'Malley 16 years ago
commit c2d07b4fcd
1 changed file with 275 additions and 11 deletions

+ 275 - 11
CHANGES.txt

@@ -56,6 +56,18 @@ Trunk (unreleased changes)
     HADOOP-4359. Support for data access authorization checking on Datanodes.
     (Kan Zhang via rangadi)
 
+    HADOOP-5690. Change org.apache.hadoop.examples.DBCountPageView to use 
+    new mapreduce api. (Amareshwari Sriramadasu via sharad)
+
+    HADOOP-5694. Change org.apache.hadoop.examples.dancing to use new 
+    mapreduce api. (Amareshwari Sriramadasu via sharad)
+
+    HADOOP-5696. Change org.apache.hadoop.examples.Sort to use new 
+    mapreduce api. (Amareshwari Sriramadasu via sharad)
+
+    HADOOP-5698. Change org.apache.hadoop.examples.MultiFileWordCount to 
+    use new mapreduce api. (Amareshwari Sriramadasu via sharad)
+
   NEW FEATURES
 
     HADOOP-4268. Change fsck to use ClientProtocol methods so that the
@@ -122,6 +134,21 @@ Trunk (unreleased changes)
     HADOOP-5643. Adds a way to decommission TaskTrackers while the JobTracker
     is running. (Amar Kamat via ddas)
 
+    HADOOP-4829. Allow FileSystem shutdown hook to be disabled.
+    (Todd Lipcon via tomwhite)
+
+    HADOOP-5815. Sqoop: A database import tool for Hadoop.
+    (Aaron Kimball via tomwhite)
+
+    HADOOP-4861. Add disk usage with human-readable size (-duh).
+    (Todd Lipcon via tomwhite)
+
+    HADOOP-5844. Use mysqldump when connecting to local mysql instance in Sqoop.
+    (Aaron Kimball via tomwhite)
+
+    HADOOP-5170. Allows jobs to set max maps/reduces per-node and per-cluster.
+    (Matei Zaharia via ddas)
+
   IMPROVEMENTS
 
     HADOOP-4565. Added CombineFileInputFormat to use data locality information
@@ -326,9 +353,6 @@ Trunk (unreleased changes)
     HADOOP-5080. Add new test cases to TestMRCLI and TestHDFSCLI
     (V.Karthikeyan via nigel)
 
-    HADOOP-4372. Improves the way history filenames are obtained and manipulated.
-    (Amar Kamat via ddas)
-
     HADOOP-5135. Splits the tests into different directories based on the 
     package. Four new test targets have been defined - run-test-core, 
     run-test-mapred, run-test-hdfs and run-test-hdfs-with-mr.
@@ -343,8 +367,6 @@ Trunk (unreleased changes)
 
 
     HADOOP-5780. Improve per block message prited by "-metaSave" in HDFS.
     (Raghu Angadi)
-   
-    HADOOP-5792. To resolve jsp-2.1 jars through ivy (Giridharan Kesavan) 
 
 
     HADOOP-5823. Added a new class DeprecatedUTF8 to help with removing
     UTF8 related javac warnings. These warnings are removed in 
@@ -367,6 +389,61 @@ Trunk (unreleased changes)
     HADOOP-5858. Eliminate UTF8 and fix warnings in test/hdfs-with-mr package.
     (shv)
 
+    HADOOP-5866. Move DeprecatedUTF8 from o.a.h.io to o.a.h.hdfs since it may
+    not be used outside hdfs. (Raghu Angadi)
+
+    HADOOP-5857. Move normal java methods from hdfs .jsp files to .java files.
+    (szetszwo)
+
+    HADOOP-5873. Remove deprecated methods randomDataNode() and
+    getDatanodeByIndex(..) in FSNamesystem.  (szetszwo)
+
+    HADOOP-5572. Improves the progress reporting for the sort phase for both
+    maps and reduces. (Ravi Gummadi via ddas)
+
+    HADOOP-5839. Fix EC2 scripts to allow remote job submission.
+    (Joydeep Sen Sarma via tomwhite)
+
+    HADOOP-5877. Fix javac warnings in TestHDFSServerPorts, TestCheckpoint, 
+    TestNameEditsConfig, TestStartup and TestStorageRestore.
+    (Jakob Homan via shv)
+
+    HADOOP-5438. Provide a single FileSystem method to create or open-for-append
+    to a file.  (He Yongqiang via dhruba)
+
+    HADOOP-5472. Change DistCp to support globbing of input paths.  (Dhruba
+    Borthakur and Rodrigo Schmidt via szetszwo)
+
+    HADOOP-5175. Don't unpack libjars on classpath. (Todd Lipcon via tomwhite)
+
+    HADOOP-5620. Add an option to DistCp for preserving modification and access
+    times.  (Rodrigo Schmidt via szetszwo)
+
+    HADOOP-5664. Change map serialization so a lock is obtained only where
+    contention is possible, rather than for each write. (cdouglas)
+
+    HADOOP-5896. Remove the dependency of GenericOptionsParser on 
+    Option.withArgPattern. (Giridharan Kesavan and Sharad Agarwal via 
+    sharad)
+
+    HADOOP-5784. Makes the number of heartbeats that should arrive a second
+    at the JobTracker configurable. (Amareshwari Sriramadasu via ddas)
+
+    HADOOP-5955. Changes TestFileOuputFormat so that is uses LOCAL_MR
+    instead of CLUSTER_MR. (Jothi Padmanabhan via das)
+
+    HADOOP-5948. Changes TestJavaSerialization to use LocalJobRunner 
+    instead of MiniMR/DFS cluster. (Jothi Padmanabhan via das)
+
+    HADOOP-2838. Add mapred.child.env to pass environment variables to 
+    tasktracker's child processes. (Amar Kamat via sharad)
+
+    HADOOP-5961. DataNode process understand generic hadoop command line
+    options (like -Ddfs.property=value). (Raghu Angadi)
+
+    HADOOP-5938. Change org.apache.hadoop.mapred.jobcontrol to use new
+    api. (Amareshwari Sriramadasu via sharad)
+
   OPTIMIZATIONS
 
     HADOOP-5595. NameNode does not need to run a replicator to choose a
@@ -494,9 +571,6 @@ Trunk (unreleased changes)
 
 
     HADOOP-5507. Fix JMXGet javadoc warnings.  (Boris Shkolnik via szetszwo)
 
-    HADOOP-5210. Solves a problem in the progress report of the reduce task.
-    (Ravi Gummadi via ddas)
-
     HADOOP-5191. Accessing HDFS with any ip or hostname should work as long 
     as it points to the interface NameNode is listening on. (Raghu Angadi)
 
@@ -624,6 +698,128 @@ Trunk (unreleased changes)
     HADOOP-5835. Fix findbugs warnings found in Block, DataNode, NameNode and
     a few other hdfs classes.  (Suresh Srinivas via szetszwo)
 
+    HADOOP-5853. Undeprecate HttpServer.addInternalServlet method.  (Suresh
+    Srinivas via szetszwo)
+
+    HADOOP-5801. Fixes the problem: If the hosts file is changed across restart
+    then it should be refreshed upon recovery so that the excluded hosts are 
+    lost and the maps are re-executed. (Amar Kamat via ddas)
+
+    HADOOP-5841. Resolve findbugs warnings in DistributedFileSystem,
+    DatanodeInfo, BlocksMap, DataNodeDescriptor.  (Jakob Homan via szetszwo)
+
+    HADOOP-5878. Fix import and Serializable javac warnings found in hdfs jsp.
+    (szetszwo)
+
+    HADOOP-5782. Revert a few formatting changes introduced in HADOOP-5015.
+    (Suresh Srinivas via rangadi)
+
+    HADOOP-5687. NameNode throws NPE if fs.default.name is the default value.
+    (Philip Zeyliger via shv)
+
+    HADOOP-5867. Fix javac warnings found in NNBench and NNBenchWithoutMR.
+    (Konstantin Boudnik via szetszwo)
+    
+    HADOOP-5728. Fixed FSEditLog.printStatistics IndexOutOfBoundsException.
+    (Wang Xu via johan)
+
+    HADOOP-5847. Fixed failing Streaming unit tests (gkesavan) 
+
+    HADOOP-5252. Streaming overrides -inputformat option (Klaas Bosteels 
+    via sharad)
+
+    HADOOP-5710. Counter MAP_INPUT_BYTES missing from new mapreduce api. 
+    (Amareshwari Sriramadasu via sharad)
+
+    HADOOP-5809. Fix job submission, broken by errant directory creation.
+    (Sreekanth Ramakrishnan and Jothi Padmanabhan via cdouglas)
+
+    HADOOP-5759. Fix for  IllegalArgumentException when 
+    CombineFileInputFormat is used as job InputFormat.
+    (Amareshwari Sriramadasu via dhruba)
+
+    HADOOP-5635. Change distributed cache to work with other distributed file
+    systems. (Andrew Hitchcock via tomwhite)
+
+    HADOOP-5856. Fix "unsafe multithreaded use of DateFormat" findbugs warning
+    in DataBlockScanner.  (Kan Zhang via szetszwo)
+
+    HADOOP-4864. Fixes a problem to do with -libjars with multiple jars when
+    client and cluster reside on different OSs. (Amareshwari Sriramadasu via ddas)
+
+    HADOOP-5623. Fixes a problem to do with status messages getting overwritten
+    in streaming jobs. (Rick Cox and Jothi Padmanabhan via ddas)
+
+    HADOOP-5895. Fixes computation of count of merged bytes for logging.
+    (Ravi Gummadi via ddas)
+
+    HADOOP-5805. problem using top level s3 buckets as input/output directories.
+    (Ian Nowland via tomwhite)
+   
+    HADOOP-5940. trunk eclipse-plugin build fails while trying to copy 
+    commons-cli jar from the lib dir (Giridharan Kesavan via gkesavan)
+
+    HADOOP-5864. Fix DMI and OBL findbugs in packages hdfs and metrics.
+    (hairong)
+
+    HADOOP-5935. Fix Hudson's release audit warnings link is broken. 
+    (Giridharan Kesavan via gkesavan)
+
+    HADOOP-5947. Delete empty TestCombineFileInputFormat.java
+
+    HADOOP-5899. Move a log message in FSEditLog to the right place for
+    avoiding unnecessary log.  (Suresh Srinivas via szetszwo)
+
+    HADOOP-5944. Add Apache license header to BlockManager.java.  (Suresh
+    Srinivas via szetszwo)
+
+    HADOOP-5891. SecondaryNamenode is able to converse with the NameNode 
+    even when the default value of dfs.http.address is not overridden.
+    (Todd Lipcon via dhruba)
+
+    HADOOP-5953. The isDirectory(..) and isFile(..) methods in KosmosFileSystem
+    should not be deprecated.  (szetszwo)
+
+    HADOOP-5954. Fix javac warnings in TestFileCreation, TestSmallBlock,
+    TestFileStatus, TestDFSShellGenericOptions, TestSeekBug and
+    TestDFSStartupVersions.  (szetszwo)
+
+    HADOOP-5956. Fix ivy dependency in hdfsproxy and capacity-scheduler.
+    (Giridharan Kesavan via szetszwo)
+
+    HADOOP-5836. Bug in S3N handling of directory markers using an object with
+    a trailing "/" causes jobs to fail. (Ian Nowland via tomwhite)
+
+    HADOOP-5861. s3n files are not getting split by default. (tomwhite)
+
+    HADOOP-5762. Fix a problem that DistCp does not copy empty directory.
+    (Rodrigo Schmidt via szetszwo)
+
+    HADOOP-5859. Fix "wait() or sleep() with locks held" findbugs warnings in
+    DFSClient.  (Kan Zhang via szetszwo)
+   
+    HADOOP-5457. Fix to continue to run builds even if contrib test fails
+    (Giridharan Kesavan via gkesavan)
+
+    HADOOP-5963. Remove an unnecessary exception catch in NNBench.  (Boris
+    Shkolnik via szetszwo)
+
+    HADOOP-5989. Fix streaming test failure.  (gkesavan)
+
+    HADOOP-5981. Fix a bug in HADOOP-2838 in parsing mapred.child.env.
+    (Amar Kamat via sharad)
+
+    HADOOP-5420. Fix LinuxTaskController to kill tasks using the process
+    groups they are launched with.
+    (Sreekanth Ramakrishnan via yhemanth)
+
+    HADOOP-6031. Remove @author tags from Java source files.  (Ravi Phulari
+    via szetszwo)
+
+    HADOOP-5980. Fix LinuxTaskController so tasks get passed 
+    LD_LIBRARY_PATH and other environment variables.
+    (Sreekanth Ramakrishnan via yhemanth)
+
 Release 0.20.1 - Unreleased
 
   INCOMPATIBLE CHANGES
@@ -631,6 +827,9 @@ Release 0.20.1 - Unreleased
     HADOOP-5726. Remove pre-emption from capacity scheduler code base.
     (Rahul Kumar Singh via yhemanth)
 
+    HADOOP-5881. Simplify memory monitoring and scheduling related
+    configuration. (Vinod Kumar Vavilapalli via yhemanth)
+
   NEW FEATURES
 
   IMPROVEMENTS
@@ -644,6 +843,9 @@ Release 0.20.1 - Unreleased
     HADOOP-5714. Add a metric for NameNode getFileInfo operation. (Jakob Homan
     via szetszwo)
 
+    HADOOP-4372. Improves the way history filenames are obtained and manipulated.
+    (Amar Kamat via ddas)
+
   OPTIMIZATIONS
 
   BUG FIXES
@@ -690,6 +892,62 @@ Release 0.20.1 - Unreleased
     HADOOP-5828. Use absolute path for mapred.local.dir of JobTracker in
     MiniMRCluster. (yhemanth)
 
+    HADOOP-4981. Fix capacity scheduler to schedule speculative tasks 
+    correctly in the presence of High RAM jobs.
+    (Sreekanth Ramakrishnan via yhemanth)
+
+    HADOOP-5210. Solves a problem in the progress report of the reduce task.
+    (Ravi Gummadi via ddas)
+
+    HADOOP-5850. Fixes a problem to do with not being able to jobs with
+    0 maps/reduces. (Vinod K V via ddas)
+
+    HADOOP-4626. Correct the API links in hdfs forrest doc so that they
+    point to the same version of hadoop.  (szetszwo)
+
+    HADOOP-5883. Fixed tasktracker memory monitoring to account for
+    momentary spurts in memory usage due to java's fork() model.
+    (yhemanth)
+
+    HADOOP-5539. Fixes a problem to do with not preserving intermediate
+    output compression for merged data.
+    (Jothi Padmanabhan and Billy Pearson via ddas)
+
+    HADOOP-5932. Fixes a problem in capacity scheduler in computing
+    available memory on a tasktracker.
+    (Vinod Kumar Vavilapalli via yhemanth)
+
+    HADOOP-5937. Correct a safemode message in FSNamesystem.  (Ravi Phulari
+    via szetszwo)
+
+    HADOOP-5908. Fixes a problem to do with ArithmeticException in the 
+    JobTracker when there are jobs with 0 maps. (Amar Kamat via ddas)
+
+    HADOOP-5924. Fixes a corner case problem to do with job recovery with
+    empty history files. Also, after a JT restart, sends KillTaskAction to 
+    tasks that report back but the corresponding job hasn't been initialized
+    yet. (Amar Kamat via ddas)
+
+    HADOOP-5882. Fixes a reducer progress update problem for new mapreduce
+    api. (Amareshwari Sriramadasu via sharad)
+
+    HADOOP-5746. Fixes a corner case problem in Streaming, where if an exception
+    happens in MROutputThread after the last call to the map/reduce method, the 
+    exception goes undetected. (Amar Kamat via ddas)
+
+    HADOOP-5884. Fixes accounting in capacity scheduler so that high RAM jobs
+    take more slots. (Vinod Kumar Vavilapalli via yhemanth)
+
+    HADOOP-5869. Fix bug in assignment of setup / cleanup task that was
+    causing TestQueueCapacities to fail.
+    (Sreekanth Ramakrishnan via yhemanth)
+
+    HADOOP-5921. Fixes a problem in the JobTracker where it sometimes never used
+    to come up due to a system file creation on JobTracker's system-dir failing. 
+    This problem would sometimes show up only when the FS for the system-dir 
+    (usually HDFS) is started at nearly the same time as the JobTracker. 
+    (Amar Kamat via ddas)
+
 Release 0.20.0 - 2009-04-15
 
   INCOMPATIBLE CHANGES
@@ -1454,7 +1712,7 @@ Release 0.20.0 - 2009-04-15
     (Rahul Kumar Singh via yhemanth)
 
     HADOOP-5328. Fixes a problem in the renaming of job history files during 
-    job recovery. Amar Kamat via ddas)
+    job recovery. (Amar Kamat via ddas)
 
 
     HADOOP-5417. Don't ignore InterruptedExceptions that happen when calling 
     into rpc. (omalley)
@@ -1567,7 +1825,7 @@ Release 0.19.2 - Unreleased
     (ddas)
 
     HADOOP-5392. Fixes a problem to do with JT crashing during recovery when
-    the job files are garbled. (Amar Kamat vi ddas)
+    the job files are garbled. (Amar Kamat via ddas)
 
 
     HADOOP-5332. Appending to files is not allowed (by default) unless
     dfs.support.append is set to true. (dhruba)
@@ -1625,6 +1883,12 @@ Release 0.19.2 - Unreleased
     HADOOP-5579. Set errno correctly in libhdfs for permission, quota, and FNF
     conditions. (Brian Bockelman via cdouglas)
 
+    HADOOP-5816. Fixes a problem in the KeyFieldBasedComparator to do with
+    ArrayIndexOutOfBounds exception. (He Yongqiang via ddas)
+
+    HADOOP-5951. Add Apache license header to StorageInfo.java.  (Suresh
+    Srinivas via szetszwo)
+
 Release 0.19.1 - 2009-02-23 
 
   IMPROVEMENTS
@@ -2268,7 +2532,7 @@ Release 0.19.0 - 2008-11-18
     (Lohit Vijayarenu via mahadev)
 
     HADOOP-3778. DFSInputStream.seek() did not retry in case of some errors.
-    (LN via rangadi)
+    (Luo Ning via rangadi)
 
 
     HADOOP-3661. The handling of moving files deleted through fuse-dfs to
     Trash made similar to the behaviour from dfs shell.