CHANGES.txt 13 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380
  1. Hadoop HDFS Change Log
  2. Trunk (unreleased changes)
  3. INCOMPATIBLE CHANGES
  4. HDFS-538. Per the contract elucidated in HADOOP-6201, throw
  5. FileNotFoundException from FileSystem::listStatus rather than returning
  6. null. (Jakob Homan via cdouglas)
  7. HDFS-602. DistributedFileSystem mkdirs throws FileAlreadyExistsException
  8. instead of FileNotFoundException. (Boris Shkolnik via suresh)
  9. HDFS-544. Add a "rbw" subdir to DataNode data directory. (hairong)
  10. HDFS-576. Block report includes under-construction replicas. (shv)
  11. HDFS-636. SafeMode counts complete blocks only. (shv)
  12. HDFS-644. Lease recovery, concurrency support. (shv)
  13. HDFS-570. Get last block length from a data-node when opening a file
  14. being written to. (Tsz Wo (Nicholas), SZE via shv)
  15. HDFS-657. Remove unused legacy data-node protocol methods. (shv)
  16. HDFS-658. Block recovery for primary data-node. (shv)
  17. NEW FEATURES
  18. HDFS-436. Introduce AspectJ framework for HDFS code and tests.
  19. (Konstantin Boudnik via szetszwo)
  20. HDFS-447. Add LDAP lookup to hdfsproxy. (Zhiyong Zhang via cdouglas)
  21. HDFS-459. Introduce Job History Log Analyzer. (shv)
  22. HDFS-461. Tool to analyze file size distribution in HDFS. (shv)
  23. HDFS-492. Add two JSON JSP pages to the Namenode for providing corrupt
  24. blocks/replicas information. (Bill Zeller via szetszwo)
  25. HDFS-578. Add support for new FileSystem method for clients to get server
  26. defaults. (Kan Zhang via suresh)
  27. HDFS-595. umask settings in configuration may now use octal or symbolic
  28. instead of decimal. (Jakob Homan via suresh)
  29. HADOOP-6234. Updated hadoop-core and test jars to propagate new option
  30. dfs.umaskmode in configuration. (Jakob Homan via suresh)
  31. HDFS-235. Add support for byte ranges in HftpFileSystem to serve
  32. range of bytes from a file. (Bill Zeller via suresh)
  33. HDFS-385. Add support for an experimental API that allows a module external
  34. to HDFS to specify how HDFS blocks should be placed. (dhruba)
  35. HADOOP-4952. Update hadoop-core and test jars to propagate new FileContext
  36. file system application interface. (Sanjay Radia via suresh).
  37. HDFS-567. Add block forensics contrib tool to print history of corrupt and
  38. missing blocks from the HDFS logs.
  39. (Bill Zeller, Jithendra Pandey via suresh).
  40. HDFS-610. Support o.a.h.fs.FileContext. (Sanjay Radia via szetszwo)
  41. HDFS-536. Support hflush at DFSClient. (hairong)
  42. HDFS-517. Introduce BlockInfoUnderConstruction to reflect block replica
  43. states while writing. (shv)
  44. HDFS-565. Introduce block committing logic during new block allocation
  45. and file close. (shv)
  46. HDFS-537. DataNode exposes a replica's meta info to BlockReceiver for the
  47. support of dfs writes/hflush. It also updates a replica's bytes received,
  48. bytes on disk, and bytes acked after receiving a packet. (hairong)
  49. HDFS-585. Datanode should serve up to visible length of a replica for read
  50. requests. (szetszwo)
  51. HDFS-604. Block report processing for append. (shv)
  52. HDFS-619. Support replica recovery initialization in datanode for the new
  53. append design. (szetszwo)
  54. HDFS-592. Allow clients to fetch a new generation stamp from NameNode for
  55. pipeline recovery. (hairong)
  56. HDFS-624. Support a new algorithm for pipeline recovery and pipeline setup
  57. for append. (hairong)
  58. HDFS-627. Support replica update in data-node.
  59. (Tsz Wo (Nicholas), SZE and Hairong Kuang via shv)
  60. HDFS-642. Support pipeline close and close error recovery. (hairong)
  61. IMPROVEMENTS
  62. HDFS-381. Remove blocks from DataNode maps when corresponding file
  63. is deleted. (Suresh Srinivas via rangadi)
  64. HDFS-377. Separate codes which implement DataTransferProtocol.
  65. (szetszwo)
  66. HDFS-396. NameNode image and edits directories are specified as URIs.
  67. (Luca Telloli via rangadi)
  68. HDFS-444. Allow to change probability levels dynamically in the fault
  69. injection framework. (Konstantin Boudnik via szetszwo)
  70. HDFS-352. Documentation for saveNamespace command. (Ravi Phulari via shv)
  71. HADOOP-6106. Updated hadoop-core and test jars from hudson trunk
  72. build #12. (Giridharan Kesavan)
  73. HDFS-204. Add a new metrics FilesInGetListingOps to the Namenode.
  74. (Jitendra Nath Pandey via szetszwo)
  75. HDFS-278. HDFS Outputstream close does not hang forever. (dhruba)
  76. HDFS-443. Add a new metrics numExpiredHeartbeats to the Namenode.
  77. (Jitendra Nath Pandey via szetszwo)
  78. HDFS-475. Add new ant targets for fault injection jars and tests.
  79. (Konstantin Boudnik via szetszwo)
  80. HDFS-458. Create a new ant target, run-commit-test. (Jakob Homan
  81. via szetszwo)
  82. HDFS-493. Change build.xml so that the fault-injected tests are executed
  83. only by the run-test-*-fault-inject targets. (Konstantin Boudnik via
  84. szetszwo)
  85. HDFS-446. Improvements to Offline Image Viewer. (Jakob Homan via shv)
  86. HADOOP-6160. Fix releaseaudit target to run on specific directories.
  87. (gkesavan)
  88. HDFS-501. Use enum to define the constants in DataTransferProtocol.
  89. (szetszwo)
  90. HDFS-508. Factor out BlockInfo from BlocksMap. (shv)
  91. HDFS-510. Rename DatanodeBlockInfo to be ReplicaInfo.
  92. (Jakob Homan & Hairong Kuang via shv)
  93. HDFS-500. Deprecate NameNode methods deprecated in NameNodeProtocol.
  94. (Jakob Homan via shv)
  95. HDFS-514. Change DFSClient.namenode from public to private. (Bill Zeller
  96. via szetszwo)
  97. HDFS-496. Use PureJavaCrc32 in HDFS. (Todd Lipcon via szetszwo)
  98. HDFS-511. Remove redundant block searches in BlockManager. (shv)
  99. HDFS-504. Update the modification time of a file when the file
  100. is closed. (Chun Zhang via dhruba)
  101. HDFS-498. Add development guide and documentation for the fault injection
  102. framework. (Konstantin Boudnik via szetszwo)
  103. HDFS-524. Further DataTransferProtocol code refactoring. (szetszwo)
  104. HDFS-529. Use BlockInfo instead of Block to avoid redundant block searches
  105. in BlockManager. (shv)
  106. HDFS-530. Refactor TestFileAppend* to remove code duplication.
  107. (Konstantin Boudnik via szetszwo)
  108. HDFS-451. Add fault injection tests for DataTransferProtocol. (szetszwo)
  109. HDFS-409. Add more access token tests. (Kan Zhang via szetszwo)
  110. HDFS-546. DatanodeDescriptor iterates blocks as BlockInfo. (shv)
  111. HDFS-457. Do not shutdown datanode if some, but not all, volumes fail.
  112. (Boris Shkolnik via szetszwo)
  113. HDFS-548. TestFsck takes nearly 10 minutes to run. (hairong)
  114. HDFS-539. Refactor fault injeciton pipeline test util for future reuse.
  115. (Konstantin Boudnik via szetszwo)
  116. HDFS-552. Change TestFiDataTransferProtocol to junit 4 and add a few new
  117. tests. (szetszwo)
  118. HDFS-563. Simplify the codes in FSNamesystem.getBlockLocations(..).
  119. (szetszwo)
  120. HDFS-581. Introduce an iterator over blocks in the block report array. (shv)
  121. HDFS-549. Add a new target, run-with-fault-inject-testcaseonly, which
  122. allows an execution of non-FI tests in FI-enable environment. (Konstantin
  123. Boudnik via szetszwo)
  124. HDFS-173. Namenode will not block until a large directory deletion completes.
  125. It allows other operations when the deletion is in progress. (suresh)
  126. HDFS-551. Create new functional test for a block report. (Konstantin
  127. Boudnik via hairong)
  128. HDFS-288. Redundant computation in hashCode() implementation.
  129. (szetszwo via tomwhite)
  130. HDFS-412. Hadoop JMX usage makes Nagios monitoring impossible.
  131. (Brian Bockelman via tomwhite)
  132. HDFS-472. Update hdfsproxy documentation. Adds a setup guide and design
  133. document. (Zhiyong Zhang via cdouglas)
  134. HDFS-617. Support non-recursive create(). (Kan Zhang via szetszwo)
  135. HDFS-618. Support non-recursive mkdir(). (Kan Zhang via szetszwo)
  136. HDFS-574. Split the documentation between the subprojects.
  137. (Corinne Chandel via omalley)
  138. HDFS-598. Eclipse launch task for HDFS. (Eli Collins via tomwhite)
  139. HDFS-509. Redesign DataNode volumeMap to include all types of Replicas.
  140. (hairong)
  141. HDFS-562. Add a test for NameNode.getBlockLocations(..) to check read from
  142. un-closed file. (szetszwo)
  143. HDFS-543. Break FSDatasetInterface#writToBlock() into writeToRemporary,
  144. writeToRBW, ad append. (hairong)
  145. HDFS-603. Add a new interface, Replica, which is going to replace the use
  146. of Block in datanode. (szetszwo)
  147. HDFS-589. Change block write protocol to support pipeline recovery.
  148. (hairong)
  149. HDFS-652. Replace BlockInfo.isUnderConstruction() with isComplete() (shv)
  150. HDFS-648. Change some methods in AppendTestUtil to public. (Konstantin
  151. Boudnik via szetszwo)
  152. HDFS-662. Unnecessary info message from DFSClient. (hairong)
  153. HDFS-518. Create new tests for Append's hflush. (Konstantin Boudnik
  154. via szetszwo)
  155. BUG FIXES
  156. HDFS-76. Better error message to users when commands fail because of
  157. lack of quota. Allow quota to be set even if the limit is lower than
  158. current consumption. (Boris Shkolnik via rangadi)
  159. HADOOP-4687. HDFS is split from Hadoop Core. It is a subproject under
  160. Hadoop (Owen O'Malley)
  161. HADOOP-6096. Fix Eclipse project and classpath files following project
  162. split. (tomwhite)
  163. HDFS-195. Handle expired tokens when write pipeline is reestablished.
  164. (Kan Zhang via rangadi)
  165. HDFS-181. Validate src path in FSNamesystem.getFileInfo(..). (Todd
  166. Lipcon via szetszwo)
  167. HDFS-441. Remove TestFTPFileSystem. (szetszwo)
  168. HDFS-440. Fix javadoc broken links in DFSClient. (szetszwo)
  169. HDFS-480. Fix a typo in the jar name in build.xml.
  170. (Konstantin Shvachko via gkesavan)
  171. HDFS-438. Check for NULL before invoking GenericArgumentParser in
  172. DataNode. (Raghu Angadi)
  173. HDFS-415. BlockReceiver hangs in case of certain runtime exceptions.
  174. (Konstantin Boudnik via rangadi)
  175. HDFS-462. loadFSImage should close edits file. (Jakob Homan via shv)
  176. HDFS-489. Update TestHDFSCLI for the -skipTrash option in rm. (Jakob Homan
  177. via szetszwo)
  178. HDFS-445. pread() does not pick up changes to block locations.
  179. (Kan Zhang via rangadi)
  180. HDFS-463. CreateEditLog utility broken after HDFS-396 (URI for
  181. FSImage). (Suresh Srinivas via rangadi)
  182. HDFS-484. Fix bin-package and package target to package jar files.
  183. (gkesavan)
  184. HDFS-490. Eliminate the deprecated warnings introduced by H-5438.
  185. (He Yongqiang via szetszwo)
  186. HDFS-119. Fix a bug in logSync(), which causes NameNode block forever.
  187. (Suresh Srinivas via shv)
  188. HDFS-534. Include avro in ivy. (szetszwo)
  189. HDFS-532. Allow applications to know that a read request failed
  190. because block is missing. (dhruba)
  191. HDFS-561. Fix write pipeline READ_TIMEOUT in DataTransferProtocol.
  192. (Kan Zhang via szetszwo)
  193. HDFS-553. BlockSender reports wrong failed position in ChecksumException.
  194. (hairong)
  195. HDFS-568. Set mapred.job.tracker.retire.jobs to false in
  196. src/test/mapred-site.xml for mapreduce tests to run. (Amareshwari
  197. Sriramadasu via szetszwo)
  198. HDFS-15. All replicas end up on 1 rack. (Jitendra Nath Pandey via hairong)
  199. HDFS-586. TestBlocksWithNotEnoughRacks sometimes fails.
  200. (Jitendra Nath Pandey via hairong)
  201. HADOOP-6243. Fixed a NullPointerException in handling deprecated keys.
  202. (Sreekanth Ramakrishnan via yhemanth)
  203. HDFS-605. Do not run fault injection tests in the run-test-hdfs-with-mr
  204. target. (Konstantin Boudnik via szetszwo)
  205. HDFS-606. Fix ConcurrentModificationException in invalidateCorruptReplicas()
  206. (shv)
  207. HDFS-601. TestBlockReport obtains data directories directly from
  208. MiniHDFSCluster. (Konstantin Boudnik via shv)
  209. HDFS-614. TestDatanodeBlockScanner obtains data directories directly from
  210. MiniHDFSCluster. (shv)
  211. HDFS-612. Remove the use of org.mortbay.log.Log in FSDataset. (szetszwo)
  212. HDFS-622. checkMinReplication should count live nodes only. (shv)
  213. HDFS-629. Remove ReplicationTargetChooser.java along with fixing
  214. import warnings generated by Eclipse. (dhruba)
  215. HDFS-640. Fix TestHDFSFileContextMainOperations.java build failure. (suresh)
  216. HDFS-637. DataNode sends a Success ack when block write fails. (hairong)
  217. HDFS-547. TestHDFSFileSystemContract#testOutputStreamClosedTwice
  218. sometimes fails with CloseByInterruptException. (hairong)
  219. HDFS-588. Fix TestFiDataTransferProtocol and TestAppend2 failures. (shv)
  220. HDFS-550. DataNode restarts may introduce corrupt/duplicated/lost replicas
  221. when handling detached replicas. (hairong)
  222. HDFS-659. If the the last block is not complete, update its length with
  223. one of its replica's length stored in datanode. (szetszwo)
  224. HDFS-649. Check null pointers for DataTransferTest. (Konstantin Boudnik
  225. via szetszwo)
  226. HDFS-661. DataNode upgrade fails on non-existant current directory.
  227. (hairong)
  228. HDFS-597. Mofication introduced by HDFS-537 breakes an advice binding in
  229. FSDatasetAspects. (Konstantin Boudnik via szetszwo)
  230. HDFS-665. TestFileAppend2 sometimes hangs. (hairong)
  231. Release 0.20.1 - 2009-09-01
  232. IMPROVEMENTS
  233. HDFS-438. Improve help message for space quota command. (Raghu Angadi)
  234. BUG FIXES
  235. HDFS-167. Fix a bug in DFSClient that caused infinite retries on write.
  236. (Bill Zeller via szetszwo)
  237. HDFS-527. Remove/deprecate unnecessary DFSClient constructors. (szetszwo)
  238. HDFS-525. The SimpleDateFormat object in ListPathsServlet is not thread
  239. safe. (Suresh Srinivas and cdouglas)