  1. Changes for HDFS-1623 branch.
  2. This change list will be merged into the trunk CHANGES.txt when the HDFS-1623
  3. branch is merged.
  4. ------------------------------
  5. HDFS-2179. Add fencing framework and mechanisms for NameNode HA. (todd)
  6. HDFS-1974. Introduce active and standby states to the namenode. (suresh)
  7. HDFS-2407. getServerDefaults and getStats don't check operation category (atm)
  8. HDFS-1973. HA: HDFS clients must handle namenode failover and switch over to the new active namenode. (atm)
  9. HDFS-2301. Start/stop appropriate namenode services when transitioning to active and standby states. (suresh)
  10. HDFS-2231. Configuration changes for HA namenode. (suresh)
  11. HDFS-2418. Change ConfiguredFailoverProxyProvider to take advantage of HDFS-2231. (atm)
  12. HDFS-2393. Mark appropriate methods of ClientProtocol with the idempotent annotation. (atm)
  13. HDFS-2523. Small NN fixes to include HAServiceProtocol and prevent NPE on shutdown. (todd)
  14. HDFS-2577. NN fails to start since it tries to start secret manager in safemode. (todd)
  15. HDFS-2582. Scope dfs.ha.namenodes config by nameservice (todd)
  16. HDFS-2591. MiniDFSCluster support to mix and match federation with HA (todd)
  17. HDFS-1975. Support for sharing the namenode state from active to standby. (jitendra, atm, todd)
  18. HDFS-1971. Send block report from datanode to both active and standby namenodes. (sanjay, todd via suresh)
  19. HDFS-2616. Change DatanodeProtocol#sendHeartbeat() to return HeartbeatResponse. (suresh)
  20. HDFS-2622. Fix TestDFSUpgrade in HA branch. (todd)
  21. HDFS-2612. Handle refreshNameNodes in federated HA clusters (todd)
  22. HDFS-2623. Add test case for hot standby capability (todd)
  23. HDFS-2626. BPOfferService.verifyAndSetNamespaceInfo needs to be synchronized (todd)
  24. HDFS-2624. ConfiguredFailoverProxyProvider doesn't correctly stop ProtocolTranslators (todd)
  25. HDFS-2625. TestDfsOverAvroRpc failing after introduction of HeartbeatResponse type (todd)
  26. HDFS-2627. Determine DN's view of which NN is active based on heartbeat responses (todd)
  27. HDFS-2634. Standby needs to ingest latest edit logs before transitioning to active (todd)
  28. HDFS-2671. NN should throw StandbyException in response to RPCs in STANDBY state (todd)
  29. HDFS-2680. DFSClient should construct failover proxy with exponential backoff (todd)
  30. HDFS-2683. Authority-based lookup of proxy provider fails if path becomes canonicalized (todd)
  31. HDFS-2689. HA: BookKeeperEditLogInputStream doesn't implement isInProgress() (atm)
  32. HDFS-2602. NN should log newly-allocated blocks without losing BlockInfo (atm)
  33. HDFS-2667. Fix transition from active to standby (todd)
  34. HDFS-2684. Fix up some failing unit tests on HA branch (todd)
  35. HDFS-2679. Add interface to query current state to HAServiceProtocol (eli via todd)
  36. HDFS-2677. Web UI should indicate the NN state. (eli via todd)
  37. HDFS-2678. When a FailoverProxyProvider is used, DFSClient should not retry connection ten times before failing over (atm via todd)
  38. HDFS-2682. When a FailoverProxyProvider is used, Client should not retry 45 times if it times out connecting to the server. (Uma Maheswara Rao G via todd)
  39. HDFS-2693. Fix synchronization issues around state transition (todd)
  40. HDFS-1972. Fencing mechanism for block invalidations and replications (todd)
  41. HDFS-2714. Fix test cases which use standalone FSNamesystems (todd)
  42. HDFS-2692. Fix bugs related to failover from/into safe mode. (todd)
  43. HDFS-2716. Configuration needs to allow different dfs.http.addresses for each HA NN (todd)
  44. HDFS-2720. Fix MiniDFSCluster HA support to work properly on Windows. (Uma Maheswara Rao G via todd)
  45. HDFS-2291. Allow the StandbyNode to make checkpoints in an HA setup. (todd)
  46. HDFS-2709. Appropriately handle error conditions in EditLogTailer (atm via todd)
  47. HDFS-2730. Refactor shared HA-related test code into HATestUtil class (todd)
  48. HDFS-2762. Fix TestCheckpoint timing out on HA branch. (Uma Maheswara Rao G via todd)
  49. HDFS-2724. NN web UI can throw NPE after startup, before standby state is entered. (todd)
  50. HDFS-2753. Fix standby getting stuck in safemode when blocks are written while SBN is down. (Hari Mankude and todd via todd)
  51. HDFS-2773. Reading edit logs from an earlier version should not leave blocks in under-construction state. (todd)
  52. HDFS-2775. Fix TestStandbyCheckpoints.testBothNodesInStandbyState failing intermittently. (todd)
  53. HDFS-2766. Test for case where standby partially reads log and then performs checkpoint. (atm)
  54. HDFS-2738. FSEditLog.selectInputStreams is reading through in-progress streams even when non-in-progress are requested. (atm)
  55. HDFS-2789. TestHAAdmin.testFailover is failing (eli)
  56. HDFS-2747. Entering safe mode after starting SBN can NPE. (Uma Maheswara Rao G via todd)
  57. HDFS-2772. On transition to active, standby should not swallow ELIE. (atm)
  58. HDFS-2767. ConfiguredFailoverProxyProvider should support NameNodeProtocol. (Uma Maheswara Rao G via todd)
  59. HDFS-2795. Standby NN takes a long time to recover from a dead DN starting up. (todd)
  60. HDFS-2592. Balancer support for HA namenodes. (Uma Maheswara Rao G via todd)
  61. HDFS-2367. Enable the configuration of multiple HA cluster addresses. (atm)
  62. HDFS-2812. When becoming active, the NN should treat all leases as freshly renewed. (todd)
  63. HDFS-2737. Automatically trigger log rolls periodically on the active NN. (todd and atm)
  64. HDFS-2820. Add a simple sanity check for HA config (todd)
  65. HDFS-2688. Add tests for quota tracking in an HA cluster. (todd)
  66. HDFS-2804. Should not mark blocks under-replicated when exiting safemode (todd)
  67. HDFS-2807. Service level authorization for HAServiceProtocol. (jitendra)
  68. HDFS-2809. Add test to verify that delegation tokens are honored after failover. (jitendra and atm)
  69. HDFS-2838. NPE in FSNamesystem when in safe mode. (Gregory Chanan via eli)
  70. HDFS-2805. Add a test for a federated cluster with HA NNs. (Brandon Li via jitendra)
  71. HDFS-2841. HAAdmin does not work if security is enabled. (atm)
  72. HDFS-2691. Fixes for pipeline recovery in an HA cluster: report RBW replicas immediately upon pipeline creation. (todd)
  73. HDFS-2824. Fix failover when prior NN died just after creating an edit log segment. (atm via todd)
  74. HDFS-2853. HA: NN fails to start if the shared edits dir is marked required (atm via eli)
  75. HDFS-2845. SBN should not allow browsing of the file system via web UI. (Bikas Saha via atm)
  76. HDFS-2742. HA: observed data loss in replication stress test. (todd via eli)
  77. HDFS-2870. Fix log level for block debug info in processMisReplicatedBlocks (todd)
  78. HDFS-2859. LOCAL_ADDRESS_MATCHER.match has NPE when called from DFSUtil.getSuffixIDs when the host is incorrect (Bikas Saha via todd)
  79. HDFS-2861. Checkpointing should verify that dfs.http.address has been configured to a non-loopback address for the peer NN (todd)
  80. HDFS-2860. TestDFSRollback#testRollback is failing. (atm)
  81. HDFS-2769. HA: When HA is enabled with a shared edits dir, that dir should be
  82. marked required. (atm via eli)
  83. HDFS-2863. Failures observed if dfs.edits.dir and shared.edits.dir have same directories. (Bikas Saha via atm)
  84. HDFS-2874. Edit log should log to shared dirs before local dirs. (todd)
  85. HDFS-2890. DFSUtil#getSuffixIDs should skip unset configurations. (atm)
  86. HDFS-2792. Make fsck work. (atm)
  87. HDFS-2808. HA: haadmin should use namenode ids. (eli)
  88. HDFS-2819. Document new HA-related configs in hdfs-default.xml. (eli)
  89. HDFS-2752. HA: exit if multiple shared dirs are configured. (eli)
  90. HDFS-2894. HA: automatically determine the nameservice Id if only one nameservice is configured. (eli)
  91. HDFS-2733. Document HA configuration and CLI. (atm)
  92. HDFS-2794. Active NN may purge edit log files before standby NN has a chance to read them (todd)
  93. HDFS-2901. Improvements for SBN web UI: do not show under-replicated/missing blocks. (Brandon Li via jitendra)
  94. HDFS-2905. HA: Standby NN NPE when shared edits dir is deleted. (Bikas Saha via jitendra)
  95. HDFS-2579. Starting delegation token manager during safemode fails. (todd)
  96. HDFS-2510. Add HA-related metrics. (atm)
  97. HDFS-2924. Standby checkpointing fails to authenticate in secure cluster. (todd)
  98. HDFS-2915. HA: TestFailureOfSharedDir.testFailureOfSharedDir() has race condition. (Bikas Saha via jitendra)
  99. HDFS-2912. Namenode not shutting down when shared edits dir is inaccessible. (Bikas Saha via atm)
  100. HDFS-2917. HA: haadmin should not work if run by regular user (eli)
  101. HDFS-2939. TestHAStateTransitions fails on Windows. (Uma Maheswara Rao G via atm)
  102. HDFS-2947. On startup NN throws an NPE in the metrics system. (atm)
  103. HDFS-2942. TestActiveStandbyElectorRealZK fails if build dir does not exist. (atm)
  104. HDFS-2948. NN throws NPE during shutdown if it fails to start up (todd)
  105. HDFS-2909. HA: Inaccessible shared edits dir not getting removed from FSImage storage dirs upon error. (Bikas Saha via jitendra)
  106. HDFS-2934. Allow configs to be scoped to all NNs in the nameservice. (todd)
  107. HDFS-2935. Shared edits dir property should be suffixed with nameservice and namenodeID (todd)
  108. HDFS-2928. ConfiguredFailoverProxyProvider should not create a NameNode proxy with an underlying retry proxy. (Uma Maheswara Rao G via atm)
  109. HDFS-2955. IllegalStateException during standby startup in getCurSegmentTxId. (Hari Mankude via atm)
  110. HDFS-2937. TestDFSHAAdmin needs tests with MiniDFSCluster. (Brandon Li via suresh)
  111. HDFS-2586. Add protobuf service and implementation for HAServiceProtocol. (suresh via atm)
  112. HDFS-2952. NN should not start with upgrade option or with a pending unfinalized upgrade. (atm)
  113. HDFS-2974. MiniDFSCluster does not delete standby NN name dirs during format. (atm)
  114. HDFS-2929. Stress test and fixes for block synchronization (todd)
  115. HDFS-2972. Small optimization building incremental block report (todd)
  116. HDFS-2973. Re-enable NO_ACK optimization for block deletion. (todd)
  117. HDFS-2922. HA: close out operation categories (eli)
  118. HDFS-2993. HA: BackupNode#checkOperation should permit CHECKPOINT operations (eli)
  119. HDFS-2904. Client support for getting delegation tokens. (todd)
  120. HDFS-3013. HA: NameNode format doesn't pick up dfs.namenode.name.dir.NameServiceId configuration (Mingjie Lai via todd)
  121. HDFS-3019. Fix silent failure of TestEditLogJournalFailures (todd)
  122. HDFS-2958. Sweep for remaining proxy construction which doesn't go through failover path. (atm)
  123. HDFS-2920. Fix remaining TODO items. (atm and todd)
  124. HDFS-3027. Implement a simple NN health check. (atm)
  125. HDFS-3023. Optimize entries in edits log for persistBlocks call. (todd)
  126. HDFS-2979. Balancer should use logical uri for creating failover proxy with HA enabled. (atm)
  127. HDFS-3035. Fix failure of TestFileAppendRestart due to OP_UPDATE_BLOCKS (todd)
  128. HDFS-3039. Address findbugs and javadoc warnings on branch. (todd via atm)