  1. HBase Change Log
  2. Trunk (unreleased changes)
  3. INCOMPATIBLE CHANGES
  4. HADOOP-2056 A table with row keys containing colon fails to split regions
  5. HADOOP-2079 Fix generated HLog, HRegion names
  6. NEW FEATURES
  7. HADOOP-2061 Add new Base64 dialects
  8. HADOOP-2084 Add a LocalHBaseCluster
  9. HADOOP-2068 RESTful interface (Bryan Duxbury via Stack)
  10. HADOOP-2316 Run REST servlet outside of master
  11. (Bryan Duxbury via Stack)
  12. OPTIMIZATIONS
  13. BUG FIXES
  14. HADOOP-2059 In tests, exceptions in mini dfs shutdown should not fail test
  15. (e.g. nightly #272)
  16. HADOOP-2064 TestSplit assertion and NPE failures (Patch build #952 and #953)
  17. HADOOP-2124 Use of `hostname` does not work on Cygwin in some cases
  18. HADOOP-2083 TestTableIndex failed in #970 and #956
  19. HADOOP-2109 Fixed race condition in processing server lease timeout.
  20. HADOOP-2137 hql.jsp : The character 0x19 is not valid
  21. HADOOP-2109 Fix another race condition in processing dead servers,
  22. Fix error online meta regions: was using region name and not
  23. startKey as key for map.put. Change TestRegionServerExit to
  24. always kill the region server for the META region. This makes
  25. the test more deterministic and getting META reassigned was
  26. problematic.
  27. HADOOP-2155 Method expecting HBaseConfiguration throws NPE when given Configuration
  28. HADOOP-2156 BufferUnderflowException for un-named HTableDescriptors
  29. HADOOP-2161 getRow() is orders of magnitude slower than get(), even on rows
  30. with one column (Clint Morgan and Stack)
  31. HADOOP-2040 Hudson hangs AFTER test has finished
  32. HADOOP-2274 Excess synchronization introduced by HADOOP-2139 negatively
  33. impacts performance
  34. HADOOP-2196 Fix how hbase sits in hadoop 'package' product
  35. HADOOP-2276 Address regression caused by HADOOP-2274, fix HADOOP-2173 (When
  36. the master times out a region server's lease, the region server
  37. may not restart)
  38. HADOOP-2253 getRow can return HBASE::DELETEVAL cells
  39. (Bryan Duxbury via Stack)
  40. HADOOP-2295 Fix assigning a region to multiple servers
  41. HADOOP-2234 TableInputFormat erroneously aggregates map values
  42. HADOOP-2308 null regioninfo breaks meta scanner
  43. HADOOP-2304 Abbreviated symbol parsing error of dir path in jar command
  44. (Edward Yoon via Stack)
  45. HADOOP-2320 Committed TestGet2 is mangled (breaks build).
  46. HADOOP-2322 getRow(row, TS) client interface not properly connected
  47. HADOOP-2309 ConcurrentModificationException doing get of all region start keys
  48. HADOOP-2321 TestScanner2 does not release resources, which sometimes causes the
  49. test to time out
  50. HADOOP-2315 REST servlet doesn't treat / characters in row key correctly
  51. (Bryan Duxbury via Stack)
  52. HADOOP-2332 Meta table data selection in Hbase Shell
  53. (Edward Yoon via Stack)
  54. HADOOP-2347 REST servlet not thread safe but run in a threaded manner
  55. (Bryan Duxbury via Stack)
  56. IMPROVEMENTS
  57. HADOOP-2401 Add convenience put method that takes writable
  58. (Johan Oskarsson via Stack)
  59. HADOOP-2074 Simple switch to enable DEBUG level-logging in hbase
  60. HADOOP-2088 Make hbase runnable in $HADOOP_HOME/build(/contrib/hbase)
  61. HADOOP-2126 Use Bob Jenkins' hash for bloom filters
  62. HADOOP-2157 Make Scanners implement Iterable
  63. HADOOP-2176 HTable.deleteAll documentation is ambiguous
  64. HADOOP-2139 (phase 1) Increase parallelism in region servers.
  65. HADOOP-2267 [Hbase Shell] Change the prompt's title from 'hbase' to 'hql'.
  66. (Edward Yoon via Stack)
  67. HADOOP-2139 (phase 2) Make region server more event driven
  68. HADOOP-2289 Useless efforts of looking for the nonexistent table in select
  69. command.
  70. (Edward Yoon via Stack)
  71. HADOOP-2257 Show a total of all requests and regions on the web ui
  72. (Paul Saab via Stack)
  73. HADOOP-2261 HTable.abort no longer throws exception if there is no active update.
  74. HADOOP-2287 Make hbase unit tests take less time to complete.
  75. HADOOP-2262 Retry n times instead of n**2 times.
  76. HADOOP-1608 Relational Algebra Operators
  77. (Edward Yoon via Stack)
  78. HADOOP-2198 HTable should have method to return table metadata
  79. HADOOP-2296 hbase shell: phantom columns show up from select command
  80. HADOOP-2297 System.exit() Handling in hbase shell jar command
  81. (Edward Yoon via Stack)
  82. HADOOP-2224 Add HTable.getRow(ROW, ts)
  83. (Bryan Duxbury via Stack)
  84. HADOOP-2339 Delete command with no WHERE clause
  85. (Edward Yoon via Stack)
  86. HADOOP-2299 Support inclusive scans (Bryan Duxbury via Stack)
  87. HADOOP-2333 Client side retries happen at the wrong level
  88. Release 0.15.1
  89. Branch 0.15
  90. INCOMPATIBLE CHANGES
  91. HADOOP-1931 Hbase scripts take --ARG=ARG_VALUE when they should be like hadoop
  92. and take --ARG ARG_VALUE
  93. NEW FEATURES
  94. HADOOP-1768 FS command using Hadoop FsShell operations
  95. (Edward Yoon via Stack)
  96. HADOOP-1784 Delete: Fix scanners and gets so they work properly in presence
  97. of deletes. Added a deleteAll to remove all cells equal to or
  98. older than passed timestamp. Fixed compaction so deleted cells
  99. do not make it out into compacted output. Ensure also that
  100. versions > column max are dropped compacting.
  101. HADOOP-1720 Addition of HQL (Hbase Query Language) support in Hbase Shell.
  102. The old shell syntax has been replaced by HQL, a small SQL-like
  103. set of operators, for creating, altering, dropping, inserting,
  104. deleting, and selecting, etc., data in hbase.
  105. (Inchul Song and Edward Yoon via Stack)
  106. HADOOP-1913 Build a Lucene index on an HBase table
  107. (Ning Li via Stack)
  108. HADOOP-1957 Web UI with report on cluster state and basic browsing of tables
  109. OPTIMIZATIONS
  110. BUG FIXES
  111. HADOOP-1527 Region server won't start because logdir exists
  112. HADOOP-1723 If master asks region server to shut down, by-pass return of
  113. shutdown message
  114. HADOOP-1729 Recent renaming of META tables breaks hbase shell
  115. HADOOP-1730 unexpected null value causes META scanner to exit (silently)
  116. HADOOP-1747 On a cluster, on restart, regions multiply assigned
  117. HADOOP-1776 Fix for sporadic compaction failures closing and moving
  118. compaction result
  119. HADOOP-1780 Regions are still being doubly assigned
  120. HADOOP-1797 Fix NPEs in MetaScanner constructor
  121. HADOOP-1799 Incorrect classpath in binary version of Hadoop
  122. HADOOP-1805 Region server hang on exit
  123. HADOOP-1785 TableInputFormat.TableRecordReader.next has a bug
  124. (Ning Li via Stack)
  125. HADOOP-1800 output should default to UTF-8 encoding
  126. HADOOP-1801 When hdfs is yanked out from under hbase, hbase should go down gracefully
  127. HADOOP-1813 OOME makes zombie of region server
  128. HADOOP-1814 TestCleanRegionServerExit fails too often on Hudson
  129. HADOOP-1820 Regionserver creates hlogs without bound
  130. (reverted 2007/09/25) (Fixed 2007/09/30)
  131. HADOOP-1821 Replace all String.getBytes() with String.getBytes("UTF-8")
  132. HADOOP-1832 listTables() returns duplicate tables
  133. HADOOP-1834 Scanners ignore timestamp passed on creation
  134. HADOOP-1847 Many HBase tests do not fail well.
  135. HADOOP-1847 Many HBase tests do not fail well. (phase 2)
  136. HADOOP-1870 Once file system failure has been detected, don't check it again
  137. and get on with shutting down the hbase cluster.
  138. HADOOP-1888 NullPointerException in HMemcacheScanner (reprise)
  139. HADOOP-1903 Possible data loss if Exception happens between snapshot and
  140. flush to disk.
  141. HADOOP-1920 Wrapper scripts broken when hadoop in one location and hbase in
  142. another
  143. HADOOP-1923, HADOOP-1924 a) tests fail sporadically because set up and tear
  144. down is inconsistent b) TestDFSAbort failed in nightly #242
  145. HADOOP-1929 Add hbase-default.xml to hbase jar
  146. HADOOP-1941 StopRowFilter throws NPE when passed null row
  147. HADOOP-1966 Make HBase unit tests more reliable in the Hudson environment.
  148. HADOOP-1975 HBase tests failing with java.lang.NumberFormatException
  149. HADOOP-1990 Regression test instability affects nightly and patch builds
  150. HADOOP-1996 TestHStoreFile fails on windows if run multiple times
  151. HADOOP-1937 When the master times out a region server's lease, it is too
  152. aggressive in reclaiming the server's log.
  153. HADOOP-2004 webapp hql formatting bugs
  154. HADOOP-2011 Make hbase daemon scripts take args in same order as hadoop
  155. daemon scripts
  156. HADOOP-2017 TestRegionServerAbort failure in patch build #903 and
  157. nightly #266
  158. HADOOP-2029 TestLogRolling fails too often in patch and nightlies
  159. HADOOP-2038 TestCleanRegionExit failed in patch build #927
  160. IMPROVEMENTS
  161. HADOOP-1737 Make HColumnDescriptor data members publicly settable
  162. HADOOP-1746 Clean up findbugs warnings
  163. HADOOP-1757 Bloomfilters: single argument constructor, use enum for bloom
  164. filter types
  165. HADOOP-1760 Use new MapWritable and SortedMapWritable classes from
  166. org.apache.hadoop.io
  167. HADOOP-1793 (Phase 1) Remove TestHClient (Phase2) remove HClient.
  168. HADOOP-1794 Remove deprecated APIs
  169. HADOOP-1802 Startup scripts should wait until hdfs has cleared 'safe mode'
  170. HADOOP-1833 bin/stop_hbase.sh returns before it completes
  171. (Izaak Rubin via Stack)
  172. HADOOP-1835 Updated Documentation for HBase setup/installation
  173. (Izaak Rubin via Stack)
  174. HADOOP-1868 Make default configuration more responsive
  175. HADOOP-1884 Remove useless debugging log messages from hbase.mapred
  176. HADOOP-1856 Add Jar command to hbase shell using Hadoop RunJar util
  177. (Edward Yoon via Stack)
  178. HADOOP-1928 Have master pass the regionserver the filesystem to use
  179. HADOOP-1789 Output formatting
  180. HADOOP-1960 If a region server cannot talk to the master before its lease
  181. times out, it should shut itself down
  182. HADOOP-2035 Add logo to webapps
  183. Below is the list of changes before 2007-08-18
  184. 1. HADOOP-1384. HBase omnibus patch. (jimk, Vuk Ercegovac, and Michael Stack)
  185. 2. HADOOP-1402. Fix javadoc warnings in hbase contrib. (Michael Stack)
  186. 3. HADOOP-1404. HBase command-line shutdown failing (Michael Stack)
  187. 4. HADOOP-1397. Replace custom hbase locking with
  188. java.util.concurrent.locks.ReentrantLock (Michael Stack)
  189. 5. HADOOP-1403. HBase reliability - make master and region server more fault
  190. tolerant.
  191. 6. HADOOP-1418. HBase miscellaneous: unit test for HClient, client to do
  192. 'Performance Evaluation', etc.
  193. 7. HADOOP-1420, HADOOP-1423. Findbugs changes, remove reference to removed
  194. class HLocking.
  195. 8. HADOOP-1424. TestHBaseCluster fails with IllegalMonitorStateException. Fix
  196. regression introduced by HADOOP-1397.
  197. 9. HADOOP-1426. Make hbase scripts executable + add test classes to CLASSPATH.
  198. 10. HADOOP-1430. HBase shutdown leaves regionservers up.
  199. 11. HADOOP-1392. Part1: includes create/delete table; enable/disable table;
  200. add/remove column.
  201. 12. HADOOP-1392. Part2: includes table compaction by merging adjacent regions
  202. that have shrunk in size.
  203. 13. HADOOP-1445 Support updates across region splits and compactions
  204. 14. HADOOP-1460 On shutdown IOException with complaint 'Cannot cancel lease
  205. that is not held'
  206. 15. HADOOP-1421 Failover detection, split log files.
  207. For the files modified, also clean up javadoc, class, field and method
  208. visibility (HADOOP-1466)
  209. 16. HADOOP-1479 Fix NPE in HStore#get if store file only has keys < passed key.
  210. 17. HADOOP-1476 Distributed version of 'Performance Evaluation' script
  211. 18. HADOOP-1469 Asynchronous table creation
  212. 19. HADOOP-1415 Integrate BSD licensed bloom filter implementation.
  213. 20. HADOOP-1465 Add cluster stop/start scripts for hbase
  214. 21. HADOOP-1415 Provide configurable per-column bloom filters - part 2.
  215. 22. HADOOP-1498. Replace boxed types with primitives in many places.
  216. 23. HADOOP-1509. Made methods/inner classes in HRegionServer and HClient protected
  217. instead of private for easier extension. Also made HRegion and HRegionInfo public.
  218. Added an hbase-default.xml property for specifying what HRegionInterface extension to use
  219. for proxy server connection. (James Kennedy via Jim Kellerman)
  220. 24. HADOOP-1534. [hbase] Memcache scanner fails if start key not present
  221. 25. HADOOP-1537. Catch exceptions in testCleanRegionServerExit so we can see
  222. what is failing.
  223. 26. HADOOP-1543 [hbase] Add HClient.tableExists
  224. 27. HADOOP-1519 [hbase] map/reduce interface for HBase. (Vuk Ercegovac and
  225. Jim Kellerman)
  226. 28. HADOOP-1523 Hung region server waiting on write locks
  227. 29. HADOOP-1560 NPE in MiniHBaseCluster on Windows
  228. 30. HADOOP-1531 Add RowFilter to HRegion.HScanner
  229. Adds a row filtering interface and two implementations: A page scanner,
  230. and a regex row/column-data matcher. (James Kennedy via Stack)
  231. 31. HADOOP-1566 Key-making utility
  232. 32. HADOOP-1415 Provide configurable per-column bloom filters.
  233. HADOOP-1466 Clean up visibility and javadoc issues in HBase.
  234. 33. HADOOP-1538 Provide capability for client specified time stamps in HBase
  235. HADOOP-1466 Clean up visibility and javadoc issues in HBase.
  236. 34. HADOOP-1589 Exception handling in HBase is broken over client server connections
  237. 35. HADOOP-1375 a simple parser for hbase (Edward Yoon via Stack)
  238. 36. HADOOP-1600 Update license in HBase code
  239. 37. HADOOP-1589 Exception handling in HBase is broken over client server
  240. 38. HADOOP-1574 Concurrent creates of a table named 'X' all succeed
  241. 39. HADOOP-1581 Un-openable tablename bug
  242. 40. HADOOP-1607 [shell] Clear screen command (Edward Yoon via Stack)
  243. 41. HADOOP-1614 [hbase] HClient does not protect itself from simultaneous updates
  244. 42. HADOOP-1468 Add HBase batch update to reduce RPC overhead
  245. 43. HADOOP-1616 Sporadic TestTable failures
  246. 44. HADOOP-1615 Replacing thread notification-based queue with
  247. java.util.concurrent.BlockingQueue in HMaster, HRegionServer
  248. 45. HADOOP-1606 Updated implementation of RowFilterSet, RowFilterInterface
  249. (Izaak Rubin via Stack)
  250. 46. HADOOP-1579 Add new WhileMatchRowFilter and StopRowFilter filters
  251. (Izaak Rubin via Stack)
  252. 47. HADOOP-1637 Fix to HScanner to Support Filters, Add Filter Tests to
  253. TestScanner2 (Izaak Rubin via Stack)
  254. 48. HADOOP-1516 HClient fails to readjust when ROOT or META redeployed on new
  255. region server
  256. 49. HADOOP-1646 RegionServer OOME's under sustained, substantial loading by
  257. 10 concurrent clients
  258. 50. HADOOP-1468 Add HBase batch update to reduce RPC overhead (restrict batches
  259. to a single row at a time)
  260. 51. HADOOP-1528 HClient for multiple tables (phase 1) (James Kennedy & JimK)
  261. 52. HADOOP-1528 HClient for multiple tables (phase 2) all HBase client side code
  262. (except TestHClient and HBaseShell) have been converted to use the new client
  263. side objects (HTable/HBaseAdmin/HConnection) instead of HClient.
  264. 53. HADOOP-1528 HClient for multiple tables - expose close table function
  265. 54. HADOOP-1466 Clean up warnings, visibility and javadoc issues in HBase.
  266. 55. HADOOP-1662 Make region splits faster
  267. 56. HADOOP-1678 On region split, master should designate which host should
  268. serve daughter splits. Phase 1: Master balances load for new regions and
  269. when a region server fails.
  270. 57. HADOOP-1678 On region split, master should designate which host should
  271. serve daughter splits. Phase 2: Master assigns children of split region
  272. instead of HRegionServer serving both children.
  273. 58. HADOOP-1710 All updates should be batch updates
  274. 59. HADOOP-1711 HTable API should use interfaces instead of concrete classes as
  275. method parameters and return values
  276. 60. HADOOP-1644 Compactions should not block updates
  277. 60. HADOOP-1672 HBase Shell should use new client classes
  278. (Edward Yoon via Stack).
  279. 61. HADOOP-1709 Make HRegionInterface more like that of HTable
  280. HADOOP-1725 Client find of table regions should not include offlined, split parents