1
0

CHANGES.txt 16 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300
  1. HBase Change Log
  2. Trunk (unreleased changes)
  3. INCOMPATIBLE CHANGES
  4. HADOOP-2056 A table with row keys containing colon fails to split regions
  5. HADOOP-2079 Fix generated HLog, HRegion names
  6. NEW FEATURES
  7. HADOOP-2061 Add new Base64 dialects
  8. HADOOP-2084 Add a LocalHBaseCluster
  9. HADOOP-2068 RESTful interface (Bryan Duxbury via Stack)
  10. HADOOP-2316 Run REST servlet outside of master
  11. (Bryan Duxbury via Stack)
  12. OPTIMIZATIONS
  13. BUG FIXES
  14. HADOOP-2059 In tests, exceptions in min dfs shutdown should not fail test
  15. (e.g. nightly #272)
  16. HADOOP-2064 TestSplit assertion and NPE failures (Patch build #952 and #953)
  17. HADOOP-2124 Use of `hostname` does not work on Cygwin in some cases
  18. HADOOP-2083 TestTableIndex failed in #970 and #956
  19. HADOOP-2109 Fixed race condition in processing server lease timeout.
  20. HADOOP-2137 hql.jsp : The character 0x19 is not valid
  21. HADOOP-2109 Fix another race condition in processing dead servers,
  22. Fix error online meta regions: was using region name and not
  23. startKey as key for map.put. Change TestRegionServerExit to
  24. always kill the region server for the META region. This makes
  25. the test more deterministic and getting META reassigned was
  26. problematic.
  27. HADOOP-2155 Method expecting HBaseConfiguration throws NPE when given Configuration
  28. HADOOP-2156 BufferUnderflowException for un-named HTableDescriptors
  29. HADOOP-2161 getRow() is orders of magnitudes slower than get(), even on rows
  30. with one column (Clint Morgan and Stack)
  31. HADOOP-2040 Hudson hangs AFTER test has finished
  32. HADOOP-2274 Excess synchronization introduced by HADOOP-2139 negatively
  33. impacts performance
  34. HADOOP-2196 Fix how hbase sits in hadoop 'package' product
  35. HADOOP-2276 Address regression caused by HADOOP-2274, fix HADOOP-2173 (When
  36. the master times out a region servers lease, the region server
  37. may not restart)
  38. HADOOP-2253 getRow can return HBASE::DELETEVAL cells
  39. (Bryan Duxbury via Stack)
  40. HADOOP-2295 Fix assigning a region to multiple servers
  41. HADOOP-2234 TableInputFormat erroneously aggregates map values
  42. HADOOP-2308 null regioninfo breaks meta scanner
  43. HADOOP-2304 Abbreviated symbol parsing error of dir path in jar command
  44. (Edward Yoon via Stack)
  45. HADOOP-2320 Committed TestGet2 is managled (breaks build).
  46. HADOOP-2322 getRow(row, TS) client interface not properly connected
  47. HADOOP-2309 ConcurrentModificationException doing get of all region start keys
  48. HADOOP-2321 TestScanner2 does not release resources which sometimes cause the
  49. test to time out
  50. HADOOP-2315 REST servlet doesn't treat / characters in row key correctly
  51. (Bryan Duxbury via Stack)
  52. HADOOP-2332 Meta table data selection in Hbase Shell
  53. (Edward Yoon via Stack)
  54. HADOOP-2347 REST servlet not thread safe but run in a threaded manner
  55. (Bryan Duxbury via Stack)
  56. HADOOP-2365 Result of HashFunction.hash() contains all identical values
  57. HADOOP-2362 Leaking hdfs file handle on region split
  58. IMPROVEMENTS
  59. HADOOP-2401 Add convenience put method that takes writable
  60. (Johan Oskarsson via Stack)
  61. HADOOP-2074 Simple switch to enable DEBUG level-logging in hbase
  62. HADOOP-2088 Make hbase runnable in $HADOOP_HOME/build(/contrib/hbase)
  63. HADOOP-2126 Use Bob Jenkins' hash for bloom filters
  64. HADOOP-2157 Make Scanners implement Iterable
  65. HADOOP-2176 Htable.deleteAll documentation is ambiguous
  66. HADOOP-2139 (phase 1) Increase parallelism in region servers.
  67. HADOOP-2267 [Hbase Shell] Change the prompt's title from 'hbase' to 'hql'.
  68. (Edward Yoon via Stack)
  69. HADOOP-2139 (phase 2) Make region server more event driven
  70. HADOOP-2289 Useless efforts of looking for the non-existant table in select
  71. command.
  72. (Edward Yoon via Stack)
  73. HADOOP-2257 Show a total of all requests and regions on the web ui
  74. (Paul Saab via Stack)
  75. HADOOP-2261 HTable.abort no longer throws exception if there is no active update.
  76. HADOOP-2287 Make hbase unit tests take less time to complete.
  77. HADOOP-2262 Retry n times instead of n**2 times.
  78. HADOOP-1608 Relational Algrebra Operators
  79. (Edward Yoon via Stack)
  80. HADOOP-2198 HTable should have method to return table metadata
  81. HADOOP-2296 hbase shell: phantom columns show up from select command
  82. HADOOP-2297 System.exit() Handling in hbase shell jar command
  83. (Edward Yoon via Stack)
  84. HADOOP-2224 Add HTable.getRow(ROW, ts)
  85. (Bryan Duxbury via Stack)
  86. HADOOP-2339 Delete command with no WHERE clause
  87. (Edward Yoon via Stack)
  88. HADOOP-2299 Support inclusive scans (Bryan Duxbury via Stack)
  89. HADOOP-2333 Client side retries happen at the wrong level
  90. HADOOP-2357 Compaction cleanup; less deleting + prevent possible file leaks
  91. Release 0.15.1
  92. Branch 0.15
  93. INCOMPATIBLE CHANGES
  94. HADOOP-1931 Hbase scripts take --ARG=ARG_VALUE when should be like hadoop
  95. and do ---ARG ARG_VALUE
  96. NEW FEATURES
  97. HADOOP-1768 FS command using Hadoop FsShell operations
  98. (Edward Yoon via Stack)
  99. HADOOP-1784 Delete: Fix scanners and gets so they work properly in presence
  100. of deletes. Added a deleteAll to remove all cells equal to or
  101. older than passed timestamp. Fixed compaction so deleted cells
  102. do not make it out into compacted output. Ensure also that
  103. versions > column max are dropped compacting.
  104. HADOOP-1720 Addition of HQL (Hbase Query Language) support in Hbase Shell.
  105. The old shell syntax has been replaced by HQL, a small SQL-like
  106. set of operators, for creating, altering, dropping, inserting,
  107. deleting, and selecting, etc., data in hbase.
  108. (Inchul Song and Edward Yoon via Stack)
  109. HADOOP-1913 Build a Lucene index on an HBase table
  110. (Ning Li via Stack)
  111. HADOOP-1957 Web UI with report on cluster state and basic browsing of tables
  112. OPTIMIZATIONS
  113. BUG FIXES
  114. HADOOP-1527 Region server won't start because logdir exists
  115. HADOOP-1723 If master asks region server to shut down, by-pass return of
  116. shutdown message
  117. HADOOP-1729 Recent renaming or META tables breaks hbase shell
  118. HADOOP-1730 unexpected null value causes META scanner to exit (silently)
  119. HADOOP-1747 On a cluster, on restart, regions multiply assigned
  120. HADOOP-1776 Fix for sporadic compaction failures closing and moving
  121. compaction result
  122. HADOOP-1780 Regions are still being doubly assigned
  123. HADOOP-1797 Fix NPEs in MetaScanner constructor
  124. HADOOP-1799 Incorrect classpath in binary version of Hadoop
  125. HADOOP-1805 Region server hang on exit
  126. HADOOP-1785 TableInputFormat.TableRecordReader.next has a bug
  127. (Ning Li via Stack)
  128. HADOOP-1800 output should default utf8 encoding
  129. HADOOP-1801 When hdfs is yanked out from under hbase, hbase should go down gracefully
  130. HADOOP-1813 OOME makes zombie of region server
  131. HADOOP-1814 TestCleanRegionServerExit fails too often on Hudson
  132. HADOOP-1820 Regionserver creates hlogs without bound
  133. (reverted 2007/09/25) (Fixed 2007/09/30)
  134. HADOOP-1821 Replace all String.getBytes() with String.getBytes("UTF-8")
  135. HADOOP-1832 listTables() returns duplicate tables
  136. HADOOP-1834 Scanners ignore timestamp passed on creation
  137. HADOOP-1847 Many HBase tests do not fail well.
  138. HADOOP-1847 Many HBase tests do not fail well. (phase 2)
  139. HADOOP-1870 Once file system failure has been detected, don't check it again
  140. and get on with shutting down the hbase cluster.
  141. HADOOP-1888 NullPointerException in HMemcacheScanner (reprise)
  142. HADOOP-1903 Possible data loss if Exception happens between snapshot and
  143. flush to disk.
  144. HADOOP-1920 Wrapper scripts broken when hadoop in one location and hbase in
  145. another
  146. HADOOP-1923, HADOOP-1924 a) tests fail sporadically because set up and tear
  147. down is inconsistent b) TestDFSAbort failed in nightly #242
  148. HADOOP-1929 Add hbase-default.xml to hbase jar
  149. HADOOP-1941 StopRowFilter throws NPE when passed null row
  150. HADOOP-1966 Make HBase unit tests more reliable in the Hudson environment.
  151. HADOOP-1975 HBase tests failing with java.lang.NumberFormatException
  152. HADOOP-1990 Regression test instability affects nightly and patch builds
  153. HADOOP-1996 TestHStoreFile fails on windows if run multiple times
  154. HADOOP-1937 When the master times out a region server's lease, it is too
  155. aggressive in reclaiming the server's log.
  156. HADOOP-2004 webapp hql formatting bugs
  157. HADOOP_2011 Make hbase daemon scripts take args in same order as hadoop
  158. daemon scripts
  159. HADOOP-2017 TestRegionServerAbort failure in patch build #903 and
  160. nightly #266
  161. HADOOP-2029 TestLogRolling fails too often in patch and nightlies
  162. HADOOP-2038 TestCleanRegionExit failed in patch build #927
  163. IMPROVEMENTS
  164. HADOOP-1737 Make HColumnDescriptor data publically members settable
  165. HADOOP-1746 Clean up findbugs warnings
  166. HADOOP-1757 Bloomfilters: single argument constructor, use enum for bloom
  167. filter types
  168. HADOOP-1760 Use new MapWritable and SortedMapWritable classes from
  169. org.apache.hadoop.io
  170. HADOOP-1793 (Phase 1) Remove TestHClient (Phase2) remove HClient.
  171. HADOOP-1794 Remove deprecated APIs
  172. HADOOP-1802 Startup scripts should wait until hdfs as cleared 'safe mode'
  173. HADOOP-1833 bin/stop_hbase.sh returns before it completes
  174. (Izaak Rubin via Stack)
  175. HADOOP-1835 Updated Documentation for HBase setup/installation
  176. (Izaak Rubin via Stack)
  177. HADOOP-1868 Make default configuration more responsive
  178. HADOOP-1884 Remove useless debugging log messages from hbase.mapred
  179. HADOOP-1856 Add Jar command to hbase shell using Hadoop RunJar util
  180. (Edward Yoon via Stack)
  181. HADOOP-1928 Have master pass the regionserver the filesystem to use
  182. HADOOP-1789 Output formatting
  183. HADOOP-1960 If a region server cannot talk to the master before its lease
  184. times out, it should shut itself down
  185. HADOOP-2035 Add logo to webapps
  186. Below are the list of changes before 2007-08-18
  187. 1. HADOOP-1384. HBase omnibus patch. (jimk, Vuk Ercegovac, and Michael Stack)
  188. 2. HADOOP-1402. Fix javadoc warnings in hbase contrib. (Michael Stack)
  189. 3. HADOOP-1404. HBase command-line shutdown failing (Michael Stack)
  190. 4. HADOOP-1397. Replace custom hbase locking with
  191. java.util.concurrent.locks.ReentrantLock (Michael Stack)
  192. 5. HADOOP-1403. HBase reliability - make master and region server more fault
  193. tolerant.
  194. 6. HADOOP-1418. HBase miscellaneous: unit test for HClient, client to do
  195. 'Performance Evaluation', etc.
  196. 7. HADOOP-1420, HADOOP-1423. Findbugs changes, remove reference to removed
  197. class HLocking.
  198. 8. HADOOP-1424. TestHBaseCluster fails with IllegalMonitorStateException. Fix
  199. regression introduced by HADOOP-1397.
  200. 9. HADOOP-1426. Make hbase scripts executable + add test classes to CLASSPATH.
  201. 10. HADOOP-1430. HBase shutdown leaves regionservers up.
  202. 11. HADOOP-1392. Part1: includes create/delete table; enable/disable table;
  203. add/remove column.
  204. 12. HADOOP-1392. Part2: includes table compaction by merging adjacent regions
  205. that have shrunk in size.
  206. 13. HADOOP-1445 Support updates across region splits and compactions
  207. 14. HADOOP-1460 On shutdown IOException with complaint 'Cannot cancel lease
  208. that is not held'
  209. 15. HADOOP-1421 Failover detection, split log files.
  210. For the files modified, also clean up javadoc, class, field and method
  211. visibility (HADOOP-1466)
  212. 16. HADOOP-1479 Fix NPE in HStore#get if store file only has keys < passed key.
  213. 17. HADOOP-1476 Distributed version of 'Performance Evaluation' script
  214. 18. HADOOP-1469 Asychronous table creation
  215. 19. HADOOP-1415 Integrate BSD licensed bloom filter implementation.
  216. 20. HADOOP-1465 Add cluster stop/start scripts for hbase
  217. 21. HADOOP-1415 Provide configurable per-column bloom filters - part 2.
  218. 22. HADOOP-1498. Replace boxed types with primitives in many places.
  219. 23. HADOOP-1509. Made methods/inner classes in HRegionServer and HClient protected
  220. instead of private for easier extension. Also made HRegion and HRegionInfo public too.
  221. Added an hbase-default.xml property for specifying what HRegionInterface extension to use
  222. for proxy server connection. (James Kennedy via Jim Kellerman)
  223. 24. HADOOP-1534. [hbase] Memcache scanner fails if start key not present
  224. 25. HADOOP-1537. Catch exceptions in testCleanRegionServerExit so we can see
  225. what is failing.
  226. 26. HADOOP-1543 [hbase] Add HClient.tableExists
  227. 27. HADOOP-1519 [hbase] map/reduce interface for HBase. (Vuk Ercegovac and
  228. Jim Kellerman)
  229. 28. HADOOP-1523 Hung region server waiting on write locks
  230. 29. HADOOP-1560 NPE in MiniHBaseCluster on Windows
  231. 30. HADOOP-1531 Add RowFilter to HRegion.HScanner
  232. Adds a row filtering interface and two implemenentations: A page scanner,
  233. and a regex row/column-data matcher. (James Kennedy via Stack)
  234. 31. HADOOP-1566 Key-making utility
  235. 32. HADOOP-1415 Provide configurable per-column bloom filters.
  236. HADOOP-1466 Clean up visibility and javadoc issues in HBase.
  237. 33. HADOOP-1538 Provide capability for client specified time stamps in HBase
  238. HADOOP-1466 Clean up visibility and javadoc issues in HBase.
  239. 34. HADOOP-1589 Exception handling in HBase is broken over client server connections
  240. 35. HADOOP-1375 a simple parser for hbase (Edward Yoon via Stack)
  241. 36. HADOOP-1600 Update license in HBase code
  242. 37. HADOOP-1589 Exception handling in HBase is broken over client server
  243. 38. HADOOP-1574 Concurrent creates of a table named 'X' all succeed
  244. 39. HADOOP-1581 Un-openable tablename bug
  245. 40. HADOOP-1607 [shell] Clear screen command (Edward Yoon via Stack)
  246. 41. HADOOP-1614 [hbase] HClient does not protect itself from simultaneous updates
  247. 42. HADOOP-1468 Add HBase batch update to reduce RPC overhead
  248. 43. HADOOP-1616 Sporadic TestTable failures
  249. 44. HADOOP-1615 Replacing thread notification-based queue with
  250. java.util.concurrent.BlockingQueue in HMaster, HRegionServer
  251. 45. HADOOP-1606 Updated implementation of RowFilterSet, RowFilterInterface
  252. (Izaak Rubin via Stack)
  253. 46. HADOOP-1579 Add new WhileMatchRowFilter and StopRowFilter filters
  254. (Izaak Rubin via Stack)
  255. 47. HADOOP-1637 Fix to HScanner to Support Filters, Add Filter Tests to
  256. TestScanner2 (Izaak Rubin via Stack)
  257. 48. HADOOP-1516 HClient fails to readjust when ROOT or META redeployed on new
  258. region server
  259. 49. HADOOP-1646 RegionServer OOME's under sustained, substantial loading by
  260. 10 concurrent clients
  261. 50. HADOOP-1468 Add HBase batch update to reduce RPC overhead (restrict batches
  262. to a single row at a time)
  263. 51. HADOOP-1528 HClient for multiple tables (phase 1) (James Kennedy & JimK)
  264. 52. HADOOP-1528 HClient for multiple tables (phase 2) all HBase client side code
  265. (except TestHClient and HBaseShell) have been converted to use the new client
  266. side objects (HTable/HBaseAdmin/HConnection) instead of HClient.
  267. 53. HADOOP-1528 HClient for multiple tables - expose close table function
  268. 54. HADOOP-1466 Clean up warnings, visibility and javadoc issues in HBase.
  269. 55. HADOOP-1662 Make region splits faster
  270. 56. HADOOP-1678 On region split, master should designate which host should
  271. serve daughter splits. Phase 1: Master balances load for new regions and
  272. when a region server fails.
  273. 57. HADOOP-1678 On region split, master should designate which host should
  274. serve daughter splits. Phase 2: Master assigns children of split region
  275. instead of HRegionServer serving both children.
  276. 58. HADOOP-1710 All updates should be batch updates
  277. 59. HADOOP-1711 HTable API should use interfaces instead of concrete classes as
  278. method parameters and return values
  279. 60. HADOOP-1644 Compactions should not block updates
  280. 60. HADOOP-1672 HBase Shell should use new client classes
  281. (Edward Yoon via Stack).
  282. 61. HADOOP-1709 Make HRegionInterface more like that of HTable
  283. HADOOP-1725 Client find of table regions should not include offlined, split parents