HBase
403 FOLLOWERS
HBase is called the Hadoop database because it is a NoSQL database that runs on top of Hadoop. This blog is about Apache HBase, the open source, distributed key value store.
HBase
1M ago
By Lars Hofhansl
My last post here was in 2019.
BigData has had an "interesting" journey. From the "thing" that everybody needed but did not quite understand, to lots of fragmented solutions, to some very large installations, to partial irrelevance due to the public clouds and some kind of coming back to "small" data.
HDFS, HBase and Phoenix were part of this ride, and the almost 10 years I worked on those were some of the best in my career. Open Source is fun, and I was lucky enough to provide 100's of changes and improvements to these projects.
Since 2020 I had worked a bit on Trino, Kafka ..read more
HBase
5y ago
By Lars Hofhansl Why Local Indexes? Local indexes are a powerful tool in the HBase/Phoenix toolbox. They are (1) naturally and cheaply transactional, they (2) avoid creating extra index tables, (3) provide the best write write performance, and (4) they can small since they work uncovered in all cases (i.e. you do not have to include extra columns so that a query can be answered from only the index). Problems with Local Indexes?Local indexes work by maintaining an index per HBase region, each region has its own local index. At read-time, for a query along the index that means that each regi ..read more