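Two Ways to Pre-split Regions When Creating an HBase Table

The examples below each create a table with pre-split regions, then write test rows to see how the two methods distribute them.

1. Fixed split points

Example: rowkeys prefixed with a date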

create 'test','cf1', SPLITS => ['202401', '202402', '202403']

put 'test','20240101','cf1:name','20240101'

put 'test','20240102','cf1:name','20240102'

put 'test','20240103','cf1:name','20240103'

put 'test','20240201','cf1:name','20240201'

put 'test','20240202','cf1:name','20240202'

put 'test','20240203','cf1:name','20240203'

put 'test','20240301','cf1:name','20240301'

put 'test','20240302','cf1:name','20240302'

put 'test','20240303','cf1:name','20240303'

put 'test','20240304','cf1:name','20240304'

Each row was routed to the region whose key range covers its prefix:

Table Regions

Name | Region Server | Start Key | End Key | Locality | Requests
test,1714377792028.5582f2cefddffb29a9cb8a47b404df63. | whtpiodscshd02t,21302,1710927704067 | (empty) | 202401 | 0.0 | 0
test,202401,1714377792028.fc66ef2c38740a635f5cea810d7f4d8d. | whtpiodscshd01t,21302,1710927618816 | 202401 | 202402 | 0.0 | 3
test,202402,1714377792028.e72ba47f141297e012ab350cba56ad5c. | whtpiodscshd03t,21302,1710927771892 | 202402 | 202403 | 0.0 | 3
test,202403,1714377792028.cd19ee74662ef8ba393464da96213aba. | whtpiodscshd02t,21302,1710927704067 | 202403 | (empty) | 0.0 | 4
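The routing above can be reproduced offline. The following is a minimal sketch, not HBase code: it assumes a region covers the half-open range [start key, end key) and that keys compare lexicographically, which for these ASCII rowkeys matches Python string comparison. The names `region_index` and `rows_per_region` are made up for illustration.

```python
from bisect import bisect_right

# Split points from the create statement; N split points give N + 1 regions.
SPLITS = ["202401", "202402", "202403"]

# The ten rowkeys written by the put commands above.
ROWKEYS = [
    "20240101", "20240102", "20240103",
    "20240201", "20240202", "20240203",
    "20240301", "20240302", "20240303", "20240304",
]


def region_index(rowkey, splits):
    """Return the index of the region whose [start, end) range holds rowkey.

    Region 0 has an empty start key; region i starts at splits[i - 1].
    Start keys are inclusive, so a rowkey equal to a split point belongs
    to the region that begins there, which is what bisect_right returns.
    """
    return bisect_right(splits, rowkey)


def rows_per_region(rowkeys, splits):
    """Count how many rowkeys fall into each region, in region order."""
    counts = [0] * (len(splits) + 1)
    for key in rowkeys:
        counts[region_index(key, splits)] += 1
    return counts


print(rows_per_region(ROWKEYS, SPLITS))  # [0, 3, 3, 4]
```

The counts agree with the Requests column in the table: nothing in the first region, three rows each for the 202401 and 202402 regions, and four for 202403.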

2. Hash-based splits

HBase ships with built-in pre-split algorithms; the two tested here are HexStringSplit and UniformSplit.

1. HexStringSplit algorithm

Example 1:

create 'test2', {NAME => 'cf1'},{NUMREGIONS => 4, SPLITALGO => 'HexStringSplit'}

put 'test2','20240101','cf1:name','20240101'

put 'test2','20240102','cf1:name','20240102'

put 'test2','20240103','cf1:name','20240103'

put 'test2','20240201','cf1:name','20240201'

put 'test2','20240202','cf1:name','20240202'

put 'test2','20240203','cf1:name','20240203'

put 'test2','20240301','cf1:name','20240301'

put 'test2','20240302','cf1:name','20240302'

put 'test2','20240303','cf1:name','20240303'

put 'test2','20240304','cf1:name','20240304'

Table Regions

Name | Region Server | Start Key | End Key | Locality | Requests
test2,1714382074838.ad061d93d07dea90c587ce00e5ad56c0. | whtpiodscshd03t,21302,1710927771892 | (empty) | 40000000 | 0.0 | 10
test2,40000000,1714382074838.c803b1e07c2390095f67041d16771906. | whtpiodscshd01t,21302,1710927618816 | 40000000 | 80000000 | 0.0 | 0
test2,80000000,1714382074838.c17c9a422fece545f968b4652a8d049a. | whtpiodscshd02t,21302,1710927704067 | 80000000 | c0000000 | 0.0 | 0
test2,c0000000,1714382074838.77e83f95f8fa8dc42210a666dc76f126. | whtpiodscshd01t,21302,1710927618816 | c0000000 | (empty) | 0.0 | 0
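All ten rows ended up in the first region. A rough sketch of why: the boundaries look like an even division of the 8-digit hex key space, and every date-prefixed rowkey starts with "2", which sorts before "40000000". The arithmetic below is an assumption chosen to reproduce the boundaries in the table, not a copy of the HBase implementation, and `hex_split_points` is a hypothetical helper name.

```python
from bisect import bisect_right


def hex_split_points(num_regions, width=8):
    """Evenly divide the hex key space of the given width into num_regions.

    Assumption: boundary i is i * 16**width // num_regions, written as
    lowercase, zero-padded hex. This reproduces the boundaries observed
    for test2 but is only an approximation of what HBase does.
    """
    space = 16 ** width
    return [
        format(i * space // num_regions, "0{}x".format(width))
        for i in range(1, num_regions)
    ]


def rows_per_region(rowkeys, splits):
    """Count rowkeys per region, treating each region as [start, end)."""
    counts = [0] * (len(splits) + 1)
    for key in rowkeys:
        counts[bisect_right(splits, key)] += 1
    return counts


ROWKEYS = [
    "20240101", "20240102", "20240103",
    "20240201", "20240202", "20240203",
    "20240301", "20240302", "20240303", "20240304",
]

SPLITS = hex_split_points(4)
print(SPLITS)                            # ['40000000', '80000000', 'c0000000']
print(rows_per_region(ROWKEYS, SPLITS))  # [10, 0, 0, 0]
```

HexStringSplit is meant for rowkeys that are themselves uniformly distributed hex strings, such as an MD5 checksum, so date-prefixed keys like these get no benefit from it.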

2. UniformSplit algorithm

Example 2:

create 'test3', {NAME => 'cf1'},{NUMREGIONS => 4, SPLITALGO => 'UniformSplit'}

put 'test3','acdzdf4ae5rew','cf1:name','acdzdf4ae5rew'

put 'test3','acdzdfaczerew','cf1:name','acdzdfaczerew'

put 'test3','edddddddddfdd','cf1:name','edddddddddfdd'

put 'test3','acdzdfacaerew','cf1:name','acdzdfacaerew'

put 'test3','acdzd12344rew','cf1:name','acdzd12344rew'

put 'test3','acdzd44caerew','cf1:name','acdzd44caerew'

put 'test3','acdzdfa123rew','cf1:name','acdzdfa123rew'

put 'test3','acdzdfaxaerew','cf1:name','acdzdfaxaerew'

put 'test3','acdzdfadfcrew','cf1:name','acdzdfadfcrew'

put 'test3','acdzdfac1erew','cf1:name','acdzdfac1erew'

Table Regions

Name | Region Server | Start Key | End Key | Locality | Requests
test3,1714382196221.51c3306c9ef13a9251fb0b184c077711. | whtpiodscshd02t,21302,1710927704067 | (empty) | @\x00\x00\x00\x00\x00\x00\x00 | 0.0 | 10
test3,@\x00\x00\x00\x00\x00\x00\x00,1714382196221.17b5d102fee381c30255e51692b3050d. | whtpiodscshd01t,21302,1710927618816 | @\x00\x00\x00\x00\x00\x00\x00 | \x80\x00\x00\x00\x00\x00\x00\x00 | 0.0 | 10
test3,\x80\x00\x00\x00\x00\x00\x00\x00,1714382196221.4348f72d52953c670f0cfb90a329ad2b. | whtpiodscshd03t,21302,1710927771892 | \x80\x00\x00\x00\x00\x00\x00\x00 | \xC0\x00\x00\x00\x00\x00\x00\x00 | 0.0 | 0
test3,\xC0\x00\x00\x00\x00\x00\x00\x00,1714382196221.50b797c3caaf12d54068e323e8e65ba4. | whtpiodscshd01t,21302,1710927618816 | \xC0\x00\x00\x00\x00\x00\x00\x00 | (empty) | 0.0 | 0
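The UniformSplit boundaries are raw bytes rather than printable strings: "@" is byte 0x40, followed by 0x80 and 0xC0. The sketch below assumes the boundaries come from evenly dividing an 8-byte key space and that rowkeys are compared byte by byte; `uniform_split_points` is a hypothetical helper, not an HBase API. It models key placement only, not how the Requests column is counted.

```python
from bisect import bisect_right


def uniform_split_points(num_regions, width=8):
    """Evenly divide the raw byte key space of the given width.

    Assumption: boundary i is i * 256**width // num_regions, encoded as
    big-endian bytes. This reproduces the boundaries observed for test3
    but is only an approximation of what HBase does.
    """
    space = 256 ** width
    return [
        (i * space // num_regions).to_bytes(width, "big")
        for i in range(1, num_regions)
    ]


def rows_per_region(rowkeys, splits):
    """Count rowkeys per region, treating each region as [start, end)."""
    counts = [0] * (len(splits) + 1)
    for key in rowkeys:
        counts[bisect_right(splits, key)] += 1
    return counts


# The ten rowkeys written by the put commands, as ASCII bytes.
ROWKEYS = [
    key.encode("ascii")
    for key in (
        "acdzdf4ae5rew", "acdzdfaczerew", "edddddddddfdd", "acdzdfacaerew",
        "acdzd12344rew", "acdzd44caerew", "acdzdfa123rew", "acdzdfaxaerew",
        "acdzdfadfcrew", "acdzdfac1erew",
    )
]

SPLITS = uniform_split_points(4)
print(SPLITS[0])                         # b'@\x00\x00\x00\x00\x00\x00\x00'
print(rows_per_region(ROWKEYS, SPLITS))  # [0, 10, 0, 0]
```

Every test rowkey starts with "a" (0x61) or "e" (0x65), both between 0x40 and 0x80, so the sketch places all ten in the second region, in line with the 10 requests shown for it. UniformSplit spreads load only when the leading bytes of the rowkeys are roughly uniformly distributed.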
