Hive的数据存储

Hive的数据存储在HDFS的:/user/hive/warehouse中

The /user folder in HDFS is a directory typically used to store user-specific data and configurations. It serves as the home directory for Hadoop users, analogous to the /home directory in Unix-like systems.

Within the /user directory, each user typically has their own subdirectory named after their username. Users can store their data, files, and configurations within their respective subdirectories. This organization helps in managing and segregating user-specific data in a multi-user Hadoop environment.

For example, if a user with the username "john" exists in the Hadoop system, their home directory would be /user/john, and they would have full control over this directory to store and manage their data. Similarly, other users would have their own respective directories under /user.

也就是说相当于Hive在hdfs中创建了一个叫hive的user

结合前面关于Hadoop Proxyuser的知识,可以理解:

Hive Impersonation: Hive, a data warehousing tool on Hadoop, often uses proxyuser configurations to allow HiveServer2 to execute queries on behalf of different users.

我的理解是:

hive--proxy user

其他user想要访问Hive的数据存储(HDFS的:/user/hive/warehouse中),hive用户代理实现的

hadoop--proxy user

而当想访问hdfs的其他存储数据,则是由hadoop用户代理实现的

相关推荐
howard20056 小时前
Hive实战任务 - 9.1 实现词频统计
hive·词频统计
colorknight6 小时前
数据编织-异构数据存储的自动化治理
数据仓库·人工智能·数据治理·数据湖·数据科学·数据编织·自动化治理
满目山河•11 小时前
二、复制三台虚拟机
hive·hadoop·hbase
howard200516 小时前
Hive实战任务 - 9.3 实现学生信息排序和统计
hive·排序·汇总·学生信息
番茄撒旦在上17 小时前
什么样的表适合做拉链表
大数据·数据仓库·hive
YMatrix 官方技术社区17 小时前
YMatrix 高可用详解:3 种镜像策略在节点宕机时表现有何不同?
运维·数据库·数据仓库·ai·数据库开发·数据库架构·ymatrix
howard200517 小时前
Hive实战任务 - 9.5 实现网址去重
hive·网址去重
howard200517 小时前
Hive实战任务 - 9.4 分科汇总求月考平均分
hive·分科汇总月考平均分
howard200518 小时前
Hive实战任务 - 9.2 统计总分与平均分
hive·统计总分与平均分
zhixingheyi_tian21 小时前
Hadoop 之 ENV
大数据·hadoop·分布式