Hive的数据存储

Hive的数据存储在HDFS的:/user/hive/warehouse中

The /user folder in HDFS is a directory typically used to store user-specific data and configurations. It serves as the home directory for Hadoop users, analogous to the /home directory in Unix-like systems.

Within the /user directory, each user typically has their own subdirectory named after their username. Users can store their data, files, and configurations within their respective subdirectories. This organization helps in managing and segregating user-specific data in a multi-user Hadoop environment.

For example, if a user with the username "john" exists in the Hadoop system, their home directory would be /user/john, and they would have full control over this directory to store and manage their data. Similarly, other users would have their own respective directories under /user.

也就是说相当于Hive在hdfs中创建了一个叫hive的user

结合前面关于Hadoop Proxyuser的知识,可以理解:

Hive Impersonation: Hive, a data warehousing tool on Hadoop, often uses proxyuser configurations to allow HiveServer2 to execute queries on behalf of different users.

我的理解是:

hive--proxy user

其他user想要访问Hive的数据存储(HDFS的:/user/hive/warehouse中),hive用户代理实现的

hadoop--proxy user

而当想访问hdfs的其他存储数据,则是由hadoop用户代理实现的

相关推荐
小邓睡不饱耶2 天前
Hadoop 3.x 企业级实战指南:从纠删码到云原生容器化
大数据·hadoop·云原生
makabalala3 天前
如何利用YashanDB进行数据仓库构建
数据仓库
禅与计算机程序设计艺术3 天前
了解NoSQL的数据仓库和ETL
数据库·数据仓库·nosql·etl
好学且牛逼的马3 天前
从“配置地狱“到“云原生时代“:Spring Boot 1.x到4.x演进全记录与核心知识点详解
hive·spring boot·云原生
派可数据BI可视化4 天前
一文读懂系列:数据仓库为什么分层,分几层?数仓建模方法有哪些
大数据·数据仓库·信息可视化·spark·商业智能bi
Light604 天前
不止于名:领码 SPARK 如何“链”动数据仓库、数据湖、中台与湖仓一体新纪元
大数据·数据仓库·数据湖·ipaas·湖仓一体·数据中台·领码 spark
郑小憨4 天前
FlinkSQL窗口函数TUMBLE、SESSION 和 HOP的区别
大数据·数据仓库·sql·flink·database
码字的字节4 天前
锚点模型:数据仓库中的高度可扩展建模技术详解
大数据·数据仓库·spark
Agentic AI人工智能与大数据4 天前
大数据领域数据仓库的ETL任务优化
大数据·数据仓库·ai·etl