Hive Data Storage

By default, Hive stores its data in HDFS under /user/hive/warehouse (the default value of hive.metastore.warehouse.dir).
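To make the warehouse layout concrete, here is a minimal Python sketch of where a table's directory conventionally ends up. It assumes managed tables created without an explicit LOCATION and the default warehouse setting; the function name table_location and the example table and database names are made up for illustration.

```python
import posixpath

# Default value of hive.metastore.warehouse.dir
DEFAULT_WAREHOUSE = "/user/hive/warehouse"


def table_location(table, database="default", warehouse=DEFAULT_WAREHOUSE):
    """Return the conventional HDFS directory of a managed Hive table.

    Hypothetical helper: tables in the "default" database sit directly
    under the warehouse directory, while tables in any other database
    sit under a "<database>.db" subdirectory.
    """
    # Hive stores identifiers in lowercase, so directory names are lowercase.
    table = table.lower()
    database = database.lower()
    if database == "default":
        return posixpath.join(warehouse, table)
    return posixpath.join(warehouse, database + ".db", table)


print(table_location("orders"))                    # /user/hive/warehouse/orders
print(table_location("orders", database="sales"))  # /user/hive/warehouse/sales.db/orders
```

A table created with an explicit LOCATION, or a cluster with a different hive.metastore.warehouse.dir, will not follow this layout.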

The /user folder in HDFS is a directory typically used to store user-specific data and configurations. It serves as the home directory for Hadoop users, analogous to the /home directory in Unix-like systems.

Within the /user directory, each user typically has their own subdirectory named after their username. Users can store their data, files, and configurations within their respective subdirectories. This organization helps in managing and segregating user-specific data in a multi-user Hadoop environment.

For example, if a user with the username "john" exists in the Hadoop system, their home directory would be /user/john, and they would have full control over this directory to store and manage their data. Similarly, other users would have their own respective directories under /user.
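The naming convention above can be sketched in a few lines of Python. This is only an illustration of the path layout, assuming the default home directory prefix /user (the dfs.user.home.dir.prefix setting); the helper names are hypothetical, and the sketch does not check HDFS permissions.

```python
import posixpath


def hdfs_home_dir(username, prefix="/user"):
    """Return the conventional HDFS home directory of a user."""
    return posixpath.join(prefix, username)


def is_inside_home(path, username, prefix="/user"):
    """Return True if path is the user's home directory or lies beneath it."""
    home = hdfs_home_dir(username, prefix)
    normalized = posixpath.normpath(path)
    # Compare against home + "/" so that /user/johnny does not match john.
    return normalized == home or normalized.startswith(home + "/")


print(hdfs_home_dir("john"))                           # /user/john
print(is_inside_home("/user/john/data/a.csv", "john")) # True
print(is_inside_home("/user/johnny/a.csv", "john"))    # False
print(is_inside_home("/user/hive/warehouse", "hive"))  # True
```

The last call shows why the warehouse path looks the way it does: /user/hive/warehouse sits inside the home directory of the hive user.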

In other words, it is as if Hive had a user named hive in HDFS, with /user/hive as that user's home directory.

Combined with the earlier notes on Hadoop proxyuser, this makes sense:

Hive Impersonation: Hive, a data warehousing tool on Hadoop, often uses proxyuser configurations to allow HiveServer2 to execute queries on behalf of different users.
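As a rough illustration of the proxyuser configuration mentioned above, the Python sketch below generates the core-site.xml properties that let a service user impersonate others. The property names hadoop.proxyuser.<user>.hosts and hadoop.proxyuser.<user>.groups are the standard Hadoop ones; the host name, the group name, and both helper functions are assumptions made up for the example.

```python
def proxyuser_properties(superuser, hosts="*", groups="*"):
    """Build the Hadoop proxyuser properties for one service user.

    hosts:  hosts from which the service user may impersonate others
    groups: groups whose members may be impersonated
    """
    prefix = "hadoop.proxyuser." + superuser
    return {
        prefix + ".hosts": hosts,
        prefix + ".groups": groups,
    }


def to_property_xml(properties):
    """Render properties as <property> elements for a Hadoop *-site.xml file."""
    lines = []
    for name, value in properties.items():
        lines.append("<property>")
        lines.append("  <name>%s</name>" % name)
        lines.append("  <value>%s</value>" % value)
        lines.append("</property>")
    return "\n".join(lines)


# Hypothetical values: a HiveServer2 host and a group of end users.
props = proxyuser_properties("hive", hosts="hs2-host.example.com", groups="analysts")
print(to_property_xml(props))
```

On the Hive side, whether HiveServer2 actually runs queries as the connecting user is controlled by hive.server2.enable.doAs in hive-site.xml; the properties above only grant the hive user permission to do so.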

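My understanding is:

hive as proxy user

When other users want to access Hive's data storage (/user/hive/warehouse in HDFS), the access goes through the hive user acting as a proxy.

hadoop as proxy user

When they want to access other data stored in HDFS, the access goes through the hadoop user acting as a proxy.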
