Hive Data Storage

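By default, Hive stores its table data in HDFS under /user/hive/warehouse (the location is controlled by the hive.metastore.warehouse.dir property).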
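To make the layout concrete, here is a minimal sketch of how a managed table's directory is conventionally derived from the warehouse root. The helper name `managed_table_path` and the table and database names are made up for illustration, and the sketch assumes the default layout; a table created with an explicit LOCATION, or a cluster with a different warehouse setting, will not follow it.

```python
import posixpath

# Default value of hive.metastore.warehouse.dir; a cluster may override it.
WAREHOUSE_DIR = "/user/hive/warehouse"


def managed_table_path(table, database="default", warehouse_dir=WAREHOUSE_DIR):
    """Return the conventional HDFS directory of a managed table.

    Hypothetical helper for illustration only; it is not part of Hive.
    """
    if database == "default":
        # Tables in the default database sit directly under the warehouse root.
        return posixpath.join(warehouse_dir, table)
    # Every other database gets its own "<database>.db" subdirectory.
    return posixpath.join(warehouse_dir, database + ".db", table)


print(managed_table_path("orders"))
print(managed_table_path("orders", database="sales"))
```

The two calls print /user/hive/warehouse/orders and /user/hive/warehouse/sales.db/orders respectively.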

The /user folder in HDFS is a directory typically used to store user-specific data and configurations. It serves as the home directory for Hadoop users, analogous to the /home directory in Unix-like systems.

Within the /user directory, each user typically has their own subdirectory named after their username. Users can store their data, files, and configurations within their respective subdirectories. This organization helps in managing and segregating user-specific data in a multi-user Hadoop environment.

For example, if a user with the username "john" exists in the Hadoop system, their home directory would be /user/john, and they would have full control over this directory to store and manage their data. Similarly, other users would have their own respective directories under /user.
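The naming rule in the example above can be sketched in a few lines. Both function names are hypothetical, and the "/user" prefix is assumed to be the cluster default (as far as I know it is configurable through dfs.user.home.dir.prefix). The check only compares paths; real access control is still decided by HDFS permissions.

```python
import posixpath


def hdfs_home_dir(username, prefix="/user"):
    """Build the conventional HDFS home directory for a user (illustrative helper)."""
    return posixpath.join(prefix, username)


def is_inside_home(path, username, prefix="/user"):
    """Return True if `path` lies in the user's home directory after normalization."""
    home = hdfs_home_dir(username, prefix)
    normalized = posixpath.normpath(path)
    # Match the home directory itself or anything strictly below it,
    # so that /user/johnny is not mistaken for part of /user/john.
    return normalized == home or normalized.startswith(home + "/")


print(hdfs_home_dir("john"))
print(is_inside_home("/user/john/data/a.csv", "john"))
print(is_inside_home("/user/hive/warehouse", "john"))
```

By the same rule, /user/hive is simply the home directory of the hive account.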

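In other words, /user/hive is the HDFS home directory of a user named hive, the account that the Hive services typically run as. (HDFS does not create users itself; it takes identities from the operating system or from Kerberos.)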

Combined with the earlier material on Hadoop proxyuser, this becomes easier to understand:

Hive Impersonation: Hive, a data warehousing tool on Hadoop, often uses proxyuser configurations to allow HiveServer2 to execute queries on behalf of different users.

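My understanding is:

hive as proxy user

When other users access Hive's data storage (/user/hive/warehouse in HDFS) through HiveServer2, the hive user acts as their proxy. HiveServer2 runs as hive and, when impersonation is enabled (hive.server2.enable.doAs=true), accesses HDFS on behalf of the connecting user. For this to work, hive must be allowed as a proxy user in Hadoop's core-site.xml, through the hadoop.proxyuser.hive.hosts and hadoop.proxyuser.hive.groups properties.

hadoop as proxy user

Access to other data in HDFS follows the same pattern: whichever service account sits in front of that data (for example, a hadoop user) must be configured as a proxy user through its own hadoop.proxyuser.<name>.* properties before it can act for other users. A user who talks to HDFS directly under their own identity needs no proxy.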
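The proxy relationship can be illustrated with a simplified model of the check Hadoop performs when a service account tries to act for another user. This is a sketch, not Hadoop's actual implementation: the property names hadoop.proxyuser.<name>.hosts and hadoop.proxyuser.<name>.groups are real core-site.xml settings, but the function names, the host name, and the group names are assumptions made up for the example, and the real check has more rules (such as the .users property and IP ranges).

```python
def parse_list(value):
    """Parse a comma-separated setting; '*' means 'no restriction' (returned as None)."""
    value = value.strip()
    if value == "*":
        return None
    return {item.strip() for item in value.split(",") if item.strip()}


def may_impersonate(conf, proxy_user, end_user_groups, client_host):
    """Simplified model: may `proxy_user` act for a user in `end_user_groups`?"""
    hosts_key = f"hadoop.proxyuser.{proxy_user}.hosts"
    groups_key = f"hadoop.proxyuser.{proxy_user}.groups"
    # In this model, an account with no proxyuser settings may not impersonate anyone.
    if hosts_key not in conf or groups_key not in conf:
        return False
    allowed_hosts = parse_list(conf[hosts_key])
    allowed_groups = parse_list(conf[groups_key])
    host_ok = allowed_hosts is None or client_host in allowed_hosts
    group_ok = allowed_groups is None or bool(allowed_groups & set(end_user_groups))
    return host_ok and group_ok


# Hypothetical core-site.xml values, written as a plain dict.
conf = {
    "hadoop.proxyuser.hive.hosts": "hs2.example.com",
    "hadoop.proxyuser.hive.groups": "analysts,etl",
}

print(may_impersonate(conf, "hive", ["analysts"], "hs2.example.com"))
print(may_impersonate(conf, "hive", ["interns"], "hs2.example.com"))
print(may_impersonate(conf, "oozie", ["analysts"], "hs2.example.com"))
```

With this configuration only the first call succeeds: hive may act for a member of analysts when the request comes from the listed host, while the other two calls are rejected because the group is not allowed or the account has no proxyuser settings.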
