Hive的数据存储

zrxrz2024-03-10 13:24

Hive的数据存储在HDFS的：/user/hive/warehouse中

The /user folder in HDFS is a directory typically used to store user-specific data and configurations. It serves as the home directory for Hadoop users, analogous to the /home directory in Unix-like systems.

Within the /user directory, each user typically has their own subdirectory named after their username. Users can store their data, files, and configurations within their respective subdirectories. This organization helps in managing and segregating user-specific data in a multi-user Hadoop environment.

For example, if a user with the username "john" exists in the Hadoop system, their home directory would be /user/john, and they would have full control over this directory to store and manage their data. Similarly, other users would have their own respective directories under /user.

也就是说相当于Hive在hdfs中创建了一个叫hive的user

结合前面关于Hadoop Proxyuser的知识，可以理解：

Hive Impersonation: Hive, a data warehousing tool on Hadoop, often uses proxyuser configurations to allow HiveServer2 to execute queries on behalf of different users.

我的理解是：

hive--proxy user

其他user想要访问Hive的数据存储(HDFS的：/user/hive/warehouse中)，hive用户代理实现的

hadoop--proxy user

而当想访问hdfs的其他存储数据，则是由hadoop用户代理实现的