Importing into HDFS and Hive with Sqoop

昨夜花落尽 2024-12-16 16:50

Importing a MySQL table into HDFS with Sqoop

sqoop import \
--connect jdbc:mysql://192.168.52.150/test \
--username root \
--password 123456 \
--table emp \
--delete-target-dir \
--target-dir '/sqoop_works/emp_1'
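
To check that the import worked, the target directory can be listed and the first few rows printed. The snippet below is a minimal sketch, assuming the hdfs client is on the PATH and that Sqoop wrote its default text output (files named part-m-*, with comma-separated fields).

```shell
# Target directory used by the import command above.
TARGET_DIR='/sqoop_works/emp_1'

if command -v hdfs >/dev/null 2>&1; then
  # List the files Sqoop produced (one part-m-* file per mapper).
  hdfs dfs -ls "${TARGET_DIR}"
  # Print the first rows; the glob is quoted so that HDFS expands it, not the local shell.
  hdfs dfs -cat "${TARGET_DIR}/part-m-*" | head -n 10
else
  echo "hdfs client not found; skipping check of ${TARGET_DIR}"
fi
```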

To import data into Hive, first create the target table in Hive:

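-- Create the database and the target table before running the import.
create database hivesqoop;
use hivesqoop;
-- Column names should match those of the MySQL source table emp_add,
-- because Sqoop's HCatalog integration does not remap column names.
-- The delimiter clause has no effect on how the ORC data is stored.
create table hivesqoop.emp_add_hive(
    id int,
    hon string,
    street string,
    city string
)
row format delimited fields terminated by '\t'
stored as orc;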

Then run the import:

sqoop import \
--connect jdbc:mysql://192.168.52.150/test \
--username root \
--password 123456 \
--table emp_add \
--hcatalog-database hivesqoop \
--hcatalog-table emp_add_hive \
-m 1

Importing incremental data into HDFS

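Add --where 'id >= 120' to the import command. The condition must be quoted; otherwise the shell treats > as an output redirection.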
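
Put together, an import restricted to the newer rows might look like the sketch below. The connection settings and the emp table are taken from the first example; the target directory /sqoop_works/emp_incr and the helper name run_incremental_import are hypothetical, chosen only for illustration.

```shell
# Lower bound for the id column; 120 is the value used in the text above.
MIN_ID=120
WHERE_CLAUSE="id >= ${MIN_ID}"
# Hypothetical target directory, used only in this example.
TARGET_DIR='/sqoop_works/emp_incr'

run_incremental_import() {
  # The where clause is passed as one quoted argument.
  sqoop import \
    --connect jdbc:mysql://192.168.52.150/test \
    --username root \
    --password 123456 \
    --table emp \
    --where "${WHERE_CLAUSE}" \
    --target-dir "${TARGET_DIR}" \
    -m 1
}

# Run the import only where the sqoop client is installed.
if command -v sqoop >/dev/null 2>&1; then
  run_incremental_import
else
  echo "sqoop not found; would import rows matching: ${WHERE_CLAUSE}"
fi
```

Sqoop also has a built-in incremental mode (--incremental append --check-column id --last-value 119), which imports rows whose check column is greater than the last value and so avoids hand-writing the condition.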

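Exporting from Hive to MySQL is the same process in the opposite direction, using sqoop export instead of sqoop import.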
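
As a rough sketch of that reverse direction, the command below exports the Hive table created earlier back into MySQL through HCatalog. The MySQL table name emp_add_export and the helper name run_hive_export are hypothetical; the sketch assumes that table already exists in the test database with columns matching emp_add_hive.

```shell
# Hypothetical MySQL target table; it must exist before the export runs.
EXPORT_TABLE='emp_add_export'

run_hive_export() {
  # Read from the Hive table via HCatalog and write rows into MySQL.
  sqoop export \
    --connect jdbc:mysql://192.168.52.150/test \
    --username root \
    --password 123456 \
    --table "${EXPORT_TABLE}" \
    --hcatalog-database hivesqoop \
    --hcatalog-table emp_add_hive \
    -m 1
}

# Run the export only where the sqoop client is installed.
if command -v sqoop >/dev/null 2>&1; then
  run_hive_export
else
  echo "sqoop not found; would export hivesqoop.emp_add_hive to ${EXPORT_TABLE}"
fi
```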
