easyexcel读文件入批量入es

  1. 封装实体类,并对应excel表中的列

    @Data
    public class User {

    复制代码
     private String md5;
    
     private String id; 
     @ExcelProperty(value = "age")
     private String age;
     @ExcelProperty(value = "username")
     private String name;

    }

  2. 批量入库

    复制代码
    private void insertBatchToES(List<User> dataList, String indexName) {
         try {
             BulkProcessor bulkProcessor = BulkProcessor.builder(
                     (request, bulkListener) -> elasticsearchClient.bulkAsync(request, RequestOptions.DEFAULT, bulkListener),
                     new BulkProcessor.Listener() {
    
                         @Override
                         public void beforeBulk(long executionId, org.elasticsearch.action.bulk.BulkRequest request) {
                             // 准备执行前的操作
                         }
    
                         @Override
                         public void afterBulk(long executionId, org.elasticsearch.action.bulk.BulkRequest request, org.elasticsearch.action.bulk.BulkResponse response) {
                             if (response != null) {
                                 int insertedCount = request.numberOfActions(); // 获取请求中操作的数量,即插入的条数
                                 log.info("批量插入 " + insertedCount + " 条数据成功");
                             }
                         }
    
                         @Override
                         public void afterBulk(long executionId, org.elasticsearch.action.bulk.BulkRequest request, Throwable failure) {
                             log.info("批量插入 error");
                         }
                     })
                     // 设置每1000个请求执行一次批处理
                     .setBulkActions(500)
                     .build();
    
    
    
             for(User user : dataList) {
                 String jsonString = convertToJson(user);
                 IndexRequest indexRequest = new IndexRequest(indexName)
                         .id(user.getId())
                         .source(jsonString, XContentType.JSON);
                 bulkProcessor.add(indexRequest);
             }
             bulkProcessor.awaitClose(10, TimeUnit.MINUTES);
             bulkProcessor.close();
    
         } catch (InterruptedException | JsonProcessingException e) {
             e.printStackTrace();
         }
     } 

将对象转json工具类:

复制代码
 public String convertToJson(user) throws JsonProcessingException {
        String objStr = JSON.toJSONString(user, SerializerFeature.WriteNullListAsEmpty, SerializerFeature.WriteNullNumberAsZero,
                SerializerFeature.WriteNullStringAsEmpty, SerializerFeature.NotWriteDefaultValue);

        return objStr;
    }
  1. 读指定文件excel , 封装List

    public void importExcelToES(String excelFilePath, String indexName) {
    try {
    EasyExcel.read(excelFilePath, User.class, new AnalysisEventListener<User>() {
    private List<User> dataList = new ArrayList<>();

    复制代码
                 @Override
                 public void invoke(UserFansExcel data, AnalysisContext analysisContext) {
                     long id = generator.nextId();
                     data.setId(String.valueOf(id));
                   
                    
                     if (dataList.size() >= 500) {
                         insertBatchToES(filteredList, indexName);
                         dataList.clear();
                     }
                 }
    
                 @Override
                 public void doAfterAllAnalysed(AnalysisContext analysisContext) {
                     if (!dataList.isEmpty()) {
                         insertBatchToES(dataList, indexName);
                     }
                 }
             }).sheet().doRead();
         } catch (Exception e) {
             e.printStackTrace();
         }

3.1 读执行目录下的所有excel文件,这些文件的格式是一样的

复制代码
public void readExcelFilesFromDirectory(String directoryPath) throws IOException {
        List<User> dataList = new ArrayList<>();
        File dir = new File(directoryPath);
        File[] files = dir.listFiles((d, name) -> name.endsWith(".xlsx"));

        if (files != null) {
            for (File file : files) {
                System.out.println(file.getName());
                try {
                    String primaryUserId = file.getName().replace(".xlsx", "");
                    try (FileInputStream fis = new FileInputStream(file)) {
                        EasyExcel.read(fis, User.class, new AnalysisEventListener<User>() {


                            @Override
                            public void invoke(User data, AnalysisContext context) {
                                data.setName(primaryUserId);
                               
                                dataList.add(data);
                            }

                            @Override
                            public void doAfterAllAnalysed(AnalysisContext analysisContext) {

                            }



                        }).sheet().doRead();
                    }
                } catch (Exception e) {
                    e.printStackTrace();
                }

           if(dataList.size() >0){
              //这里可以插入数据库
               dataList.clear();
           }

            }
        }
        
    }
相关推荐
稚辉君.MCA_P8_Java18 分钟前
深入理解 TCP;场景复现,掌握鲜为人知的细节
java·linux·网络·tcp/ip·kubernetes
熊猫比分站18 分钟前
[特殊字符] Java/Vue 实现体育比分直播系统,支持多端实时更新
java·开发语言·vue.js
lang2015092838 分钟前
深入掌握 Maven Settings:从配置到实战
java·maven
scx_link41 分钟前
修改JetBrains产品(IntelliJ IDEA 、PyCharm等软件)的默认插件和日志的存储位置
java·pycharm·intellij-idea
BUG?不,是彩蛋!41 分钟前
Maven-Java 项目到底解决了什么痛点?
java·servlet·maven
小池先生42 分钟前
idea配置代码注释模板
java·ide·intellij-idea
inferno42 分钟前
Maven基础(一)
java·开发语言·maven
摇滚侠1 小时前
Spring Boot3零基础教程,Reactive-Stream 规范核心接口,笔记103
java·spring boot·笔记
程序猿小蒜2 小时前
基于springboot的校园社团信息管理系统开发与设计
java·前端·spring boot·后端·spring
兔兔爱学习兔兔爱学习2 小时前
Spring Al学习9:模型上下文协议(MCP)
java·学习·spring