使用Java合并PDF文档

本文将介绍如何使用Java中的iText库来合并多个PDF文档为一个。我们将使用iText库提供的功能来创建新的PDF文档，并将现有的PDF页面添加到其中。我们将展示如何使用Maven构建项目，并添加所需的依赖项。接下来，我们将提供一个示例代码，演示如何合并PDF文档。

添加Maven依赖

首先，我们需要在项目的pom.xml文件中添加iText库的依赖项。请确保你的项目已经使用了Maven进行管理。在dependencies标签中添加以下代码：

xml 复制代码

<dependencies>
    <dependency>
        <groupId>com.itextpdf</groupId>
        <artifactId>itextpdf</artifactId>
        <version>5.5.13</version>
    </dependency>
</dependencies>

这将告诉Maven下载并引入iText库，以便我们可以在项目中使用它。

编写合并PDF的工具类

现在我们将编写一个工具类，其中包含合并PDF文档的方法。我们将使用iText库来实现这个功能。以下是一个名为PdfMergeUtil的工具类的代码：

java 复制代码

import com.itextpdf.text.Document;
import com.itextpdf.text.DocumentException;
import com.itextpdf.text.pdf.*;

import java.io.FileOutputStream;
import java.io.IOException;
import java.util.List;

public class PdfMergeUtil {

    /**
     * 合并多个PDF文档为一个
     *
     * @param sourceFiles 要合并的PDF文件路径列表
     * @param outputFile  合并后的PDF文件输出路径
     * @throws IOException       如果读取文件或写入文件时发生错误
     * @throws DocumentException 如果创建PDF文档时发生错误
     */
    public static void mergePdfFiles(List<String> sourceFiles, String outputFile) throws IOException, DocumentException {
        Document document = new Document();

        PdfCopy copy = new PdfCopy(document, new FileOutputStream(outputFile));
        document.open();

        for (String sourceFile : sourceFiles) {
            PdfReader reader = new PdfReader(sourceFile);
            int totalPages = reader.getNumberOfPages();

            for (int page = 1; page <= totalPages; page++) {
                PdfImportedPage importedPage = copy.getImportedPage(reader, page);
                copy.addPage(importedPage);
            }

            reader.close();
        }

        document.close();
    }
}

在这个工具类中，我们定义了一个名为mergePdfFiles的方法，它接受两个参数：一个包含要合并的PDF文件路径的列表和合并后的PDF文件的输出路径。

我们首先创建一个Document对象，并使用PdfCopy类来将页面添加到新的PDF文件中。然后，我们遍历输入的PDF文件列表，对每个文件进行处理。

对于每个源文件，我们使用PdfReader类来读取文件，并获取文件中的页面总数。然后，我们使用PdfCopy对象的addPage方法将每个页面导入到新的PDF文件中。

最后，我们关闭源文件的PdfReader对象和新的PDF文件的Document对象，完成合并操作。

编写测试类

为了验证我们的工具类是否正常工作，我们编写一个测试类来合并一些示例PDF文件。以下是一个示例测试类的代码：

java 复制代码

import com.itextpdf.text.DocumentException;
import org.junit.jupiter.api.Test;

import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

import static org.junit.jupiter.api.Assertions.*;

class PdfMergeUtilTest {

    @Test
    void mergePdfFiles() {
        List<String> sourceFiles = new ArrayList<>();
        sourceFiles.add("path/to/source1.pdf");
        sourceFiles.add("path/to/source2.pdf");
        String outputFile = "path/to/output.pdf";

        try {
            PdfMergeUtil.mergePdfFiles(sourceFiles, outputFile);
            // 验证输出文件是否存在
            assertTrue(Files.exists(Paths.get(outputFile)));
        } catch (IOException | DocumentException e) {
            fail("合并PDF文件时发生错误：" + e.getMessage());
        }
    }
}

在这个测试类中，我们使用了JUnit 5的测试框架来编写一个名为mergePdfFiles的测试方法。在该方法中，我们创建了一个包含要合并的两个PDF文件路径的列表，并指定了输出文件路径。

然后，我们调用PdfMergeUtil工具类的mergePdfFiles方法来合并PDF文件。在测试方法中，我们使用了JUnit 5的断言来验证合并操作的结果。我们使用assertTrue断言来验证合并后的输出文件是否存在。

如果合并操作出现异常，我们使用fail断言来标记测试失败，并输出异常信息。

请确保在运行测试之前替换sourceFiles和outputFile的值为你自己的文件路径。

完整示例代码

下面是完整的示例代码，包括工具类和测试类：

java 复制代码

import com.itextpdf.text.Document;
import com.itextpdf.text.DocumentException;
import com.itextpdf.text.pdf.*;

import java.io.FileOutputStream;
import java.io.IOException;
import java.util.List;

public class PdfMergeUtil {

    /**
     * 合并多个PDF文档为一个
     *
     * @param sourceFiles 要合并的PDF文件路径列表
     * @param outputFile  合并后的PDF文件输出路径
     * @throws IOException       如果读取文件或写入文件时发生错误
     * @throws DocumentException 如果创建PDF文档时发生错误
     */
    public static void mergePdfFiles(List<String> sourceFiles, String outputFile) throws IOException, DocumentException {
        Document document = new Document();

        PdfCopy copy = new PdfCopy(document, new FileOutputStream(outputFile));
        document.open();

        for (String sourceFile : sourceFiles) {
            PdfReader reader = new PdfReader(sourceFile);
            int totalPages = reader.getNumberOfPages();

            for (int page = 1; page <= totalPages; page++) {
                PdfImportedPage importedPage = copy.getImportedPage(reader, page);
                copy.addPage(importedPage);
            }

            reader.close();
        }

        document.close();
    }
}

import com.itextpdf.text.DocumentException;
import org.junit.jupiter.api.Test;

import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.ArrayList;
import java.util.List;

import static org.junit.jupiter.api.Assertions.*;

class PdfMergeUtilTest {

    @Test
    void mergePdfFiles() {
        List<String> sourceFiles = new ArrayList<>();
        sourceFiles.add("path/to/source1.pdf");
        sourceFiles.add("path/to/source2.pdf");
        String outputFile = "path/to/output.pdf";

        try {
            PdfMergeUtil.mergePdfFiles(sourceFiles, outputFile);
            // 验证输出文件是否存在
            assertTrue(Files.exists(Paths.get(outputFile)));
        } catch (IOException | DocumentException e) {
            fail("合并PDF文件时发生错误：" + e.getMessage());
        }
    }
}

总结

本文介绍了如何使用Java和iText库合并多个PDF文档为一个。我们通过添加Maven依赖来引入iText库，并编写了一个工具类来实现合并功能。我们还编写了一个测试类来验证合并操作的结果。

希望本文能够对你了解如何使用Java合并PDF文档有所帮助！