首页技术总结正文内容

Java操作Word转PDF（Word转图片）

技术总结

更新时间：2024-12-22 21:50:31 5

admin 管理员组

文章数量: 887017

1. spire.doc的jar引用

首先我们需要用到国产word处理工具jar包spire.doc，可以通过maven仓库寻找，然后在pom文件中直接引用。

此处需要注意，我们需要使用的是spire.doc.free（免费版的），切勿使用spire.doc（如果使用了，处理后的word文件第一页的顶部会出现红色的警告水印信息）

如果不能直接从仓库引用到此jar，可以在仓库直接下载下来后，手动存放与本地仓库中，处理方式详见本人的另一个帖子：本地Maven仓库导入外部jar

2. 直接上代码

/**
     * word转img（word转pdf）
     *
     * @param inFilePath word文件存放地址全路径
     * @return 生成的图片地址集合
     * @throws IOException
     */
    public static List<String> word2Img(String inFilePath) throws IOException {
        List<String> list = new ArrayList<>();
        Document doc = new Document(inFilePath);
        String outFilePath = inFilePath.substring(0, inFilePath.lastIndexOf("."));
        String pdfFilePath = outFilePath + "_副本.pdf";
        //产生pdf文件（如不需要自己后续增加删除pdf的代码）
        doc.saveToFile(pdfFilePath, FileFormat.PDF);
        int pageCount = doc.getPageCount();
        pageCount = pageCount > 3 ? 3 : pageCount;
        for (int i = 0; i < pageCount; i++) {
        for (int i = 0; i < doc.getPageCount(); i++) {
            BufferedImage bufferedImage = doc.saveToImages(i, ImageType.Bitmap);
            String imgPath = outFilePath + "_副本" + (i + 1) + ".png";
            File file = new File(imgPath);
            ImageIO.write(bufferedImage, "PNG", file);
            list.add(imgPath);
        }
        return list;
    }

    public static void main(String[] args) throws IOException {
        String wordPath = "C:\\Users\\DaiHaijiao\\Desktop/aaa.docx";
        List<String> list = WordUtils.word2Img(wordPath);
        System.out.println(list);
    }

说明：代码执行后，会生成一个pdf文件（此文件就是word转pdf后的文件），而后续的转图片就是基于此pdf文件进行的转换。如果不需要pdf文件的输出，在转图片完成后执行代码删除此pdf即可；如只需要得到pdf文件，则就不需要后续的转图片代码了（直接删除那块代码即可）。

注意：此spire.doc.free在转换pdf或是图片时，word内的内容不能超过3页，如果超过了3页，第四页开始将无法转换（第四页上将会显示警告错误信息，第五页，第六页...全部丢失），转成的图片也一样。

3. 若只需要将Word转PDF（不需要转图片），可以使用poi。此方式可以解决spire.doc.free在页码数量限制上的问题。

3.1 代码pom引入依赖

        <!--<dependency>
            <groupId>org.apache.poi</groupId>
            <artifactId>poi-ooxml</artifactId>
            <version>3.17</version>
        </dependency>
        <dependency>
            <groupId>fr.opensagres.xdocreport</groupId>
            <artifactId>fr.opensagres.poi.xwpf.converter.pdf-gae</artifactId>
            <version>2.0.1</version>
        </dependency>-->
        <dependency>
            <groupId>org.apache.poi</groupId>
            <artifactId>poi-ooxml</artifactId>
            <version>4.1.2</version>
        </dependency>
        <dependency>
            <groupId>fr.opensagres.xdocreport</groupId>
            <artifactId>fr.opensagres.poi.xwpf.converter.pdf-gae</artifactId>
            <version>2.0.2</version>
            <exclusions>
                <exclusion>
                    <artifactId>org.apache.poi</artifactId>
                    <groupId>poi-ooxml</groupId>
                </exclusion>
            </exclusions>
        </dependency>

此处注意：
poi-ooxml 3.17 + fr.opensagres.poi.xwpf.converter.pdf-gae 2.0.1可以直接配合使用。
如果要使用poi-ooxml 4.1.2 + fr.opensagres.poi.xwpf.converter.pdf-gae 2.0.2时，要去除fr.opensagres.poi.xwpf.converter.pdf-gae 2.0.2里面的poi-ooxml。

3.2 相关使用代码

    /**
     * word转pdf
     *
     * @param inFilePath word存放全路径
     * @return 转换后的文件全路径
     * @throws IOException
     */
    public static String doc2Pdf(String inFilePath) throws IOException {
        FileInputStream fileInputStream = new FileInputStream(inFilePath);
        XWPFDocument xwpfDocument = new XWPFDocument(fileInputStream);
        PdfOptions pdfOptions = PdfOptions.create();
        String outFilePath = inFilePath.substring(0, inFilePath.lastIndexOf("."));
        outFilePath += "_副本.pdf";
        FileOutputStream fileOutputStream = new FileOutputStream(outFilePath);
        PdfConverter.getInstance().convert(xwpfDocument, fileOutputStream, pdfOptions);
        fileInputStream.close();
        fileOutputStream.close();
        return outFilePath;
    }

    public static void main(String[] args) throws IOException {
        String inFilePath = "C:\\Users\\DaiHaijiao\\Desktop/aaa.docx";
        String outFilePath = WordUtils.doc2Pdf(inFilePath);
        System.out.println(outFilePath);
    }

此处Word和生成的PDF就不截图了（当Word较大时，执行可能会有点慢）。

特别注意：Word中的字体，一定，一定，一定，要用一样的字体！

字体切勿混用！

好话说三遍！！！

本文标签：操作图片 java pdf Word

版权声明：本文标题：Java操作Word转PDF（Word转图片）内容由网友自发贡献，该文观点仅代表作者本人，转载请联系作者并注明出处：http://www.freenas.com.cn/jishu/1726309895h934208.html，本站仅提供信息存储空间服务，不拥有所有权，不承担相关法律责任。如发现本站有涉嫌抄袭侵权/违法违规的内容，一经查实，本站将立刻删除。

发表评论

全部评论 0

暂无评论

技术交流 – FreeNAS中文网

Java操作Word转PDF（Word转图片）

更多相关文章

java中运行方法名,java

deepin 安装office_Deepin深度操作系统安装及使用体验

windows7计算机图片,win7照片查看器无法显示图片计算机可用内存不足 需要技巧...

word标题前自动分页

Word中遇到的问题记录（页眉，页码分节符，跨页断行）

Word设置每页不同的页眉修改或去掉页眉横线页眉标题在横线上下方的设置

在使用过程中，经常出现黑屏现象，任何操作都无反应

不能完成此操作 因为您没有必要的权限_【操作】Cobalt Strike 中 Bypass UAC

如何在手机上打开xmind文件_xmind在手机上怎么操作

java实现文件的上传和下载

Java环境变量配置教程及工具

Windows 操作系统的介绍和常见操作，任何安全人员都应该理解的Windows内部工作原理

什么是 Microsoft Word Header &amp; Footer 设置里的 Link to Previous 选项

word中插入空白页！

word设置交叉引用快捷键和居中快捷键

Word中单独几页纸张方向变为横向后，页码改变

java for mac 安装_Java for MacLinuxWindows 系统上的安装和配置教程

windows server 服务器基本操作

记一次Surface Pro 2还原操作

Windows照片查看器无法显示此图片，因为计算机上的可用内存可能不足。Win7图片打不开 提示内存不足

发表评论

推荐文章

Visual C++ 6.0 Win7 适用版下载

大数据总结

bat 启动 不弹出对话框_bat教程[285] FORF options选项中usebackq的用法

【linux】Linux 系统 CentOS 最新版本和历史版本下载方法

安装vmware workstation和在虚拟机中安装windows 10 系统

热门文章

占星家眼中的十二星座

基于Python实现的银行信息处理系统

selenium + edge浏览器配置

luogu2179 [NOI2012]骑行川藏

联想小新 Air Pro 13笔记本安装win10和Deepin15.8双系统

windows10升级助手_微软官网下载与安装windows10系统的操作步骤

[ MSF使用实例 ] 利用永恒之蓝(MS17-010)漏洞导致windows靶机蓝屏并获取靶机权限

服务器win系统更新如何设置,Windows服务器更新服务的配置

Windows 使用技巧

【Windows Server 2019】Web服务 IIS 配置与管理——理论（术语解释与工作原理）Ⅰ

最新文章

Raid技术

LSI_阵列卡操作手册

破解Centos7_root用户密码

Redhat重置Root用户密码方法

远程批量修改linux服务器密码的脚本

零基础使用UltraISO制作并安装纯净Win10系统指南

苹果电脑windows系统换苹果系统

Win11系统崩溃错误修复指南：三种实用方法详解

如何封装一个自己的win7系统并安装到电脑做成双系统

如何在Excel 2019中开启数据分析工具？

windows7计算机图片,win7照片查看器无法显示图片计算机可用内存不足需要技巧...

不能完成此操作因为您没有必要的权限_【操作】Cobalt Strike 中 Bypass UAC

什么是 Microsoft Word Header & Footer 设置里的 Link to Previous 选项

Windows照片查看器无法显示此图片，因为计算机上的可用内存可能不足。Win7图片打不开提示内存不足

bat 启动不弹出对话框_bat教程[285] FORF options选项中usebackq的用法