pdf2htmlEX 32位windows版本

2023-10-17 09:20

本文主要是介绍pdf2htmlEX 32位windows版本,希望对大家解决编程问题提供一定的参考价值,需要的开发者们随着小编来一起学习吧!

pdf2htmlEX 32位windows版本,原文出处:https://blog.csdn.net/weixin_44603744/article/details/86596082

windows系统可执行版下载地址:
http://soft.rubypdf.com/software/pdf2htmlex-windows-version
在这里插入图片描述
使用方法:

  1. 将需要转换的pdf文件放入pdf2htmlEX的解压目录
    在这里插入图片描述

  2. 使用命令提示符进入pdf2htmlEX的解压目录

cd d:\pdfex
d:
  • 1
  • 2

在这里插入图片描述

  1. 执行cmd命令调用pdf2htmlex进行转换:
pdf2htmlex --zoom 1.8 abc.pdf
  • 1

在这里插入图片描述

  1. 执行完毕后,会在同目录下生成与pdf同名的html文件:
    在这里插入图片描述
    在这里插入图片描述

参数说明
–zoom 缩放倍率 (转换结果是基于pdf文件的默认设置,如果转换结果阅读体验不佳,可通过调节zoom参数进行文字缩放)

更多参数:https://github.com/coolwanglu/pdf2htmlEX/wiki/Command-Line-Options
项目github:https://github.com/coolwanglu/pdf2htmlEX

OPTIONSPages-f, --first-page <num> (Default: 1)Specify the first page to process-l, --last-page <num> (Default: last page)Specify the last page to processDimensions--zoom <ratio>, --fit-width <width>, --fit-height <height>--zoom specifies the zoom  factor  directly;  --fit-width/heightspecifies  the maximum width/height of a page, the values are inpixels.If multiple values are specified, the minimum one will be used.If none is specified, pages will be rendered as 72DPI.--use-cropbox <0|1> (Default: 1)Use CropBox instead of MediaBox for output.--hdpi <dpi>, --vdpi <dpi> (Default: 144)Specify the horizontal and vertical DPI for imagesOutput--embed <string>--embed-css <0|1> (Default: 1)--embed-font <0|1> (Default: 1)--embed-image <0|1> (Default: 1)--embed-javascript <0|1> (Default: 1)--embed-outline <0|1> (Default: 1)Specify which elements should be embedded into the  output  HTMLfile.If  switched  off,  separated files will be generated along withthe HTML file for the corresponding elements.--embed accepts a string as argument. Each letter of the  stringmust  be  one  of  `cCfFiIjJoO`, which corresponds to one of the--embed-*** switches. Lower case letters for 0  and  upper  caseletters  for  1.  For  example,  `--embed  cFIJo` means to embedeverything but CSS files and outlines.--split-pages <0|1> (Default: 0)If turned on, the content of each page is stored in a  separatedfile.This  switch is useful if you want pages to be loaded separately& dynamically -- a supporting server might be necessary.Also see --page-filename.--dest-dir <dir> (Default: .)Specify destination folder.--css-filename <filename> (Default: <none>)Specify the filename of the generated css file, if not embedded.If it's empty, the file name will be determined automatically.--page-filename <filename> (Default: <none>)Specify the filename template for pages when --split-pages is 1A %d placeholder may be included in `filename` to indicate wherethe  page  number  should  be placed. The placeholder supports alimited subset of normal numerical placeholders, including spec‐ified width and zero padding.If  `filename`  does not contain a placeholder for the page num‐ber, the page number will be inserted directly before  the  fileextension.  If the filename does not have an extension, the pagenumber will be placed at the end of the file name.If --page-filename is not specified,  <input-filename>  will  beused for the output filename, replacing the extension with .pageand adding the page number directly before the extension.Examplespdf2htmlEX --split-pages 1 foo.pdfYields page files foo1.page, foo2.page, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename bar.bazYields page files bar1.baz, bar2.baz, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename page%dbar.bazYields page files page1bar.baz, page2bar.baz, etc.pdf2htmlEX --split-pages 1 foo.pdf --page-filename bar%03d.bazYields page files bar001.baz, bar002.baz, etc.--outline-filename <filename> (Default: <none>)Specify the filename of  the  generated  outline  file,  if  notembedded.If it's empty, the file name will be determined automatically.--process-nontext <0|1> (Default: 1)Whether to process non-text objects (as images)--process-outline <0|1> (Default: 1)Whether to show outline in the generated HTML--printing <0|1> (Default: 1)Enable  printing  support.  Disabling this option may reduce thesize of CSS.--fallback <0|1> (Default: 0)Output in fallback mode, for better accuracy and browser compat‐ibility, but the size becomes larger.--tmp-file-size-limit <limit> (Default: -1)This  limits the total size (in KB) of the temporary files whichwill also limit the total size of the output file.  This  is  anestimate and it will stop after a page, once the total temporaryfiles size is greater than this number.-1 means no limit and is the default.Fonts--embed-external-font <0|1> (Default: 1)Specify whether the local matched fonts, for fonts not  embeddedin PDF, should be embedded into HTML.If  this  switch  is off, only font names are exported such thatweb browsers may try to find proper fonts themselves,  and  thatmight cause issues about incorrect font metrics.--font-format <format> (Default: woff)Specify the format of fonts extracted from the PDF file.--decompose-ligature <0|1> (Default: 0)Decompose ligatures. For example 'fi' -> 'f''i'.--auto-hint <0|1> (Default: 0)If  set  to 1, hints will be generated for the fonts using font‐forge.This may be preceded by --external-hint-tool.--external-hint-tool <tool> (Default: <none>)If specified, the tool will be called in order to enhanced hint‐ing for fonts, this will precede --auto-hint.The  tool  will  be called as '<tool> <in.suffix> <out.suffix>',where suffix will be the same as specified for --font-format.--stretch-narrow-glyph <0|1> (Default: 0)If set to 1, glyphs narrower  than  described  in  PDF  will  bestretched;  otherwise  space  will be padded to the right of theglyphs--squeeze-wide-glyph <0|1> (Default: 1)If set to  1,  glyphs  wider  than  described  in  PDF  will  besqueezed; otherwise it will be truncated.--override-fstype <0|1> (Default: 0)Clear the fstype bits in TTF/OTF fonts.Turn  this  on  if Internet Explorer complains about 'Permissionmust be Installable' AND you have permission to do so.--process-type3 <0|1> (Default: 0)If turned on, pdf2htmlEX will try to convert Type 3  fonts  suchthat  text can be rendered natively in HTML.  Otherwise all textwith Type 3 fonts will be rendered as image.This feature is highly experimental.Text--heps <len>, --veps <len> (Default: 1)Specify the maximum  tolerable  horizontal/vertical  offset  (inpixels).pdf2htmlEX  would try to optimize the generated HTML file movingText within this distance.--space-threshold <ratio> (Default: 0.125)pdf2htmlEX would insert a whitespace character ' ' if  the  dis‐tance  between two consecutive letters in the same line is widerthan ratio * font_size.--font-size-multiplier <ratio> (Default: 4.0)Many web browsers limit the minimum font size,  and  many  wouldround the given font size, which results in incorrect rendering.Specify a ratio greater than 1 would resolve this issue, howeverit might freeze some browsers.For some versions of Firefox, however, there will be  a  problemwhen  the  font size is too large, in which case a smaller valueshould be specified here.--space-as-offset <0|1> (Default: 0)If set to 1, space characters will be treated as offsets,  whichallows a better optimization.For  PDF  files  with  bad encodings, turning on this option maycause losing characters.--tounicode <-1|0|1> (Default: 0)A ToUnicode map may be provided for each font in PDF which indi‐cates  the  'meaning'  of the characters. However often there isbetter "ToUnicode" info in Type 0/1  fonts,  and  sometimes  theToUnicode map provided is wrong.  If this value is set to 1, theToUnicode Map is always applied, if provided in PDF, and charac‐ters may not render correctly in HTML if there are collisions.If  set to -1, a customized map is used such that rendering willbe correct in HTML (visually the same), but you may not get cor‐rect characters by select & copy & paste.If  set  to  0, pdf2htmlEX would try its best to balance the twomethods above.--optimize-text <0|1> (Default: 0)If set to 1, pdf2htmlEX will try to reduce the  number  of  HTMLelements used for text. Turn it off if anything goes wrong.Background Image--bg-format <format> (Default: png)Specify  the  background  image  format.  Run `pdf2htmlEX -v` tocheck all supported formats.PDF Protection-o, --owner-password <password>Specify owner password-u, --user-password <password>Specify user password--no-drm <0|1> (Default: 0)Override document DRM settingsTurn this on only when you have permission.Misc.--clean-tmp <0|1> (Default: 1)If switched off, intermediate files won't be cleaned in the end.--data-dir <dir> (Default: /usr/local/share/pdf2htmlEX)Specify the folder holding the manifest  and  other  files  (seebelow for the manifest file)`--tmp-dir <dir> (Default: /tmp)Specify the temporary folder to use for temporary files--css-draw <0|1> (Default: 0)Experimental and unsupported CSS drawing--debug <0|1> (Default: 0)Print debug information.Meta-v, --versionPrint copyright and version info--help Print usage information

这篇关于pdf2htmlEX 32位windows版本的文章就介绍到这儿,希望我们推荐的文章对编程师们有所帮助!



http://www.chinasem.cn/article/224420

相关文章

CSS place-items: center解析与用法详解

《CSSplace-items:center解析与用法详解》place-items:center;是一个强大的CSS简写属性,用于同时控制网格(Grid)和弹性盒(Flexbox)... place-items: center; 是一个强大的 css 简写属性,用于同时控制 网格(Grid) 和 弹性盒(F

CSS实现元素撑满剩余空间的五种方法

《CSS实现元素撑满剩余空间的五种方法》在日常开发中,我们经常需要让某个元素占据容器的剩余空间,本文将介绍5种不同的方法来实现这个需求,并分析各种方法的优缺点,感兴趣的朋友一起看看吧... css实现元素撑满剩余空间的5种方法 在日常开发中,我们经常需要让某个元素占据容器的剩余空间。这是一个常见的布局需求

CSS Anchor Positioning重新定义锚点定位的时代来临(最新推荐)

《CSSAnchorPositioning重新定义锚点定位的时代来临(最新推荐)》CSSAnchorPositioning是一项仍在草案中的新特性,由Chrome125开始提供原生支持需... 目录 css Anchor Positioning:重新定义「锚定定位」的时代来了! 什么是 Anchor Pos

CSS中的Static、Relative、Absolute、Fixed、Sticky的应用与详细对比

《CSS中的Static、Relative、Absolute、Fixed、Sticky的应用与详细对比》CSS中的position属性用于控制元素的定位方式,不同的定位方式会影响元素在页面中的布... css 中的 position 属性用于控制元素的定位方式,不同的定位方式会影响元素在页面中的布局和层叠关

HTML5 getUserMedia API网页录音实现指南示例小结

《HTML5getUserMediaAPI网页录音实现指南示例小结》本教程将指导你如何利用这一API,结合WebAudioAPI,实现网页录音功能,从获取音频流到处理和保存录音,整个过程将逐步... 目录1. html5 getUserMedia API简介1.1 API概念与历史1.2 功能与优势1.3

在Windows上使用qemu安装ubuntu24.04服务器的详细指南

《在Windows上使用qemu安装ubuntu24.04服务器的详细指南》本文介绍了在Windows上使用QEMU安装Ubuntu24.04的全流程:安装QEMU、准备ISO镜像、创建虚拟磁盘、配置... 目录1. 安装QEMU环境2. 准备Ubuntu 24.04镜像3. 启动QEMU安装Ubuntu4

Windows下C++使用SQLitede的操作过程

《Windows下C++使用SQLitede的操作过程》本文介绍了Windows下C++使用SQLite的安装配置、CppSQLite库封装优势、核心功能(如数据库连接、事务管理)、跨平台支持及性能优... 目录Windows下C++使用SQLite1、安装2、代码示例CppSQLite:C++轻松操作SQ

全面解析HTML5中Checkbox标签

《全面解析HTML5中Checkbox标签》Checkbox是HTML5中非常重要的表单元素之一,通过合理使用其属性和样式自定义方法,可以为用户提供丰富多样的交互体验,这篇文章给大家介绍HTML5中C... 在html5中,Checkbox(复选框)是一种常用的表单元素,允许用户在一组选项中选择多个项目。本

HTML5 搜索框Search Box详解

《HTML5搜索框SearchBox详解》HTML5的搜索框是一个强大的工具,能够有效提升用户体验,通过结合自动补全功能和适当的样式,可以创建出既美观又实用的搜索界面,这篇文章给大家介绍HTML5... html5 搜索框(Search Box)详解搜索框是一个用于输入查询内容的控件,通常用于网站或应用程

基于Python实现一个Windows Tree命令工具

《基于Python实现一个WindowsTree命令工具》今天想要在Windows平台的CMD命令终端窗口中使用像Linux下的tree命令,打印一下目录结构层级树,然而还真有tree命令,但是发现... 目录引言实现代码使用说明可用选项示例用法功能特点添加到环境变量方法一:创建批处理文件并添加到PATH1