linux – Imagemagick:为PDF flate嵌入生成原始图像数据?
发布时间:2020-12-13 23:06:10 所属栏目:Linux 来源:网络整理
导读:我试图想出一个PDF的命令行,源代码示例(参见 How to generate plain-text source-code PDF examples that work in a document viewer?),但嵌入图像.这是我得到了多远 – 我有一个模板hello.pdf: %PDF-1.41 0 obj /Type /Catalog /Outlines 2 0 R /Pages 3 0
我试图想出一个PDF的命令行,源代码示例(参见
How to generate plain-text source-code PDF examples that work in a document viewer?),但嵌入图像.这是我得到了多远 – 我有一个模板hello.pdf:
%PDF-1.4 1 0 obj << /Type /Catalog /Outlines 2 0 R /Pages 3 0 R >> endobj 2 0 obj << /Type /Outlines /Count 0 >> endobj 3 0 obj << /Type /Pages /Kids [ 4 0 R ] /Count 1 >> endobj 4 0 obj << /Type /Page /Parent 3 0 R /MediaBox [ 0 0 612 792 ] /Contents 5 0 R /Resources << /ProcSet 6 0 R /Font << /F1 7 0 R >> /ProcSet [ /PDF /Text /ImageC ] /XObject << /Im1 8 0 R >> >> >> endobj 5 0 obj << /Length 173 >> stream BT /F1 24 Tf 100 100 Td ( Hello World ) Tj 200 200 Td ( Hello Again ) Tj % width skew-right-up skew-top-right height x y 150 0 0 150 340 130 cm % 150 150 Td - irrelevant for image /Im1 Do ET endstream endobj 6 0 obj [ /PDF /Text ] endobj 7 0 obj << /Type /Font /Subtype /Type1 /Name /F1 /BaseFont /Helvetica /Encoding /MacRomanEncoding >> endobj 8 0 obj << /BitsPerComponent 8 /ColorSpace /DeviceRGB /DecodeParms << /BitsPerComponent 8 /Colors 3 /Columns 150 /Predictor 10 >> /Filter [ /FlateDecode ] /Height 150 /Length 1418 /Subtype /Image /Type /XObject /Width 150 >> stream ###endstream endobj 9 0 obj [ /PDF ] endobj xref 0 10 0000000000 65535 f 0000000009 00000 n 0000000074 00000 n 0000000120 00000 n 0000000179 00000 n 0000000364 00000 n 0000000466 00000 n 0000000496 00000 n 0000001000 00000 n 0000001100 00000 n trailer << /Size 10 /Root 1 0 R >> startxref 625 %%EOF 然后,我使用convert生成图像数据,并使用zlib应用“flate”压缩: convert -size 150x150 gradient:#4b4-#bfb test.ppm du -b test.ppm # 135017 bytes python -c "import zlib,sys;sys.stdout.write(zlib.compress(sys.stdin.read()))" < test.ppm > test.flate du -b test.flate # 1418 bytes 然后,我替换文件中的/ Length 1418,最后使用以下代码替换令牌###: perl -ne 's/^###/`cat test.flate`/e;print' hello.pdf > hello2.pdf 这个文件显然有不正确的外部参照表,然而,在evince打开就好了: …但是,显然位图的格式不正确. 我试图生成,比方说, convert -size 150x150 gradient:#4b4-#bfb -endian LSB rgb:test.raw ……但那些通常都不是绿色的(原始图像应该是). 有没有人知道正确的图像格式 – 和转换命令行 – 生成一个可以“计算”并包含在PDF格式中的原始图像? 非常感谢任何答案, 解决方法
好的,修好了;问题是必须在转换命令行中指定8位深度;因此正确的调用是:
convert -depth 8 -size 150x150 gradient:#4b4-#bfb rgb:test.raw 然后我们有: du -b test.raw # 67500 bytes python -c "import zlib,sys;sys.stdout.write(zlib.compress(sys.stdin.read()))" < test.raw > test.flate du -b test.flate # 664 bytes # replace /Length 664,and then: perl -ne 's/^###/`cat test.flate`/e;print' hello.pdf > hello2.pdf 最后,hello2.pdf在evince中打开并正确显示位图: 顺便说一句,我发现这是因为我实际上是在尝试调试另一个文档中的图像;所以我基本上做了以下事情: # extract and save the stream of this image object qpdf --show-object=23 --raw-stream-data mybadfile.pdf > myraw.file # get raw binary data - deflate the saved object stream python -c "import zlib,sys;sys.stdout.write(zlib.decompress(sys.stdin.read()))" < myraw.file > myraw.deflate identify myraw.deflate # identify: no decode delegate for this image format `myraw.deflate' @ constitute.c/ReadImage/530. identify rgb:myraw.deflate # identify: Must specify image size `myraw.deflate' @ rgb.c/ReadRGBImage/155. identify -size 588x508 rgb:myraw.deflate # rgb:myraw.deflate=>myraw.deflate RGB 588x508 588x508+0+0 16-bit TrueColor DirectClass 875KiB 0.020u 0:00.030 # identify: Unexpected end-of-file `myraw.deflate': No such file or directory @ rgb.c/ReadRGBImage/261. display -size 588x508 rgb:myraw.deflate # display: Unexpected end-of-file `myraw.deflate': No such file or directory @ rgb.c/ReadRGBImage/261. ### but it shows correctly,except for size? identify -depth 8 -size 588x508 rgb:myraw.deflate # rgb:myraw.deflate=>myraw.deflate RGB 588x508 588x508+0+0 8-bit TrueColor DirectClass 875KiB 0.020u 0:00 ## OK display -depth 8 -size 588x508 rgb:myraw.deflate # OK; choosing rgba: is already bad - so confirmed 8-bit rgb 希望这有助于某人,干杯! (编辑:李大同) 【声明】本站内容均来自网络,其相关言论仅代表作者个人观点,不代表本站立场。若无意侵犯到您的权利,请及时与联系站长删除相关内容! |