~ft/pdffs

PDF as a filesystem


clone

read-only
https://git.sr.ht/~ft/pdffs
read/write
git@git.sr.ht:~ft/pdffs


#PDF as a file system

Yeah. WIP.

There is nothing to look at yet. You can build it and run something like `./6.out my.pdf Root Pages Kids 0 Kids 0 Kids 0 Contents .` to dump a specific object (text, images, fonts, etc.).
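The path arguments in that command read as a walk through the PDF object tree: names such as `Pages` or `Kids` select dictionary keys, and numbers such as `0` select array elements. The sketch below illustrates only that idea in portable C. It is not the pdffs code: the `Obj` type and the `step` and `walk` functions are hypothetical, and it ignores indirect references, streams and everything else a real PDF parser has to handle.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical, minimal object model: numbers, arrays and dictionaries. */
enum { Onum, Oarray, Odict };

typedef struct Obj Obj;
struct Obj {
	int type;
	int num;           /* Onum: the value */
	int n;             /* Oarray, Odict: number of entries */
	const char **keys; /* Odict: key names */
	Obj **vals;        /* Oarray: elements; Odict: values */
};

/*
 * Follow one path component from o.
 * In a dictionary the component is a key name; in an array it is a
 * decimal index. Returns NULL if the component does not resolve.
 */
static Obj *
step(Obj *o, const char *name)
{
	char *end;
	long i;
	int k;

	if(o->type == Odict){
		for(k = 0; k < o->n; k++)
			if(strcmp(o->keys[k], name) == 0)
				return o->vals[k];
		return NULL;
	}
	if(o->type == Oarray){
		i = strtol(name, &end, 10);
		if(end == name || *end != '\0' || i < 0 || i >= o->n)
			return NULL;
		return o->vals[i];
	}
	return NULL; /* numbers have no children */
}

/* Follow argc path components starting at root; NULL if any of them fails. */
Obj *
walk(Obj *root, int argc, const char **argv)
{
	int i;

	assert(root != NULL);
	for(i = 0; i < argc && root != NULL; i++)
		root = step(root, argv[i]);
	return root;
}
```

With a tree built by hand, calling `walk(&root, 3, path)` with `path` set to `{"Pages", "Kids", "0"}` returns the first element of the `Kids` array under `Pages`, or `NULL` if any component is missing.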

#Installation

mk deps    # install additional decoders (jbig2, jpeg2000)
mk install

#TODO

  • ASCIIHexDecode filter
  • CryptDecode filter
  • does DCTDecode filter need to deal with ColorTransform?
  • do we need predictor 5 ("optimum")?
  • object stream dictionary doesn't support Extends
  • jp2: CMYK color space support needs to be added
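For the first TODO item, here is a rough sketch of what an ASCIIHexDecode filter does according to the PDF specification: pairs of hex digits become bytes, whitespace is skipped, `>` marks the end of data, and a trailing odd digit is treated as if it were followed by `0`. It is written in portable C rather than Plan 9 C, works on an in-memory buffer, and the names `asciihexdecode`, `hexval` and `iswhite` are made up for illustration. How this would plug into the pdffs filter code is not shown here.

```c
#include <assert.h>
#include <stddef.h>

/* Value of one hex digit, or -1 if c is not a hex digit. */
static int
hexval(int c)
{
	if(c >= '0' && c <= '9')
		return c - '0';
	if(c >= 'a' && c <= 'f')
		return c - 'a' + 10;
	if(c >= 'A' && c <= 'F')
		return c - 'A' + 10;
	return -1;
}

/* PDF white-space characters: NUL, HT, LF, FF, CR, SP. */
static int
iswhite(int c)
{
	return c == 0 || c == '\t' || c == '\n' || c == '\f' || c == '\r' || c == ' ';
}

/*
 * Decode ASCIIHex data from in[0..n) into out, which must have room for
 * at least (n+1)/2 bytes. Stops at '>' or at the end of the input.
 * Returns the number of bytes written, or -1 on an invalid character.
 */
long
asciihexdecode(const unsigned char *in, size_t n, unsigned char *out)
{
	size_t i;
	long len;
	int v, hi;

	assert(in != NULL && out != NULL);
	hi = -1; /* pending high nibble, -1 if none */
	len = 0;
	for(i = 0; i < n; i++){
		if(in[i] == '>') /* end-of-data marker */
			break;
		if(iswhite(in[i]))
			continue;
		v = hexval(in[i]);
		if(v < 0)
			return -1;
		if(hi < 0){
			hi = v;
		}else{
			out[len++] = (unsigned char)(hi<<4 | v);
			hi = -1;
		}
	}
	if(hi >= 0) /* odd number of digits: pad with a zero nibble */
		out[len++] = (unsigned char)(hi<<4);
	return len;
}
```

A streaming version that reads from the underlying stream in chunks would need to carry the pending nibble between calls; the buffer-at-once form above just keeps the sketch short.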