From gbhackers.com
This tool will parse a PDF document to distinguish the central components utilized as a part of analyzed file. It won’t render a PDF archive.
Features included:
- Load/parse objects and headers
- Extract metadata (author, description, …)
- Extract text from ordered pages
- Support of compressed pdf
- Support of MAC OS Roman charset encoding
- Handling of hexa and octal encoding in text sections
- PSR-0 compliant (autoloader)
- PSR-1 compliant (code styling)