Projects / PandaLex PDF Parser

PandaLex PDF Parser

PandaLex is the PDF parsing code from Panda, which has been split into its own project to increase its utility. It is a flex and bison description of the PDF specification, which allows programmers to define callbacks to handle different document elements.

Operating Systems

Recent releases

  •  18 May 2003 02:41

    Release Notes: A siginificantly improved lexer, which successfully parses many more of the regression tests.

    •  25 Mar 2002 13:25

      Release Notes: A bug in the header event which stopped the extra header info from being passed through was fixed. The name of the sample application, pdfinfo, was changed to pdfdump since it conflicts with the pdfinfo utility from the xpdf distribution. More callbacks were implemented.

      •  07 Mar 2002 03:09

        Release Notes: A massively-improved PDF parsing engine.

        •  08 Dec 2000 11:41

          Release Notes: Initial freshmeat announcement.


          Project Spotlight


          A Fluent OpenStack client API for Java.


          Project Spotlight

          TurnKey TWiki Appliance

          A TWiki appliance that is easy to use and lightweight.