Apache Tika bindings for PHP: extract text and metadata from documents, images, and other file types.
The Apache Tika™ toolkit detects and extracts metadata and text from over a thousand different file types (such as PPT, XLS, and PDF).