A robust, strictly typed Node.js and browser library for parsing office files (docx, pptx, xlsx, odt, odp, ods, pdf, rtf). It produces a clean, hierarchical Abstract Syntax Tree (AST) with rich metadata, text formatting, and full attachment support.
MIT License
OfficeParser is a Node.js and browser library for parsing office documents (docx, pptx, xlsx, odt, odp, ods, pdf, rtf) into a structured Abstract Syntax Tree (AST). It's designed for developers who need to extract and analyze document content programmatically, offering rich metadata, formatting support, and optional OCR for images.
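To make the idea of a hierarchical AST concrete, here is a minimal sketch of how such a tree might be walked to recover plain text and formatted runs. The `AstNode` shape, the node type names, and the helpers `extractText` and `collect` are all hypothetical and chosen for illustration; they are not OfficeParser's actual schema or API, so check the library's own documentation for the real node types.

```typescript
// Hypothetical node shape for illustration only; NOT OfficeParser's actual schema.
interface AstNode {
  type: string;          // e.g. "document", "heading", "paragraph", "text"
  text?: string;         // present on leaf nodes that carry text
  bold?: boolean;        // example formatting flag
  children?: AstNode[];  // nested content
}

// Depth-first walk that concatenates the text of all nodes,
// ending each paragraph or heading with a newline.
function extractText(node: AstNode): string {
  const own = node.text ?? "";
  const nested = (node.children ?? []).map(extractText).join("");
  const isBlock = node.type === "paragraph" || node.type === "heading";
  return own + nested + (isBlock ? "\n" : "");
}

// Collect every node matching a predicate, e.g. all bold runs.
function collect(
  node: AstNode,
  pred: (n: AstNode) => boolean,
  out: AstNode[] = []
): AstNode[] {
  if (pred(node)) out.push(node);
  for (const child of node.children ?? []) collect(child, pred, out);
  return out;
}

// A small hand-built tree standing in for a parsed document.
const sample: AstNode = {
  type: "document",
  children: [
    { type: "heading", children: [{ type: "text", text: "Report" }] },
    {
      type: "paragraph",
      children: [
        { type: "text", text: "Revenue grew " },
        { type: "text", text: "12%", bold: true },
      ],
    },
  ],
};

const plainText = extractText(sample);
const boldRuns = collect(sample, (n) => n.bold === true);

console.log(plainText);
console.log(boldRuns.map((n) => n.text));
```

Keeping formatting as flags on nodes, rather than flattening to a string up front, is what lets the same tree serve both plain-text extraction and structure-aware queries such as "find all bold runs".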