Sample Document Files for Testing
Download free document test files for developers. Test OCR text extraction, layout rendering, macro security, and metadata parsing across all major business formats (PDF, DOCX, XLSX).
| Format | Category | Description |
|---|---|---|
| PDF (Portable Document) | Standard | The universal standard. Test text selection, forms, and embedded fonts. |
| Word (DOCX / DOCM) | Office | Standard word processing. Test XML structure and macro-enabled files. |
| Excel (XLSX / XLSM) | Data | Spreadsheet files. Test cell formatting, formulas, and large datasets. |
| PPTX (Presentation) | Slides | Slide decks. Test master layouts, animations, and image embedding. |
| ODF (ODT / ODS) | Open | OpenDocument standard used by LibreOffice. Test cross-platform document interoperability. |
| EPUB (E-Books) | Books | Reflowable text format. Essential for testing e-reader applications. |
| TXT (Plain Text) | Raw | No formatting. Best for testing character encoding (UTF-8) and parsing. |
| RTF (Rich Text) | Legacy | Cross-platform rich text. Good for testing basic styling and compatibility. |
Need a custom format?
Providing high-quality document resources, from plain text to macro-enabled Office files.
Technical Document Samples for QA
Building a Document Management System (DMS), an OCR engine, or a file converter requires a robust set of dummy documents. Our library covers edge cases that often break parsers, such as corrupted XML, very large files (100 MB+), and complex formatting.
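As an illustration of the corrupted-XML edge case, the following minimal sketch opens an Office Open XML package as a ZIP archive and reports which XML parts are well-formed. The names `check_ooxml_parts` and `make_package` are hypothetical helpers, and the in-memory packages stand in for real sample files; they are not complete DOCX documents.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def check_ooxml_parts(data: bytes) -> dict[str, bool]:
    """Map each XML part in a ZIP-based package to True if it is well-formed."""
    results: dict[str, bool] = {}
    with zipfile.ZipFile(io.BytesIO(data)) as package:
        for name in package.namelist():
            # OOXML stores content in .xml parts and relationships in .rels parts.
            if not name.endswith((".xml", ".rels")):
                continue
            try:
                ET.fromstring(package.read(name))
                results[name] = True
            except ET.ParseError:
                results[name] = False
    return results


def make_package(document_xml: bytes) -> bytes:
    """Build a tiny stand-in package holding a single XML part (illustration only)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as package:
        package.writestr("word/document.xml", document_xml)
    return buffer.getvalue()


good = make_package(b"<document><body>Hello</body></document>")
bad = make_package(b"<document><body>Hello</document>")  # <body> is never closed

print(check_ooxml_parts(good))  # {'word/document.xml': True}
print(check_ooxml_parts(bad))   # {'word/document.xml': False}
```

A truncated or otherwise damaged archive fails earlier, when `zipfile.ZipFile` raises `zipfile.BadZipFile`, so a test harness would likely want to catch that case separately.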
- VBA Macro Security: Use our DOCM and XLSM samples to test your antivirus integration and macro blocking policies.
- Encoding Stress Test: Our TXT and CSV files are provided in UTF-8, UTF-16, and ANSI (legacy Windows code page) encodings to verify your parser’s internationalization support.
- Structure Parsing: Test your software’s ability to read headers, footers, tables, and embedded images in Word and PDF documents.
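The encoding stress test above can be exercised with a small decoder like the sketch below. It checks for a byte order mark first, then tries strict UTF-8, and finally falls back to a legacy code page. The function name `decode_text` is hypothetical, and treating "ANSI" as Windows-1252 is an assumption; other locales use other code pages, and UTF-32 is not handled.

```python
import codecs


def decode_text(data: bytes) -> tuple[str, str]:
    """Return (text, encoding_name) using BOM detection with fallbacks."""
    # A byte order mark, when present, identifies the encoding directly.
    if data.startswith(codecs.BOM_UTF8):
        return data[len(codecs.BOM_UTF8):].decode("utf-8"), "utf-8-sig"
    if data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE)):
        # The "utf-16" codec reads the BOM to pick the byte order and strips it.
        return data.decode("utf-16"), "utf-16"
    try:
        # Strict UTF-8 rejects most byte sequences written in legacy code pages.
        return data.decode("utf-8"), "utf-8"
    except UnicodeDecodeError:
        # Assumption: "ANSI" means Windows-1252; undefined bytes are replaced.
        return data.decode("cp1252", errors="replace"), "cp1252"


samples = {
    "utf-8": "naïve café".encode("utf-8"),
    "utf-8 with BOM": codecs.BOM_UTF8 + "naïve café".encode("utf-8"),
    "utf-16": "naïve café".encode("utf-16"),
    "ansi": "naïve café".encode("cp1252"),
}

for label, raw in samples.items():
    text, encoding = decode_text(raw)
    print(f"{label}: detected {encoding}, text {text!r}")
```

The fallback order matters: pure ASCII input decodes as UTF-8, and only bytes that are invalid UTF-8 reach the code page branch, which never fails and therefore has to come last.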
Office Open XML vs. Legacy Formats
| Type | Extension | Architecture | Best For |
|---|---|---|---|
| Modern | .DOCX, .XLSX | XML-based (Zip container) | Standard development, smaller file sizes. |
| Macro | .DOCM, .XLSM | XML + VBA Code | Testing security and automation features. |
| Legacy | .DOC, .XLS | Binary OLE Compound File | Backward compatibility testing (Word 97-2003). |
| Open | .ODT, .ODS | OpenDocument XML | Linux/LibreOffice compatibility. |
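The architectures in the table can be told apart by content rather than by file extension, which is useful when a macro-enabled file has been renamed. The sketch below is a rough classifier under stated assumptions: the function name `classify_container` and its return labels are hypothetical, the sample blobs are tiny stand-ins rather than valid documents, and the macro check only looks for a part named `vbaProject.bin`.

```python
import io
import zipfile

ZIP_MAGIC = b"PK\x03\x04"                          # ZIP local file header
OLE_MAGIC = b"\xd0\xcf\x11\xe0\xa1\xb1\x1a\xe1"    # OLE Compound File header


def classify_container(data: bytes) -> str:
    """Return a coarse container label for an office document."""
    if data.startswith(OLE_MAGIC):
        return "legacy-ole"
    if not data.startswith(ZIP_MAGIC):
        return "unknown"
    with zipfile.ZipFile(io.BytesIO(data)) as package:
        names = package.namelist()
        if "mimetype" in names:
            mimetype = package.read("mimetype").decode("ascii", errors="replace")
            if mimetype.startswith("application/vnd.oasis.opendocument"):
                return "opendocument"
        if "[Content_Types].xml" in names:
            # VBA projects are stored as a binary part named vbaProject.bin.
            has_vba = any(name.endswith("vbaProject.bin") for name in names)
            return "ooxml-macro" if has_vba else "ooxml"
    return "zip-other"


def make_zip(parts: dict[str, bytes]) -> bytes:
    """Build an in-memory ZIP from a name-to-payload mapping (illustration only)."""
    buffer = io.BytesIO()
    with zipfile.ZipFile(buffer, "w") as package:
        for name, payload in parts.items():
            package.writestr(name, payload)
    return buffer.getvalue()


docx = make_zip({"[Content_Types].xml": b"<Types/>", "word/document.xml": b"<document/>"})
docm = make_zip({"[Content_Types].xml": b"<Types/>", "word/vbaProject.bin": b"\x00"})
odt = make_zip({"mimetype": b"application/vnd.oasis.opendocument.text"})
doc = OLE_MAGIC + b"\x00" * 504

for label, blob in [("docx", docx), ("docm", docm), ("odt", odt), ("doc", doc)]:
    print(label, "->", classify_container(blob))
```

A check like this only identifies the container. It says nothing about whether the macro code inside is safe, which remains the job of the antivirus or policy layer under test.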
Document Test FAQ
Are the PDF files OCR-readable?
Yes. We provide both “Searchable PDF” (text layer present) and “Scanned PDF” (image only) to help you test Optical Character Recognition (OCR) accuracy.
Do the files contain PII (Personal Data)?
No. All names, addresses, and financial data in our dummy documents are Lorem Ipsum or randomly generated, so the files contain no real personal data and are safe to use in test environments subject to GDPR or CCPA.
