PDF Processing Toolkit

A comprehensive collection of PDF processing blocks for the OOMOL platform, providing powerful tools for document manipulation, conversion, and enhancement.

🚀 Overview

This toolkit adds a complete set of PDF processing tools to your OOMOL workspace. With 13 specialized blocks, you can handle everything from basic operations like merging and splitting to advanced features like watermarking, encryption, and text extraction.

📦 Available Blocks

🔧 Core PDF Operations

PDF Watermark (`pdf_watermark`)

Add professional watermarks to your PDFs with full customization options.

Text/Image Watermarks: Support both text and image overlays
Position Control: Precise placement with X/Y coordinates
Styling Options: Opacity, rotation, size, color, and font customization
Batch Processing: Apply consistent watermarks across multiple documents
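
To illustrate the X/Y position control, here is a minimal sketch of turning a named anchor into coordinates. It assumes standard PDF user space (origin at the bottom-left corner, units in points, 72 per inch). The function name, the anchor names, and the `margin` parameter are hypothetical and are not taken from the `pdf_watermark` block.

```python
# Hypothetical helper: these names do not come from the pdf_watermark block.
# Assumes PDF user space: origin at bottom-left, units in points (1/72 inch).

def anchor_position(page_w, page_h, mark_w, mark_h, anchor="center", margin=36.0):
    """Return (x, y) of the watermark's bottom-left corner for a named anchor."""
    xs = {
        "left": margin,
        "center": (page_w - mark_w) / 2,
        "right": page_w - mark_w - margin,
    }
    ys = {
        "bottom": margin,
        "center": (page_h - mark_h) / 2,
        "top": page_h - mark_h - margin,
    }
    if anchor == "center":
        return xs["center"], ys["center"]
    try:
        vertical, horizontal = anchor.split("-")  # e.g. "top-right"
        return xs[horizontal], ys[vertical]
    except (ValueError, KeyError):
        raise ValueError(f"unknown anchor: {anchor!r}") from None


# A 200x50 pt watermark centered on a US Letter page (612x792 pt).
x, y = anchor_position(612, 792, 200, 50)
```

Computing the position from the page size keeps a watermark consistently placed across documents with different page dimensions.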

PDF Compression (`pdf_compress`)

Reduce PDF file sizes while maintaining quality for efficient storage and sharing.

Multiple Levels: Low, medium, high, and maximum compression
Smart Optimization: Image quality adjustment and duplicate removal
Size Analytics: Before/after file size comparison with compression ratios
Quality Control: Maintain document readability while reducing file size
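
The before/after size comparison can be sketched as a small calculation. This is only an illustration of how such figures are typically derived; the function name and the keys of the returned dictionary are assumptions, not the actual output schema of `pdf_compress`.

```python
# Hypothetical sketch: field names are illustrative, not the block's real output.

def compression_report(original_bytes, compressed_bytes):
    """Summarize how much a compression pass reduced a file."""
    if original_bytes <= 0:
        raise ValueError("original size must be positive")
    saved = original_bytes - compressed_bytes
    return {
        "original_bytes": original_bytes,
        "compressed_bytes": compressed_bytes,
        "saved_bytes": saved,
        # Share of the original size that was removed.
        "reduction_percent": round(100 * saved / original_bytes, 1),
        # How many times smaller the output is (None if the output is empty).
        "ratio": round(original_bytes / compressed_bytes, 2) if compressed_bytes else None,
    }


report = compression_report(2_000_000, 500_000)
```

A negative `saved_bytes` value signals that the output grew, which can happen with already optimized files.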

PDF Merge (`pdf_merge`)

Combine multiple PDF documents into a single professional document.

Batch Merging: Handle multiple input files simultaneously
Bookmark Preservation: Maintain navigation structure from source documents
Page Numbering: Optionally add sequential page numbers to merged content
Metadata Handling: Preserve document properties and structure

PDF Split (`pdf_split`)

Divide large PDFs into smaller, manageable files with flexible splitting options.

Multiple Split Modes: Single pages, page ranges, bookmarks, or equal parts
Custom Ranges: Specify exact pages like "1-3,5-7,10"
Bookmark-Based: Automatically split at bookmark boundaries
Batch Output: Generate multiple files with organized naming conventions
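
A range specification such as "1-3,5-7,10" can be parsed along the following lines. This is a minimal sketch under the assumption that pages are 1-based and ranges are inclusive; the function name is hypothetical and the block's own parser may accept a different syntax.

```python
# Hypothetical parser: assumes 1-based, inclusive page ranges.

def parse_page_ranges(spec, page_count):
    """Parse a spec like '1-3,5-7,10' into (start, end) tuples."""
    ranges = []
    for part in spec.split(","):
        part = part.strip()
        if not part:
            continue  # tolerate stray commas
        if "-" in part:
            start_s, end_s = part.split("-", 1)
            start, end = int(start_s), int(end_s)
        else:
            start = end = int(part)  # a single page is a one-page range
        if not 1 <= start <= end <= page_count:
            raise ValueError(
                f"invalid range {part!r} for a {page_count}-page document"
            )
        ranges.append((start, end))
    return ranges


ranges = parse_page_ranges("1-3,5-7,10", page_count=10)
```

Each tuple would then map to one output file. Validating against the page count up front avoids producing partial output for a bad specification.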

PDF to Images (`pdf2images`)

Convert PDF pages to high-quality image files.

Multiple Formats: PNG, JPEG, and other image formats
DPI Control: Adjust resolution for different use cases
Page Selection: Convert specific pages or entire documents
Batch Processing: Handle multiple PDFs simultaneously
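
To help choose a DPI value, the sketch below estimates the pixel dimensions of a rendered page. It relies on the fact that PDF page sizes are expressed in points (72 per inch); the function name is hypothetical and not part of the `pdf2images` block.

```python
# Hypothetical helper: estimates output image size for a given DPI.

POINTS_PER_INCH = 72  # PDF page dimensions are measured in points


def pixel_size(width_pt, height_pt, dpi):
    """Return (width_px, height_px) for a page rendered at the given DPI."""
    if dpi <= 0:
        raise ValueError("dpi must be positive")
    return (
        round(width_pt * dpi / POINTS_PER_INCH),
        round(height_pt * dpi / POINTS_PER_INCH),
    )


# US Letter (612x792 pt) at 300 DPI, a common print resolution.
size = pixel_size(612, 792, 300)
```

Pixel count grows with the square of the DPI, so doubling the resolution roughly quadruples the uncompressed image size.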

Images to PDF (`images2pdf`)

Combine multiple images into a single PDF document.

Multi-Format Support: JPEG, PNG, and other image formats
Layout Control: Configure page size and orientation
Batch Conversion: Process entire image collections
Quality Preservation: Maintain image quality during conversion

PDF Metadata Extraction (`pdf2meta`)

Extract document metadata and properties.

Comprehensive Info: Title, author, subject, creator, and more
Custom Properties: Access custom metadata fields
Batch Processing: Extract metadata from multiple documents
JSON Output: Structured metadata export
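
As a rough idea of what structured metadata export involves, the sketch below serializes a PDF information dictionary to JSON. It assumes the keys arrive in PDF name form with a leading slash (such as `/Title`); the function name and the sample values are made up for illustration and do not reflect the real output of `pdf2meta`.

```python
import json

# Hypothetical sketch: assumes metadata keys use PDF name syntax ("/Title").


def metadata_to_json(info):
    """Serialize a PDF info dictionary, dropping the leading '/' from keys."""
    cleaned = {
        str(key).lstrip("/"): (None if value is None else str(value))
        for key, value in info.items()
    }
    return json.dumps(cleaned, indent=2, sort_keys=True, ensure_ascii=False)


# Sample values are invented for the example.
sample = {"/Title": "Quarterly Report", "/Author": "Example Author", "/Subject": None}
output = metadata_to_json(sample)
```

Converting every value to a string keeps the result serializable even when the underlying library returns its own object types.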

🔒 Security & Access Control

PDF Encryption (`pdf_encrypt`)

Protect sensitive documents with advanced password and permission controls.

Dual Password System: Separate user and owner passwords
Permission Management: Control printing, copying, and modification rights
128-bit Encryption: Industry-standard security for document protection
Batch Security: Apply consistent security policies across multiple files
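
Permission management in PDF encryption is usually expressed as a bit mask. The sketch below models only the permission bits for printing, modifying, copying, and annotating as defined in the PDF specification; the class and function names are hypothetical, and a real encryption dictionary also sets reserved bits that this example ignores.

```python
from enum import IntFlag

# Hypothetical model of selected PDF permission bits (bit numbers are 1-based
# in the PDF specification, so "bit 3" is the value 1 << 2).


class Permission(IntFlag):
    PRINT = 1 << 2     # bit 3: print the document
    MODIFY = 1 << 3    # bit 4: modify contents
    COPY = 1 << 4      # bit 5: copy or extract text and graphics
    ANNOTATE = 1 << 5  # bit 6: add or modify annotations


def build_permissions(allow_print=False, allow_copy=False,
                      allow_modify=False, allow_annotate=False):
    """Combine the allowed operations into a single flag value."""
    flags = Permission(0)
    if allow_print:
        flags |= Permission.PRINT
    if allow_copy:
        flags |= Permission.COPY
    if allow_modify:
        flags |= Permission.MODIFY
    if allow_annotate:
        flags |= Permission.ANNOTATE
    return flags


read_only_with_print = build_permissions(allow_print=True)
```

These restrictions apply to readers who open the file with the user password; the owner password grants full access.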

PDF Decryption (`pdf_decrypt`)

Remove password protection from authorized documents.

Password Unlock: Unlock documents with valid credentials
Batch Processing: Decrypt multiple protected files simultaneously
Status Reporting: Verify encryption status and successful decryption
Secure Handling: Safe processing of sensitive document credentials

🎨 Content Manipulation

PDF Rotation (`pdf_rotate`)

Correct document orientation with precise page rotation controls.

Standard Angles: 90°, 180°, and 270° rotation options
Selective Pages: Rotate specific pages or page ranges
Batch Rotation: Apply consistent rotation to multiple documents
Preview Support: Visual confirmation before processing
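
Page rotation in PDF is stored as a multiple of 90 degrees, so applying a rotation to an already rotated page amounts to modular arithmetic. The following is a minimal sketch; the function name is hypothetical and not part of the `pdf_rotate` block.

```python
# Hypothetical helper: combines a page's existing rotation with a new one.


def apply_rotation(current, angle):
    """Return the resulting clockwise rotation in degrees (0, 90, 180, or 270)."""
    if angle % 90 != 0:
        raise ValueError("rotation must be a multiple of 90 degrees")
    return (current + angle) % 360


# A page already rotated 270 degrees, turned a further 180 degrees.
result = apply_rotation(270, 180)
```

Normalizing with a modulo keeps the stored value in the valid range even when rotations are applied repeatedly.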

PDF Text Extraction (`pdf_extract_text`)

Extract and export text content in multiple formats for further processing.

Multiple Formats: Plain text, JSON, or CSV output options
Layout Preservation: Maintain original document formatting
Selective Extraction: Process specific pages or page ranges
Batch Export: Extract text from multiple documents simultaneously
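
The three output formats could be produced from per-page text roughly as follows. This sketch assumes the text has already been extracted into a list with one string per page; the function name, the `fmt` values, and the column names are illustrative assumptions rather than the block's actual schema.

```python
import csv
import io
import json

# Hypothetical exporter: input is a list of page texts, in page order.


def export_pages(pages, fmt="text"):
    """Render extracted page texts as plain text, JSON, or CSV."""
    if fmt == "text":
        return "\n\n".join(pages)
    if fmt == "json":
        records = [{"page": i, "text": t} for i, t in enumerate(pages, start=1)]
        return json.dumps(records, ensure_ascii=False)
    if fmt == "csv":
        buffer = io.StringIO()
        writer = csv.writer(buffer, lineterminator="\n")
        writer.writerow(["page", "text"])
        for i, t in enumerate(pages, start=1):
            writer.writerow([i, t])  # the csv module quotes embedded newlines
        return buffer.getvalue()
    raise ValueError(f"unsupported format: {fmt!r}")


csv_output = export_pages(["First page", "Second page"], fmt="csv")
```

Keeping the page number in the JSON and CSV forms lets downstream steps trace each piece of text back to its source page.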

PDF Page Deletion (`pdf_delete_pages`)

Fine-tune document structure by removing unwanted pages.

Range Specification: Remove single pages or page ranges
Batch Operations: Process multiple documents with consistent rules
Structure Preservation: Maintain document integrity during modifications

PDF Annotation (`pdf_annotate`)

Enhance documents with professional annotations and markup tools.

Multiple Types: Text, highlights, notes, and custom stamps
Precise Positioning: Coordinate-based placement system
Color Customization: Full color palette for visual organization
Collaborative Features: Professional markup for document review

🛠️ Technical Specifications

Dependencies

PyPDF2: Core PDF manipulation engine
Pillow (PIL): Image processing for watermarks and conversions
ReportLab: Advanced PDF generation and overlay capabilities
pdfplumber: Enhanced text extraction with layout preservation
pdf2image: PDF to image conversion (requires poppler-utils)
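
Before running a workflow, it can help to confirm that these packages are importable and that poppler is on the PATH. The sketch below is one possible check using only the standard library; the function names are hypothetical, and the poppler check assumes the `pdftoppm` executable that poppler-utils provides.

```python
import shutil
from importlib.util import find_spec

# Package name -> module name to import (they differ for Pillow).
REQUIRED_MODULES = {
    "PyPDF2": "PyPDF2",
    "Pillow": "PIL",
    "ReportLab": "reportlab",
    "pdfplumber": "pdfplumber",
    "pdf2image": "pdf2image",
}


def missing_dependencies():
    """Return the names of required packages that cannot be imported."""
    return [
        package
        for package, module in REQUIRED_MODULES.items()
        if find_spec(module) is None
    ]


def poppler_available():
    """Check for pdftoppm, one of the executables shipped with poppler-utils."""
    return shutil.which("pdftoppm") is not None


if __name__ == "__main__":
    print("Missing packages:", missing_dependencies() or "none")
    print("poppler found:", poppler_available())
```

Using `find_spec` avoids importing the packages themselves, so the check stays fast and has no side effects.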

Performance Features

Memory Efficient: Optimized for large file processing
Batch Operations: Handle multiple documents simultaneously
Error Handling: Comprehensive error reporting and recovery
Progress Tracking: Real-time processing status and completion metrics

Integration Benefits

OOMOL Native: Seamless integration with OOMOL workflow system
Modular Design: Mix and match blocks for custom processing pipelines
Standard Interfaces: Consistent input/output formats across all blocks
Visual UI: Intuitive configuration with visual file selectors and controls

🎯 Use Cases

Business Document Management

Invoice Processing: Extract text, add watermarks, and secure sensitive financial documents
Report Generation: Merge departmental reports, add page numbers, and apply corporate branding
Contract Management: Extract text, add annotations, and implement security controls

Educational Content

Lecture Materials: Convert presentation slides, extract reading materials, and organize content
Research Papers: Merge citations, extract references, and apply academic formatting
Student Submissions: Process assignments, add feedback annotations, and manage document collections

Digital Publishing

Content Creation: Merge documents, add professional layouts, and optimize for distribution
Archive Management: Organize with metadata extraction and secure sensitive materials
Portfolio Development: Merge creative works, add watermarks, and prepare professional presentations

Legal and Compliance

Page-Level Redaction: Remove entire pages while maintaining document integrity
Evidence Management: Secure documents with encryption, add annotations for case notes
Regulatory Submission: Merge compliance documents and ensure proper formatting

🚦 Getting Started

Install Dependencies: Run `poetry install --no-root` to install all required Python packages
Import Blocks: Add desired PDF processing blocks to your OOMOL workflow
Configure Settings: Customize block parameters for your specific use case
Process Documents: Execute workflows and monitor processing results
Export Results: Download processed documents or continue with additional processing steps

📈 Advanced Workflows

Combine multiple blocks to create powerful document processing pipelines:

Document Conversion: Images to PDF → Merging → Watermarking
Security Pipeline: Metadata Extraction → Watermarking → Encryption
Content Extraction: PDF to Images → Text Extraction → Data Processing
Archive Processing: Splitting → Individual Processing → Secure Storage
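
Conceptually, each pipeline above is a chain in which one block's output feeds the next block's input. In OOMOL the blocks are wired together in the workflow editor, so the following is only an illustrative sketch of that chaining idea; the `pipeline` and `step` helpers are hypothetical stand-ins that record block names instead of processing real files.

```python
from functools import reduce

# Hypothetical illustration of chaining: each step takes a document and
# returns a new one. Here a "document" is just a list of applied step names.


def pipeline(*steps):
    """Compose steps left to right into a single callable."""
    def run(document):
        return reduce(lambda doc, step: step(doc), steps, document)
    return run


def step(name):
    """Create a stand-in step that records its name in the document history."""
    def apply(document):
        return document + [name]
    return apply


# The "Document Conversion" pipeline: Images to PDF, then merge, then watermark.
document_conversion = pipeline(
    step("images2pdf"),
    step("pdf_merge"),
    step("pdf_watermark"),
)
history = document_conversion([])
```

Because every step has the same input and output shape, steps can be reordered or swapped without changing the surrounding pipeline.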

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Third-Party Licenses

This project uses the following open-source libraries:

PyPDF2 (BSD-3-Clause License) - PDF manipulation
Pillow (MIT-CMU License) - Image processing
ReportLab (BSD License) - PDF generation
pdfplumber (MIT License) - Text extraction
pdf2image (MIT License) - PDF to image conversion

All dependencies are used in compliance with their respective licenses.

🤝 Contributing

Contributions are welcome! This is an open-source project and we encourage:

Bug reports and feature requests via GitHub Issues
Pull requests for improvements and new features
Documentation enhancements
Use cases and workflow examples

📧 Contact

Author: TaoZeyu
Email: i@taozeyu.com

Transform your document processing capabilities with this comprehensive PDF toolkit designed for professional workflows and seamless OOMOL platform integration.
