Almost always images, fonts, or export settings — here’s what’s inflating the size and how to reduce it
A PDF that should be a few hundred kilobytes coming out at 50MB, or a simple document ballooning to a size that won’t email or upload, has specific causes that are almost always fixable.
PDF file size is driven by a handful of well-understood factors, and knowing which one applies to your file points directly to the right fix.
Images Are Almost Always the Primary Cause
For most oversized PDFs, images are responsible for the majority of the file size. Every photograph, screenshot, illustration, or graphic embedded in a PDF carries its full resolution and color data unless it’s been compressed or downsampled.
A single high-resolution photograph from a modern camera can be 10 to 25MB as a raw file. Embed several of these in a PDF without compression and the file size reflects the sum of all that image data — potentially hundreds of megabytes for an image-heavy document.
The resolution mismatch problem: Screen display requires 72 to 96 pixels per inch. Print quality requires 150 to 300 PPI. Many people embed images at 300 PPI or higher — the resolution their camera produced — even when the PDF is only ever going to be viewed on screen. The extra resolution is invisible to the viewer but massively inflates the file size.
The fix: Downsample images before embedding them or during PDF export. For screen-only PDFs, 96 to 150 PPI is more than sufficient. For print PDFs, 150 to 200 PPI is adequate for most purposes with 300 PPI reserved for professional commercial printing. The file size reduction from downsampling is dramatic — going from 300 PPI to 150 PPI reduces image data by 75 percent.
Export Settings Aren’t Optimized
How you export or create the PDF significantly affects the file size — the same document can produce files varying in size by a factor of ten depending on which export settings are used.
Most applications offer PDF export presets that balance quality and file size for different use cases:
Print quality / Press quality — highest quality settings, maximum image resolution, all fonts embedded, ICC color profiles included. Produces the largest files. Appropriate for commercial printing, inappropriate for email or web.
High quality print — high resolution with some compression. Still large but more reasonable for general printing purposes.
Standard / Web optimized — moderate compression, downsampled images, reduced color profile data. Appropriate for most document sharing.
Minimum size / Smallest file size — aggressive compression, low image resolution, reduced quality. Appropriate for previews, web upload, and situations where file size matters more than visual quality.
If you created your PDF using a print quality preset when a standard or web preset was appropriate, switching the preset and re-exporting reduces file size dramatically without any change to the content.
Uncompressed or Poorly Compressed Images
Beyond resolution, image compression type affects file size significantly. PDF supports multiple image compression formats internally:
JPEG compression for photographs — lossy compression that dramatically reduces file size with minimal visible quality loss at moderate compression levels.
LZW or ZIP compression for graphics with flat colors and sharp edges — lossless compression that preserves quality while reducing size for appropriate image types.
No compression — full uncompressed image data. This produces the largest possible files and is appropriate only in specific professional archival contexts.
When images are embedded without compression — or with only minimal compression — file sizes reflect the full uncompressed image data. Checking your application’s PDF export settings for image compression options and enabling JPEG compression for photographs at quality level 60 to 80 percent produces a dramatically smaller file with negligible visible quality difference at typical viewing sizes.
Fonts Are Fully Embedded Rather Than Subsetted
Fonts embedded in PDFs add to file size, and how they’re embedded matters. There are two approaches:
Full font embedding includes the complete font file — every character, every glyph, every weight variant that exists in the font family. For a font with extensive character sets, this can add several megabytes to a document even if only a few characters from that font are actually used.
Font subsetting embeds only the specific characters used in the document. If your document uses 47 unique characters from a font, only those 47 characters are embedded rather than the full font containing thousands of glyphs. Font subsetting typically reduces font-related file size by 90 percent or more compared to full embedding.
Most PDF export settings default to subsetting at a threshold — something like “subset fonts when less than 100% of characters are used” — which handles most cases automatically. If full embedding is selected in your export settings, switching to subsetting reduces font-related overhead significantly.
Embedded Color Profiles
ICC color profiles are data files that describe how colors in the document should be reproduced across different devices and color spaces. For professional printing workflows, embedded color profiles are important for color accuracy. For most everyday document sharing, they add file size without any visible benefit.
A single embedded ICC profile can add 500KB to 2MB to a PDF. A document with multiple profiles for different embedded images can add significantly more.
In PDF export settings, look for options related to color management, ICC profiles, or output intent. Disabling ICC profile embedding or selecting a minimal color profile option reduces file size for documents that don’t require professional color accuracy.
Transparency and Layer Effects
Transparency effects — drop shadows, glows, blending modes, and opacity settings — require additional data to render correctly in PDF. Documents created in applications like Adobe InDesign, Illustrator, or PowerPoint with heavy use of transparency effects produce larger PDFs because the transparency data has to be preserved or flattened into raster data.
Transparency flattening converts transparent areas into a combination of vector and raster content — the raster portions add image data to the PDF even in what might otherwise be a vector-heavy document. Complex transparency effects over large page areas produce significant additional raster data.
Reducing the use of transparency effects in the source document before export, or choosing a compatibility level in PDF export settings that handles transparency differently (PDF 1.4 and later support live transparency rather than requiring flattening), reduces this overhead.
Scanned Documents Without OCR Compression
Scanned documents are among the worst-case PDF size scenarios. When a physical document is scanned and saved as a PDF, each page becomes a large raster image — a photograph of the page rather than text data. A 20-page scanned document at standard scanner resolution can easily produce a 50MB to 100MB PDF.
OCR (Optical Character Recognition) processing can dramatically reduce scanned PDF sizes by converting the raster text into actual text data, which takes a fraction of the space. After OCR, only actual images need to remain as raster data while text is stored as efficient text characters.
Applications like Adobe Acrobat, ABBYY FineReader, and various online tools can perform OCR on scanned PDFs. Running OCR and then optimizing the PDF can reduce a 100MB scanned document to 5 to 10MB while making the text searchable and selectable as a bonus.
Repeated Image Instances Not Shared
In well-structured PDFs, the same image used multiple times in a document is stored once and referenced multiple times — this is called image resource sharing. In poorly constructed PDFs, the same image embedded multiple times is stored multiple times, multiplying its contribution to file size.
This is more common with PDFs created by certain software tools that don’t optimize resource sharing. A company logo appearing on every page of a 50-page document, stored as a separate image instance each time, adds the logo’s full file size 50 times over.
PDF optimization tools can detect and consolidate duplicate resources, storing them once and replacing duplicates with references — significantly reducing file size for documents with repeated elements.
Embedded Multimedia and Attachments
PDFs can contain embedded video, audio, 3D models, and file attachments — and these add their full file size to the PDF. If your PDF is unexpectedly large and you didn’t intentionally add multimedia, check whether the application that created it embedded anything unintentionally.
In Adobe Acrobat, go to View → Show/Hide → Navigation Panes → Attachments to see whether any files are attached. Remove any attachments you don’t need.
How to Reduce an Existing PDF’s Size
If you already have an oversized PDF and need to reduce it without going back to the source document:
Adobe Acrobat: Go to File → Compress PDF or File → Save As Other → Optimized PDF. The PDF Optimizer gives detailed control over every compression factor — images, fonts, transparency, color profiles. Use the Audit Space Usage button to see which elements are taking the most space before optimizing.
Preview on Mac: Open the PDF, go to File → Export as PDF, and under Quartz Filter select Reduce File Size. This applies basic compression and is quick but less controllable than dedicated tools.
Online tools: Smallpdf, iLovePDF, PDF Compressor, and similar web tools compress PDFs without requiring software installation. Effective for one-off compression needs, though avoid uploading sensitive documents to third-party services.
Ghostscript (free, command line): For technical users, Ghostscript provides powerful PDF compression with full control over settings. The basic command for screen-quality output:
gs -sDEVICE=pdfwrite -dCompatibilityLevel=1.4 -dPDFSETTINGS=/screen -dNOPAUSE -dQUIET -dBATCH -sOutputFile=output.pdf input.pdf
Replace /screen with /ebook, /printer, or /prepress for progressively higher quality and larger output.
A Quick Checklist
Match your situation to the likely cause:
- PDF contains photos or images — downsample images and apply JPEG compression in export settings
- Created with print quality export preset — re-export using standard or web optimized preset
- Scanned document — run OCR and then optimize the PDF
- Lots of transparency effects — reduce effects in source or adjust PDF compatibility level
- Many pages with repeated elements — run PDF optimization to consolidate duplicate resources
- Already have the large PDF — use Acrobat’s PDF Optimizer, Preview’s Reduce File Size, or an online compression tool
The Bottom Line
Oversized PDFs are almost always caused by high-resolution images without compression, print-quality export settings applied to documents that don’t need print quality, or scanned pages stored as uncompressed raster images.
Addressing whichever of these applies — downsampling images, changing the export preset, or running OCR on scanned content — produces dramatic file size reductions.
For existing PDFs that need to be smaller without regenerating from source, Adobe Acrobat’s PDF Optimizer and free online compression tools handle most cases quickly and effectively.
PDF file size reflects what’s inside it — find the largest contributor and compress it, and the file becomes manageable.
Meet Ry, “TechGuru,” a 36-year-old technology enthusiast with a deep passion for tech innovations. With extensive experience, he specializes in gaming hardware and software, and has expertise in gadgets, custom PCs, and audio.
Besides writing about tech and reviewing new products, he enjoys traveling, hiking, and photography. Committed to keeping up with the latest industry trends, he aims to guide readers in making informed tech decisions.