๐ŸŽ‰ Save 10% Extra on the Webequipe PDF Search Plugin Annual Plan โ€” Use code YEARLY10 ยท Limited-time offer ยท Get discount โ†’

How to Make Scanned PDFs Searchable on WordPress โ€” v2.0.0

Two types of PDFs exist on most WordPress sites โ€” only one is searchable by default. Most site owners don’t know which of their PDFs are scanned. This guide explains the difference, how to check, and how to make scanned documents searchable.

The Difference Between Text PDFs and Scanned PDFs

  • Text-based PDF: created digitally in Word, Google Docs etc. Contains actual text layer. Any search plugin can read it.
  • Scanned PDF: a photograph of a physical page. No text layer. Just pixels.
  • How to tell: open PDF in browser, try to select text. If you can highlight words โ†’ text-based. If selection draws a rectangle โ†’ scanned.

Why Scanned PDFs Are Invisible to WordPress Search

  • WordPress search queries post_content field
  • PDF plugins extract text from PDFs and store it
  • Extraction works by reading the text layer
  • Scanned PDFs have no text layer โ€” extraction returns empty
  • No error shown โ€” just silently returns nothing

Step 1 โ€” Set Up the Free Plugin (Text PDFs)

  • Install WebEquipe PDF Search from WordPress.org
  • Activate, go to Settings โ†’ PDF Search โ†’ Re-index All PDFs
  • Test: search a term inside a text-based PDF
  • Stop here if this covers your needs

Step 2 โ€” Enable OCR for Scanned PDFs (Pro)

  • Upgrade to Starter or higher
  • Settings โ†’ PDF Search โ†’ OCR โ†’ enable
  • Enter Google Vision API key
  • Click Bulk OCR Scan
  • Test: search a term from a scanned document

What to Expect After OCR

  • Previously invisible scanned PDFs appear in search
  • Text excerpts show matched content
  • New scanned PDFs OCR’d automatically on upload

Common Questions

  • Does OCR work on password-protected PDFs? Yes on paid plans
  • What languages? Google Vision supports 50+ automatically
  • What happens when OCR limit hit? Queue until next cycle

Conclusion

If you have scanned documents on your site, the free plugin won’t index them. OCR is the only way โ€” PDF Search Pro has it built in.

Search

Related Articles

How to Keep Specific PDFs Out of WordPress Search Results

How to Keep Specific PDFs Out of WordPress Search Results

How to Make Scanned PDFs Searchable in WordPress (Using OCR)

How to Make Scanned PDFs Searchable in WordPress (Using OCR)

FEATURED TOOLS

PDF Search _Web image

PDF Search Pro

Full-text indexing for scanned and digital PDFs in WordPress.

hero-image-plugin (1)

Spin & Win Wheel

Gamified lead capture with customizable spin-to-win rewards.

CATEGORIES

RELATED ARTICLES