Anthropic PDF Skill
Full-stack PDF — read, write, merge, split, watermark, encrypt, fill forms, OCR. The Swiss army knife for anything .pdf-shaped.
Source: github.com
Install
npx degit anthropics/skills/skills/pdf ~/.claude/skills/pdf The pdf skill covers every PDF operation you’d otherwise script with
half a dozen Python libraries. It picks the right tool — pypdf, PyMuPDF,
or shell — based on what you’re trying to do.
What it can do
- Read & extract — text, tables, metadata, images
- Combine & split — merge multiple PDFs, or split one into per-page files
- Annotate — watermarks, page numbers, headers
- Forms — fill PDF forms programmatically (see FORMS.md)
- Security — encrypt, decrypt, password protect
- OCR — make scanned PDFs searchable
- Generate — create new PDFs from scratch with layout control
What’s surprising
- It defaults to
pypdffor fast operations and only escalates toPyMuPDFwhen you need rendering or precise positioning - Forms support is real — including filling, flattening, and verifying
- OCR can be applied selectively (only scanned pages)
Composes with
pptx— convert decks to handout PDFsdocx— convert reports to PDF for distributionbrand-guidelines— apply Anthropic styling to the final PDF
Notes from curation
If your task involves the string “.pdf” anywhere, start here. Most users underestimate how much of “knowledge work” is actually PDF wrangling — this skill is the one that makes that part disappear.
Example prompt
Use the PDF skill to merge these three Q4 reports into one,
add a "CONFIDENTIAL" watermark on every page, and OCR the scanned
appendix so it becomes searchable.
Related skills
Anthropic DOCX Skill
Create, read, edit, and analyze Word documents — table of contents, headings, page numbers, letterheads, tracked changes, comments, image insertion. The full .docx toolkit.
Anthropic PPTX Skill
Generate, read, and edit .pptx decks — speaker notes, layouts, and tables included.
Anthropic XLSX Skill
Open, read, edit, and create spreadsheets — .xlsx, .xlsm, .csv, .tsv. Formulas, formatting, charts, and cleaning messy tabular data.