| pdf_info {pdftools} | R Documentation |
Utilities based on libpoppler for extracting text, fonts, attachments and metadata from a pdf file.
pdf_info(pdf, opw = "", upw = "") pdf_text(pdf, opw = "", upw = "") pdf_fonts(pdf, opw = "", upw = "") pdf_attachments(pdf, opw = "", upw = "") pdf_toc(pdf, opw = "", upw = "")
pdf |
file path or raw vector with pdf data |
opw |
string with owner password to open pdf |
upw |
string with user password to open pdf |
Poppler is pretty verbose when encountering minor errors in PDF files,
in especially pdf_text. These messages are usually safe
to ignore, use suppressMessages to hide them alltogether.
Other pdftools: pdf_render_page
# Just a random pdf file
pdf_file <- file.path(R.home("doc"), "NEWS.pdf")
info <- pdf_info(pdf_file)
text <- pdf_text(pdf_file)
fonts <- pdf_fonts(pdf_file)
files <- pdf_attachments(pdf_file)