You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,
I have a problem reading a pdf file. The document is not corrupted (e.g. pdftools::pdf_text() works fine, but I prefer tabulizer as I have some pages with double columns), and similar pdfs from the same repository does not produce the error. Thanks for developing the package!
Enrico
Please specify whether your issue is about:
a possible bug
a question about package functionality
a suggested code or documentation change, improvement to the code, or feature request
If you are reporting (1) a bug or (2) a question about code, please supply:
ensure that you can install and successfully load rJava
if an error is occurring, include the output of traceback() run immediately after the error occurs
the output of sessionInfo()
Put your code here:
## rJava loads successfully# install.packages("rJava")
library("rJava")
## load package
library("tabulizer")
## code goes herex<-extract_text("https://sitereports.nabunken.go.jp//files/attach/21/21232/15892_1_立石遺跡+大鶴A遺跡+上揚遺跡+前畑遺跡.pdf")
traceback()
## session info for your system
sessionInfo()
The full error message is:
Error in .jcall("RJavaTools", "Ljava/lang/Object;", "invokeMethod", cl, :
java.lang.ClassCastException: class org.apache.pdfbox.cos.COSObject cannot be cast to class org.apache.pdfbox.cos.COSNumber (org.apache.pdfbox.cos.COSObject and org.apache.pdfbox.cos.COSNumber are in unnamed module of loader RJavaClassLoader @372f7a8d)
Hi,
I have a problem reading a pdf file. The document is not corrupted (e.g.
pdftools::pdf_text()
works fine, but I prefer tabulizer as I have some pages with double columns), and similar pdfs from the same repository does not produce the error. Thanks for developing the package!Enrico
Please specify whether your issue is about:
If you are reporting (1) a bug or (2) a question about code, please supply:
traceback()
run immediately after the error occurssessionInfo()
Put your code here:
The full error message is:
Traceback
SessionInfo:
The text was updated successfully, but these errors were encountered: