Quantcast
Channel: Adobe Community : Popular Discussions - CQ5 (read only)
Viewing all articles
Browse latest Browse all 12476

Apache Tika PDF Parser not working in CQ 5.6

$
0
0

Hi all,

 

I was using following code to extract text from PDF in a CQ package.

ContentHandler handler = new BodyContentHandler();

Parser parser = new PDFParser();

parser.parse(a.getOriginal().getStream(), handler, new Metadata(),

new ParseContext());

String text = handler.toString();

 

This works perfectly in CQ 5.5 but in CQ 5.6 I get following exception:

Caused by: java.lang.ClassNotFoundException: org.apache.pfbox.io.RandomAccess not found by org.apache.tika.parsers [58]

          at org.apache.felix.framework.BundleWiringImpl.findClassOrResourceByDelegation(BundleWiringI mpl.java:1499)

          at org.apache.felix.framework.BundleWiringImpl.access$400(BundleWiringImpl.java:75)

          at org.apache.felix.framework.BundleWiringImpl$BundleClassLoader.loadClass(BundleWiringImpl. java:1882)

          at java.lang.ClassLoader.loadClass(Unknown Source)

 

Any ideas on why this class is not being found ?

 

Thanks!


Viewing all articles
Browse latest Browse all 12476

Latest Images

Trending Articles



Latest Images

<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>