Document processing in Liferay portal
<p>I've been using Liferay a lot for the past 2 years, but I have never needed any extensive document management.</p>
<p>Now I have a portlet where users upload documents (MS Office OLE2 documents, ODS documents, PDF, etc.) and I have to persist them with all available metadata.</p>
<p>I know how I would do that without Liferay: I'd probably use Apache Solr with Apache Tika (<a href="http://wiki.apache.org/solr/UpdateRichDocuments" rel="nofollow">UpdateRichDocuments</a> and <a href="http://wiki.apache.org/solr/ExtractingRequestHandler" rel="nofollow">ExtractingRequestHandler</a>), or Apache Jackrabbit, which uses Apache Tika under the hood (org.apache.jackrabbit.extractor.*).</p>
<p>The problem is that, looking at the Liferay trunk, there are some key classes:</p>
<p><code>Hooks (JCRHook, FileSystemHook, CMISHook, S3Hook)</code> that are used more or less directly from within <code>DLLocalServiceImpl</code>.</p>
<p>Another alternative is <code>DLAppLocalServiceImpl</code>, which uses <code>DLRepositoryLocalServiceImpl</code>. The files are persisted into the repository via Hooks there as well, but a lot of additional work is done along the way.</p>
<ol> <li><p>There is no jackrabbit-text-extractors library in Liferay, so I suppose that if I wanted metadata to be extracted from PDF, DOC, and ODS documents, I would have a very hard time, because the DL service layer doesn't accept additional properties.</p> <ol> <li>I think I'd have to avoid the DL services and the JCR hook and access Jackrabbit directly. But then I would lose compatibility and the ability to migrate my repository, etc.</li> </ol></li> </ol>
<p>Could anybody please elaborate on this one? Thank you.</p>
 
