Note that there are some explanatory texts on larger screens.

plurals
  1. POExtract text from PDF in code
    primarykey
    data
    text
    <p>I'm making an app for my school which people can check with if they've got a schedule change. All schedule changes are listed here: <a href="http://www.augustinianum.eu/roosterwijzigingen/14062012.pdf" rel="nofollow">http://www.augustinianum.eu/roosterwijzigingen/14062012.pdf</a>. I want to search that page for a keyword (the user's group, which is entered in an EditText). I've found out how to make the app check if the edittext matches a certain string, so now I only need to download all of the text on that page to a string. But the problem is that it's not a simple webpage, but a PDFpage. I've heard that you need a special pdf library or something to extract the text from the PDF and then put that text into a string and then search the string for keywords using <code>contains()</code>. However I've got some questions about that:</p> <ul> <li><p>This PDF is made with a PDF-creator, it's not a scanned page or so. You can actually for example select the text or search it for keywords using CTRL+F. So I wonder if it is actually required to extract the PDF and stuff or is there maybe an easier way.</p></li> <li><p>I want the app to check for changes every, let's say hour. So it also has to download the PDF and extract the text every hour (about 8 pages), would that consume very much juice?</p></li> <li><p>I've heard that there are many many libraries which do what I want. So which should I use? (If possible, I'd like one which is free :))</p></li> <li><p>Could anyone explain to me how to use it in my code? (I'm not really experienced, so plz keep it a little easy :))</p></li> </ul> <p>THANK YOU ALL SO MUCH!!!</p>
    singulars
    1. This table or related slice is empty.
    plurals
    1. This table or related slice is empty.
    1. This table or related slice is empty.
    1. This table or related slice is empty.
 

Querying!

 
Guidance

SQuiL has stopped working due to an internal error.

If you are curious you may find further information in the browser console, which is accessible through the devtools (F12).

Reload