This Q&A from the New York Times describes ways to siphon the text from a PDF file.