Hi everybody! I want to parse text from PDF file. Is this possible only with .NET framework? I was read about some libraries like iTextSharp or PDFBox. What do you recommend? Please give me some direction or tip where to start.
parsing text from PDF
Page 1 of 13 Replies - 504 Views - Last Post: 28 March 2012 - 10:51 AM
Replies To: parsing text from PDF
#3
Re: parsing text from PDF
Posted 28 March 2012 - 10:49 AM
Psyguy, on 28 March 2012 - 10:37 AM, said:
Here is a possible solution. It's in C#, but that shouldn't slow you down too much.
thank you for the reply. I already read it. I was stunned when I read this:
The size of the required assemblies adds up to almost 16 MB:
IKVM.GNU.Classpath.dll (7 MB)
IKVM.Runtime.dll (360 kB)
PDFBox-0.7.2.dll (8 MB)
#4
Re: parsing text from PDF
Posted 28 March 2012 - 10:51 AM
Ya, I was pretty impressed with the size of the files too. The author did note, however, that the process seems to complete fairly quickly. I guess if the program isn't going to be downloaded, but rather installed from disk, then it wouldn't be an issue.
Page 1 of 1
|
|

New Topic/Question
Reply



MultiQuote




|