3 Replies - 1415 Views - Last Post: 28 March 2012 - 10:51 AM Rate Topic: -----

#1 sela007  Icon User is offline

  • D.I.C Addict

Reputation: 138
  • View blog
  • Posts: 838
  • Joined: 21-December 11

parsing text from PDF

Posted 28 March 2012 - 10:32 AM

Hi everybody! I want to parse text from PDF file. Is this possible only with .NET framework? I was read about some libraries like iTextSharp or PDFBox. What do you recommend? Please give me some direction or tip where to start.
Is This A Good Question/Topic? 1
  • +

Replies To: parsing text from PDF

#2 Psyguy  Icon User is offline

  • D.I.C Regular
  • member icon

Reputation: 69
  • View blog
  • Posts: 314
  • Joined: 12-January 11

Re: parsing text from PDF

Posted 28 March 2012 - 10:37 AM

Here is a possible solution. It's in C#, but that shouldn't slow you down too much.
Was This Post Helpful? 1
  • +
  • -

#3 sela007  Icon User is offline

  • D.I.C Addict

Reputation: 138
  • View blog
  • Posts: 838
  • Joined: 21-December 11

Re: parsing text from PDF

Posted 28 March 2012 - 10:49 AM

View PostPsyguy, on 28 March 2012 - 10:37 AM, said:

Here is a possible solution. It's in C#, but that shouldn't slow you down too much.

thank you for the reply. I already read it. I was stunned when I read this:
The size of the required assemblies adds up to almost 16 MB:

IKVM.GNU.Classpath.dll (7 MB)
IKVM.Runtime.dll (360 kB)
PDFBox-0.7.2.dll (8 MB)
Was This Post Helpful? 0
  • +
  • -

#4 Psyguy  Icon User is offline

  • D.I.C Regular
  • member icon

Reputation: 69
  • View blog
  • Posts: 314
  • Joined: 12-January 11

Re: parsing text from PDF

Posted 28 March 2012 - 10:51 AM

Ya, I was pretty impressed with the size of the files too. The author did note, however, that the process seems to complete fairly quickly. I guess if the program isn't going to be downloaded, but rather installed from disk, then it wouldn't be an issue.
Was This Post Helpful? 0
  • +
  • -

Page 1 of 1