  1. #1
    Just Joined!
    Join Date
    Jul 2010
    Posts
    3

    HTML to plain text converter in C


    Hi,
    I am looking for a source package that converts an HTML file to a text file without using Perl.

    I mainly need to do this on an ARM platform, so if I can get the sources I can cross-compile them. I need the source package to be written in C.

    Thanks in advance,
    Cheers!!
    -Senthil

  2. #2
    Linux Guru Rubberman
    Join Date
    Apr 2009
    Location
    I can be found either 40 miles west of Chicago, in Chicago, or in a galaxy far, far away.
    Posts
    11,509
    Why can't you use Perl? It certainly runs on ARM-based systems. Anyway, I don't know of such a tool in C or C++, but I'd recommend checking sourceforge.net, since it hosts a lot of FOSS utilities of a similar nature.
    Sometimes, real fast is almost as good as real time.
    Just remember, Semper Gumbi - always be flexible!

  3. #3
    Just Joined!
    Join Date
    Jul 2010
    Posts
    3
    Hi,

    Thanks for the suggestion. However, the rest of my functionality is written in C, so I am really looking for this functionality in C as well. I already searched sourceforge.net and couldn't find any suitable source, so I am looking for suggestions from the experts here.

    Cheers!!
    -Senthil

  4. #4
    Linux Guru Rubberman
    Join Date
    Apr 2009
    Location
    I can be found either 40 miles west of Chicago, in Chicago, or in a galaxy far, far away.
    Posts
    11,509
    Well, this may seem to be a "simple" problem, but there are a lot of curves in the road. The question is, exactly what do you want to get out of a web page as text? Give some examples.

  5. #5
    Just Joined!
    Join Date
    Jul 2010
    Posts
    3
    Hi,

    Basically, I am looking to extract only the text content of an HTML file, stripping out all the HTML tags and HTML-related information. I want just the text that is displayed when the HTML file is rendered.

    Cheers!!
    -Senthil
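
The requirement in post #5 (drop the tags, keep the displayed text) can be roughly sketched in plain C with no dependencies beyond the standard library, which should make it easy to cross-compile for ARM. This is only an illustration, not an existing package: the names `html_to_text`, `tag_is`, `is_block` and `emit` are made up here. It assumes the whole document is already in memory as a NUL-terminated string, decodes only five common entities, drops the contents of `script` and `style` elements, and turns a handful of block-level tags into line breaks. It does not handle comments, numeric entities, a `>` inside an attribute value, or character encodings, which are some of the "curves in the road" mentioned in post #4.

```c
#include <assert.h>
#include <ctype.h>
#include <stddef.h>
#include <string.h>

/* Case-insensitive check that the tag name at p equals the lowercase `name`. */
static int tag_is(const char *p, const char *name)
{
    size_t n = strlen(name);
    for (size_t i = 0; i < n; i++)
        if (tolower((unsigned char)p[i]) != name[i])
            return 0;
    return !isalnum((unsigned char)p[n]);   /* so "p" does not match "pre" */
}

/* Tags that should start a new line in the output (an arbitrary short list). */
static int is_block(const char *name)
{
    static const char *const blocks[] = {
        "p", "br", "div", "li", "tr", "h1", "h2", "h3", "h4", "h5", "h6"
    };
    for (size_t i = 0; i < sizeof blocks / sizeof blocks[0]; i++)
        if (tag_is(name, blocks[i]))
            return 1;
    return 0;
}

/* Append c, collapsing repeated spaces and line breaks; returns the new length. */
static size_t emit(char *out, size_t o, char c)
{
    int at_break = (o == 0 || out[o - 1] == '\n');
    if (c == ' ' && (at_break || out[o - 1] == ' '))
        return o;
    if (c == '\n') {
        if (at_break)
            return o;
        if (out[o - 1] == ' ') {            /* replace a trailing space */
            out[o - 1] = '\n';
            return o;
        }
    }
    out[o] = c;
    return o + 1;
}

/* Writes the text of `in` to `out` (truncating to outsz - 1 characters).
 * Returns the length of the result. */
size_t html_to_text(const char *in, char *out, size_t outsz)
{
    static const struct { const char *name; char ch; } entities[] = {
        {"&amp;", '&'}, {"&lt;", '<'}, {"&gt;", '>'},
        {"&quot;", '"'}, {"&nbsp;", ' '}
    };
    const size_t nent = sizeof entities / sizeof entities[0];
    const char *skip = NULL;    /* "script" or "style" while inside one */
    size_t o = 0;

    assert(in != NULL && out != NULL && outsz > 0);

    while (*in != '\0' && o + 1 < outsz) {
        if (*in == '<') {
            const char *name = in + 1;
            int closing = (*name == '/');
            if (closing)
                name++;
            if (skip != NULL) {
                if (!(closing && tag_is(name, skip))) {
                    in++;               /* a '<' inside script/style text */
                    continue;
                }
                skip = NULL;
            } else if (!closing && tag_is(name, "script")) {
                skip = "script";
            } else if (!closing && tag_is(name, "style")) {
                skip = "style";
            } else if (is_block(name)) {
                o = emit(out, o, '\n');
            }
            while (*in != '\0' && *in != '>')   /* skip the rest of the tag */
                in++;
            if (*in == '>')
                in++;
            continue;
        }
        if (skip != NULL) {             /* discard script/style contents */
            in++;
            continue;
        }
        if (*in == '&') {
            size_t i;
            for (i = 0; i < nent; i++) {
                size_t len = strlen(entities[i].name);
                if (strncmp(in, entities[i].name, len) == 0) {
                    o = emit(out, o, entities[i].ch);
                    in += len;
                    break;
                }
            }
            if (i < nent)
                continue;               /* entity decoded */
        }
        /* Ordinary text: any whitespace in the source counts as a space. */
        o = emit(out, o, isspace((unsigned char)*in) ? ' ' : *in);
        in++;
    }
    while (o > 0 && (out[o - 1] == ' ' || out[o - 1] == '\n'))
        o--;                            /* trim trailing whitespace */
    out[o] = '\0';
    return o;
}
```

To use it, read the HTML file into a buffer, NUL-terminate it, call `html_to_text(html, text, sizeof text)` and write `text` out. For real-world pages a proper HTML parser will be more robust than a hand-written scanner like this one.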
