Skip to Main Content

Java Programming

Announcement

For appeals, questions and feedback about Oracle Forums, please email oracle-forums-moderators_us@oracle.com. Technical questions should be asked in the appropriate category. Thank you!

Interested in getting your voice heard by members of the Developer Marketing team at Oracle? Check out this post for AppDev or this post for AI focus group information.

Read in HTML file, now need ot get rid of the HTML part

807607Dec 31 2006 — edited Jan 1 2007
I'm able to connect to a website and read in the HTML fine, I now need to be able to get rip of the HTML coding, which will leave me with only the text.

So if I was to read in this website:

http://news.bbc.co.uk/1/hi/world/africa/6220797.stm

Then all I want is a string that holds the story text.

How is the best way of approaching this? Are there any in built classes that will help me get rid of the HTML coding.

Thanks
Comments
Locked Post
New comments cannot be posted to this locked post.
Post Details
Locked on Jan 29 2007
Added on Dec 31 2006
6 comments
161 views