Read in HTML file, now need ot get rid of the HTML part
807607Dec 31 2006 — edited Jan 1 2007I'm able to connect to a website and read in the HTML fine, I now need to be able to get rip of the HTML coding, which will leave me with only the text.
So if I was to read in this website:
http://news.bbc.co.uk/1/hi/world/africa/6220797.stm
Then all I want is a string that holds the story text.
How is the best way of approaching this? Are there any in built classes that will help me get rid of the HTML coding.
Thanks