Web scraping

I wish to create something (preferably using LotusScript) that will access web pages and pull the data from them before moving on to another page and doing the same.

First of all, is it possible to do this and if it is, what is the best way of doing it?

Thanks

Subject: Try something based on this

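' Sketch only: drives Internet Explorer through OLE automation, so it needs Windows
Dim ie As Variant
Dim sHTML As String
Dim objColl As Variant

Set ie = CreateObject("InternetExplorer.Application")
ie.Navigate "http://www.yoursitehere.com/"

' Wait for the page to finish loading (Sleep takes seconds; ReadyState 4 means complete)
While ie.Busy Or ie.ReadyState <> 4
	Sleep 5
Wend

' Walk every element on the page and print its markup
Set objColl = ie.document.all
Forall x In objColl
	Print x.innerHTML
End Forall

' Close the browser instance when finished
Call ie.Quit()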

This assumes the web page is just tag soup. If it is XHTML then you could load it into an XML parser and extract data via XML methods instead.
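
For the XHTML case, here is a rough sketch of the XML-parser route. It is written in Java rather than LotusScript, using only the XML classes that ship with the JDK. The class name XhtmlCells, the method cellTexts and the choice of pulling out every td cell are made up for illustration, and it assumes the page really is well-formed XML (an undefined entity such as &nbsp; would make the parse fail once the DTD is skipped).

```java
import java.io.StringReader;
import java.util.ArrayList;
import java.util.List;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.xpath.XPathConstants;
import javax.xml.xpath.XPathFactory;
import org.w3c.dom.Document;
import org.w3c.dom.NodeList;
import org.xml.sax.InputSource;

// Hypothetical helper: returns the text of every <td> cell in an XHTML string.
class XhtmlCells {

    static List<String> cellTexts(String xhtml) throws Exception {
        DocumentBuilderFactory factory = DocumentBuilderFactory.newInstance();
        // Xerces-specific feature (supported by the parser bundled with the JDK):
        // do not download the external DTD named in the page's DOCTYPE.
        factory.setFeature(
                "http://apache.org/xml/features/nonvalidating/load-external-dtd", false);

        // The factory is not namespace-aware by default, so a plain
        // XPath such as //td can be used without declaring a prefix.
        Document doc = factory.newDocumentBuilder()
                .parse(new InputSource(new StringReader(xhtml)));

        NodeList cells = (NodeList) XPathFactory.newInstance().newXPath()
                .evaluate("//td", doc, XPathConstants.NODESET);

        List<String> texts = new ArrayList<>();
        for (int i = 0; i < cells.getLength(); i++) {
            texts.add(cells.item(i).getTextContent().trim());
        }
        return texts;
    }
}
```

Changing the XPath expression is usually all it takes to target a different part of the page, which is the main attraction over walking ie.document.all by hand.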

Subject: Simple with Java, not so much with LS

You can quite easily pull down a web page using HttpURLConnection.

It's also rather simple to parse the HTML using javax.swing.text.html. These classes are all included in the JVM.

If you have to use LS, I imagine you would have to use some sort of VB object from the OS, and would of course be limiting yourself to Windows (which might be acceptable in your environment).
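
As a minimal sketch of the Java route: the class below fetches a page with HttpURLConnection and hands it to the Swing HTML parser (ParserDelegator with an HTMLEditorKit.ParserCallback), collecting each non-blank run of text. The class name PageScraper, the method names, the ten-second timeouts and the assumption that the page is UTF-8 are all illustrative choices, not anything required by the APIs.

```java
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.Reader;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;
import javax.swing.text.html.HTMLEditorKit;
import javax.swing.text.html.parser.ParserDelegator;

// Hypothetical helper class: downloads a page and returns its text chunks.
class PageScraper {

    // Parse HTML from any Reader and collect each non-blank run of text.
    static List<String> extractText(Reader in) throws IOException {
        final List<String> chunks = new ArrayList<>();
        HTMLEditorKit.ParserCallback callback = new HTMLEditorKit.ParserCallback() {
            @Override
            public void handleText(char[] data, int pos) {
                String text = new String(data).trim();
                if (!text.isEmpty()) {
                    chunks.add(text);
                }
            }
        };
        // true = ignore any charset declared inside the page itself
        new ParserDelegator().parse(in, callback, true);
        return chunks;
    }

    // Fetch one page over HTTP and extract its text (assumes UTF-8 content).
    static List<String> fetchText(String address) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(address).toURL().openConnection();
        conn.setRequestMethod("GET");
        conn.setConnectTimeout(10_000);
        conn.setReadTimeout(10_000);
        try (Reader in = new InputStreamReader(conn.getInputStream(), StandardCharsets.UTF_8)) {
            return extractText(in);
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        if (args.length != 1) {
            System.err.println("usage: java PageScraper <url>");
            return;
        }
        for (String chunk : fetchText(args[0])) {
            System.out.println(chunk);
        }
    }
}
```

To move on to another page and do the same, call fetchText in a loop over a list of addresses. Overriding handleStartTag in the same callback would let you keep only the text inside particular tags.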

Subject: scrapers

And you got copyright permission to do all this exactly how??

Subject: copyright

To get the nutritional content of certain foods, do I really need copyright permission when I could just enter these values into a database manually after looking at the back of a packet?