First page Back Continue Last page Graphics
Webpage Scraping
- On an earlier slide a webpage placed on the PIC was described with Macros that would substitute collected data into any web response:
- AdcValues=["%n00","%n01","%n02","%n03","%n04","%n05","%n06","%n07","%n08","%n09","%n0A","%n0B"];
- So, returned http fetch response might look like this:
- AdcValues=[“1329",“0",“332“, etc];
- ScanEngine Explorer http scraping script
- page [jsdata.cgi]
- port [80]
- find [AdcValues=]
- name [AdcValues0]
- find ["]
- text_upto ["]
- name [AdcValues1]
- find [,"]
- text_upto ["]
- more of the same
- Provide a name for the next scraped text
- Advance to the next quote,
- and scrape text to the next quote
- Provide a name for the next scraped text
- Advance to the next comma/quote,
- and scrape text to the next quote
- In ScanEngine Explorer a web page scraping script can be created and associated with any fetched page, and further assign each scraped text (values) with names we define here … in this case, AdcValues0, AdcValues1, etc