WebStripper™
FAQ - Why can't WebStripper download this page?
Do you have the latest version?
We're striving to improve WebStripper all the
time, and whenever a problem comes to light we'll do our best to solve
it. If you're having problems, we recommend downloading the latest
version first. If somebody else has reported the problem since
your version was released, it may well have been fixed already.
Have you got the right settings?
The default settings for a site make WebStripper
look in the folder you specify and any subfolders. It won't look
for HTML files in different parts of the folder tree unless you
tell it to. This means you can strip just a small section of a large
site with ease, but some sites don't use such a sensible structure
and store index files in a subdirectory. With these sites you either
have to uncheck the option 'Sub-pages only' (Range page of the Add Site
or Site Properties windows) or find a start page which is in a better
location.
TIP: Hold the mouse cursor over some links and see in the status bar
of your browser where the links point. Check that the links are in
the same folder as (or a subfolder of) the page you are looking at.
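The check described in the tip can also be done in a few lines of code. The sketch below is only an illustration of the "same folder or subfolder" rule, not WebStripper's actual logic; the function name in_scope and the example.com addresses are made up for the example.

```python
from urllib.parse import urljoin, urlsplit

def in_scope(start_url: str, link: str) -> bool:
    """Return True if link points into the start page's folder or a subfolder."""
    start = urlsplit(start_url)
    # Resolve relative links (such as "../images/logo.gif") against the start page.
    target = urlsplit(urljoin(start_url, link))
    # A link on a different server is never inside the start folder.
    if (start.scheme, start.netloc.lower()) != (target.scheme, target.netloc.lower()):
        return False
    # Folder of the start page: everything up to and including the last "/".
    folder = start.path.rsplit("/", 1)[0] + "/"
    return target.path.startswith(folder)

start = "http://example.com/docs/index.html"
print(in_scope(start, "guide/ch1.html"))       # link in a subfolder
print(in_scope(start, "../images/logo.gif"))   # link outside the start folder
```

If most links on your start page fail a check like this, that is a sign you need to uncheck 'Sub-pages only' or pick a start page higher up the folder tree.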
Another thing to check if graphics are missing from the page is whether the
graphics are stored on another server. If they are, you'll need to check 'Fetch graphics from other servers',
again on the Range page.
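If you want to find out whether a page keeps its graphics on another server, a short script can list them. This is a rough sketch that only looks at img tags and assumes you have already saved the page's HTML source; the names ImageCollector and off_server_images and the example addresses are invented for illustration.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin, urlsplit

class ImageCollector(HTMLParser):
    """Collects the absolute URL of every <img src=...> on a page."""

    def __init__(self, page_url):
        super().__init__()
        self.page_url = page_url
        self.images = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            src = dict(attrs).get("src")
            if src:
                # Turn relative image paths into full URLs.
                self.images.append(urljoin(self.page_url, src))

def off_server_images(page_url, html):
    """Return the image URLs whose host differs from the page's host."""
    collector = ImageCollector(page_url)
    collector.feed(html)
    host = urlsplit(page_url).netloc.lower()
    return [u for u in collector.images if urlsplit(u).netloc.lower() != host]

page = '<img src="logo.gif"><img src="http://img.example.net/banner.gif">'
print(off_server_images("http://example.com/a/index.html", page))
```

Any address this prints is a graphic that is only fetched when 'Fetch graphics from other servers' is checked.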
What WebStripper can and can't download
WebStripper does a very good job of downloading
standard HTML web pages, but there are some special types of HTML
which it can't download and some types of links it can't follow.
Support for all these types of links will
be added to future versions of WebStripper. Please be patient. If
you want to get the latest information, sign up to the wbs-announce
mailing list.
Some password protected sites
WebStripper can handle username/password
protected sites which use the HTTP standard - the ones where you
get a pop-up box from your browser. It can't handle sites where
you have to log in via a form.
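To see the difference between the two kinds of login, it helps to know what the HTTP standard method looks like on the wire. With the common "Basic" scheme, the server answers 401 with a WWW-Authenticate header, and the client resends the request with an Authorization header built from the username and password. The sketch below shows that scheme only; it is not taken from WebStripper, and the function names are made up for the example.

```python
import base64

def uses_http_auth(status, headers):
    """True if a response asks for HTTP-standard (pop-up box) authentication."""
    names = {name.lower() for name in headers}
    return status == 401 and "www-authenticate" in names

def basic_auth_header(username, password):
    """Build the value of the Authorization header for HTTP Basic authentication."""
    token = base64.b64encode(f"{username}:{password}".encode("utf-8")).decode("ascii")
    return f"Basic {token}"

print(uses_http_auth(401, {"WWW-Authenticate": 'Basic realm="members"'}))
print(basic_auth_header("Aladdin", "open sesame"))
```

A site that logs you in through a form does none of this. It returns an ordinary page containing the form, so there is no 401 response to react to.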
Secure pages
WebStripper can't download secure HTML pages (i.e. those starting with https://).
Links
Some JavaScript. WebStripper does its best
to extract links from JavaScript, but it's hard for it to know what
is a link and what is an ordinary string.
Java. WebStripper is unable to parse Java files.
Links in Flash animations. WebStripper can download the animations
but is unable to get the links within them. If the animation is only on the entry page to the site,
you can usually get around this by starting the download inside the site.
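To illustrate why links in JavaScript are hard to spot, here is a small sketch of one possible guessing approach: pull out every quoted string and keep the ones that look like a web address or a page name. This is an assumed heuristic for illustration, not the method WebStripper uses, and the names and patterns are invented for the example.

```python
import re

# Any single- or double-quoted string in the script.
STRING_LITERAL = re.compile(r"""(['"])(.*?)\1""")
# Strings that look like an absolute URL or a path ending in a known extension.
LINK_LIKE = re.compile(r"^(https?://\S+|[\w./-]+\.(html?|gif|jpe?g))$", re.IGNORECASE)

def guess_links(script):
    """Return the quoted strings in a script that look like links."""
    candidates = [m.group(2) for m in STRING_LITERAL.finditer(script)]
    return [s for s in candidates if LINK_LIKE.match(s)]

script = """
window.open('products/list.html');
var msg = "Loading, please wait";
var page = 'item' + n + '.html';
"""
print(guess_links(script))
```

The first string is found, and the ordinary message is correctly ignored, but the page name built from 'item', n and '.html' only exists when the script runs, so a guess like this misses it.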
Cookies
Some sites store data on your machine in
'cookies'. These cookies are then retrieved by other pages on the
site. WebStripper V2.68 and later versions include cookie support.
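In outline, cookie support means remembering the values a server sends in Set-Cookie headers and sending them back in a Cookie header on later requests to the same site. The sketch below shows that round trip using Python's standard library; it is a simplified illustration (it ignores paths, domains and expiry) and does not describe WebStripper's own implementation. The function name is made up for the example.

```python
from http.cookies import SimpleCookie

def cookie_header_from(set_cookie_values):
    """Build a Cookie request header from a list of Set-Cookie header values."""
    jar = SimpleCookie()
    for value in set_cookie_values:
        # Parses "name=value; Path=/" and stores the cookie under its name.
        jar.load(value)
    return "; ".join(f"{name}={morsel.value}" for name, morsel in jar.items())

received = ["session=abc123; Path=/", "theme=dark"]
print(cookie_header_from(received))
```

A downloader without this step asks for each page as a brand new visitor, which is why cookie-dependent pages could come back incomplete in versions before V2.68.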
Most material on the internet is copyrighted.
If you intend to use downloaded material for
anything other than personal use, you must obtain the copyright holder's permission first.
Solent Software is opposed to copyright theft.