wget: "following links" vs. "retrieving directory tree"
(Sorry for the xxx below, newbs are not allowed to post certain banned strings.)
I want to mirror a website using wget -r, but even after reading the wget manual, I'm still a little unclear on how it works. With a command like:
Code:
wget -r --level=inf hxxp://xxx.foo.com/bar
will wget automatically follow all html links and recreate the entire directory tree under the bar/ directory? What if I don't want to follow any html links but still want to recreate the entire directory tree under the bar/ directory? What if I want the opposite: I don't need the directory tree but just want to follow all links in bar/index.html, up to say, N links away from index.html? I don't see how these operations could be distinguished using wget's recursive retrieval capabilities. Thanks for any input!