About wget behaviour: will it do *just* what i want? (folder mirroring)
by dedec0 from LinuxQuestions.org on (#5D396)
Hello, all,
i want to mirror a folder in a site, and i built the following command line to do it with wget:
Code:$ wget --mirror --convert-links --html-extension \
--wait=`echo "scale=0;$RANDOM%31" |bc` \
-o log http://www.domain.com.br/content/The folder content is a normal folder which will have one link for each file inside it, one link for each subfolder that leads to other similar folders with more file and (possibly) more folder, all of these without any index.* files to do anything different. But the link i give to wget has a "parent folder" which is the site root, which will have things i do not want to mirror. What will wget do?
I am executing it with verbose on (which is its default behaviour, according to its manual), but it does not tell what it is downloading. I just see that several files i want are already here.
What do you recommend?


i want to mirror a folder in a site, and i built the following command line to do it with wget:
Code:$ wget --mirror --convert-links --html-extension \
--wait=`echo "scale=0;$RANDOM%31" |bc` \
-o log http://www.domain.com.br/content/The folder content is a normal folder which will have one link for each file inside it, one link for each subfolder that leads to other similar folders with more file and (possibly) more folder, all of these without any index.* files to do anything different. But the link i give to wget has a "parent folder" which is the site root, which will have things i do not want to mirror. What will wget do?
I am executing it with verbose on (which is its default behaviour, according to its manual), but it does not tell what it is downloading. I just see that several files i want are already here.
What do you recommend?