Python Forum

Full Version: I wan't to Download all .zip Files From A Website (Project AI)
You're currently viewing a stripped down version of our content. View the full version with proper formatting.
Pages: 1 2 3 4 5 6 7
I tried this code in Wget before, I have noticed that all the Project AI Zip Files, on the www.flightsim.com Website, begin with pai.

And I found out you can download only .zip Files beginning with pai after an example given on the WWW :-

wget -e robots=off -r --accept='pai*.zip' https://www.flightsim.com

But unfortunately, only scraped for a few seconds, some sort of block maybe ? I am not sure, I got the following info shown in Command Prompt :-

C:\wget>wget -e robots=off -r --accept='pai*.zip' https://www.flightsim.com
--2018-08-27 17:36:57-- https://www.flightsim.com/
Resolving www.flightsim.com (www.flightsim.com)... 104.28.1.19, 104.28.0.19
Connecting to www.flightsim.com (www.flightsim.com)|104.28.1.19|:443... connecte
d.
HTTP request sent, awaiting response... 303 See Other
Location: https://www.flightsim.com/vbfs/content.p...Simulation
-Downloads-And-More [following]
--2018-08-27 17:36:57-- https://www.flightsim.com/vbfs/content.php?1-Home-PC-Fl
ight-Simulation-Downloads-And-More
Reusing existing connection to www.flightsim.com:443.
HTTP request sent, awaiting response... 200 OK
Length: unspecified [text/html]
Saving to: 'www.flightsim.com/index.html.tmp'

www.flightsim.com/i [ <=> ] 106.19K 487KB/s in 0.2s

2018-08-27 17:36:59 (487 KB/s) - 'www.flightsim.com/index.html.tmp' saved [10874
2]

Removing www.flightsim.com/index.html.tmp since it should be rejected.

FINISHED --2018-08-27 17:36:59--
Total wall clock time: 2.2s
Downloaded: 1 files, 106K in 0.2s (487 KB/s)

C:\wget>
Quote:And I found out you can download only .zip Files beginning with pai after an example given on the WWW
say what? that's just bogus information.

You are all over the place with this. You don't seem to be able to stick with one idea until it works. Sorry, but I'm not willing to come along for the ride.

If you want to use wget, read the docs, that's what they are there for: https://www.gnu.org/software/wget/manual/wget.html
Many apologies Larz60+, I am not trying to mess you around, I was just getting frustrated at the fact, I kept hitting problems with my Python Code, your help has been much appreciated. I have scrubbed using the Wget Web-Scraping Method, partially as I know it is heavily frowned upon and also as I am failing miserably know matter what i try.

Snippsat as you have been excellent in helping me out before, would you be willing to help me with my Python Code ? I am embarrassed at how inept I am at Python, although I do get alot of the coding methods to some extent. Could you go back through the previous pages of this thread, and give me some pointers, as to where I am going wrong ? I would appreciate it if you could.

Regards

Eddie
I have a feeling, that my code is longer, than it needs to be. I wish I was as good at Python, as others on this forum.
OK,
I am going to use the code that Dead-eye and snippsat provided, and start from there.
I don't have a password for this site, so I will register, otherwise I can't actually run the complete code.
Can get through this pretty quickly.
I'll be Back soon
You are very kind, to help me Larz60. Many thanks.
can you please (using a browser) find a page where you can manually download the first zip file.
Make sure you can actually retrieve the zip file, then post that URL here.
There's going to be a delay here. I'm Atlantic Time and need to sleep for a few more hours. so will be back in about 4 hours.
Thats fine Larz60+, I will do that for you, here is the Website Url :-

https://www.flightsim.com/vbfs/fslib.php...537&page=1

And here is the Website Url, to the first .zip File on page 1 :-

https://www.flightsim.com/vbfs/fslib.php...&fid=64225

Eddie
working on it now.
Thankyou so much Larz60+, I appreciate you doing this for me, very much.
Pages: 1 2 3 4 5 6 7