Issues downloading files from EMPIAR

Hi, I’ve been trying to download EMPIAR-10025 for the tutorial but I’m having issues with IBM aspera connect and HTTP.

I’ve tried with multiple browsers both on ubuntu and windows but they all fail to detect aspera connect even though I have it running. On windows aspera connect says Error: server aborted session: management authorization refused.

Through HTTP I’ve had success downloading smaller files but even 37gb files fail. I’m working on getting faster internet but even then 1tb+ downloads would take a few hours so I suspect the same issues would reoccur.

Any advice would be appreciated.

If you have to use a GUI, Firefox with the DownThemAll! extension will allow you to bulk download the ftp folder.

Otherwise, if you’re comfortable in a commandline, wget works fine if you point it at the directory. If some inspiration is needed to get started, the TeamTomo website has an example script which can be adjusted appropriately. :smiley:

ebi.ac.uk has added robots.txt to block bulk downloads by wget. :unamused: That’s likely due to Google, Facebook, OpenAI et al being greedy and scraping everything they can get their sticky little fingers in but I’d still check with them that they don’t mind if you wget what you want - they shouldn’t, if it’s for academic use, after all, that’s what EMPIAR is there for!

edit: Directly calling individual files still works, so scripting it should be OK.

Thank you Resolver of issues. I’m now using .aria2 to download the mrc files directly from ftp. If I rerun the same .aria2 command would it download the failed files and leave out the downloaded one?

Unknown, sorry.

Posts must be at least 20 characters.