It looks like you're new here. If you want to get involved, click one of these buttons!
Geraldine_VdAuwera
Posts: 2,239Administrator, GSA Official Member admin
We make various files available for public download from the GSA FTP server, such as the GATK resource bundle and presentation slides. We also maintain a public upload feature for processing bug reports from users.
There are two logins to choose from depending on whether you want to upload or download something:
location: ftp.broadinstitute.org
username: gsapubftp-anonymous
password: <blank>
location: ftp.broadinstitute.org
username: gsapubftp
password: 5WvQWSfi
Geraldine Van der Auwera, PhD
Comments
Hi Geraldine,
I have checked all the main directories on ftp.broadinstitute.org, but I cannot find the bundle anymore. Can you tell me exactly where it is?
Thanks Eva
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Eva, there should be a
bundle/directory right at the root of the FTP server.Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Geraldine,
I also can not find the bundle/ directory.
Thanks
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Make sure you are connecting as user
gsapubftp-anonymous(for downloads). If you connect as usergsapubftp(which is for uploads), you will not see the bundle.Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •I also cannot find the bundle/ directory, and there is no prompt to connect using a username and password. Thanks for any help.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •@Ick, I'm not sure what you mean by "no prompt". What program are you using?
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
-1 • Off Topic Disagree Agree Like 1WTF •I'm just using Internet Explorer 8.0, which I've often used in the past to download components of the resource bundle.
If I enter the address as ftp://ftp.broadinstitute.org/, it brings up a list of directories and files (distribution/, incoming/, outgoing/, ftp, pub and welcome.msg). I don't know how to ask it to let me log in using a username/password. Is there another way I should be doing this? Thanks!
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •I would recommend using a separate program like FileZilla, which will make it much easier for you to set up and manage your file transfers.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Thanks, I'll do that.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Geraldine, thanks for the explanation. I found the bundle now. I think our confusion came from the fact that until one or two weeks ago one could access the bundle with an internet browser simply by clicking on ftp://ftp.broadinstitute.org/
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Seems like the reference ucsc.hg19.fasta in hg19 folder is not sorted correctly. It throws an error when I ran the program.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Are you sure it's not your bam that is mis-sorted? That seems the more likely explanation (unless you can show us otherwise)...
Eric Banks, PhD -- Group Leader, Methods Development, MPG, Broad Institute of Harvard and MIT
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Eric, does it need to be sorted in the order of 1,2,3,4,...X,Y,MT as described in the introduction?
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •No, the official ordering for UCSC is a little different: chrM, chr1, chr2, ..., chrX, chrY - which is exactly the ordering in the fasta in our bundle.
Eric Banks, PhD -- Group Leader, Methods Development, MPG, Broad Institute of Harvard and MIT
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •I see. Thanks!
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi, I was wondering if there's a way to do this from the command line. I tried ftp but the cluster I'm logged in to doesn't allow it.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •If you're just using a browser and not FileZilla to download from the FTP site, you can specify the username like this:
ftp://gsapubftp-anonymous@ftp.broadinstitute.org
And then you should see the bundle/ directory.
- Spam
- Abuse
- Troll
4 • Off Topic Disagree 2Agree 2Like WTF •Or to grab all the bundle/1.5/ files in unix:
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Sure -- just keep in mind that when we update the bundle, you'll want to change the version number, and you might not know you need to do that unless you look at what's in the bundle.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
1 • Off Topic Disagree 1Agree Like WTF •Here is a script that will find the latest version and download one of the resources, e.g. hg19:
It requires that curl, gzip, md5sum and wget are installed. To download a bundle other than hg19, change the RESOURCE variable.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Better (and last) version of the script above:
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Thanks! It works perfectly. Cheers, Fernando
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Geraldine, Would it be possible that, in the future, the resource bundle could be made accessible using rsync? This will shrink that data transfer load when updating....
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Didier,
That's an interesting idea; we don't have the resources to work on that right now but I'll try to get that on our TODO list of improvements for the future. Can't guarantee when that'll get done though, it could be a while -- that TODO list is already pretty long!
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •I am sure if you tell your network administrator that you can shrink the network data transfer load, this will climb up your list pretty fast ;-).
The idea is to
1) Create a link called "latest" to the bundle's latest version directory
2) gzip the data with the parameter : --rsyncable Make rsync-friendly archive
3) install and allow rsync connection
The users have to keep the gz version of the files. If the users are interested in versioning, they can make a copy of the bundle files before rsyncing.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hah, I'm sure it will climb high on the network admin's list; but it's my todo list I'm worried about ;)
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Can I set read/write permissions on files I upload to the ftp server? For instance if there is a file I want to share just with the GATK team.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi there, It's not necessary; anything you upload using the upload-specific login gets put in a directory that only we can take data from. Anybody else can only browse the directory and see filenames, but they cannot download the files. So your data will be protected.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •It appears that I can't do an "ls" on the download server. Viz:
alanmac:~ alan$ ftp gsapubftp-anonymous@ftp.broadinstitute.org
Connected to ftp.broadinstitute.org.
220 ProFTPD 1.3.3g Server (Broad Institute of MIT and Harvard) [69.173.80.251]
331 Anonymous login ok, send your complete email address as your password
Password:
230 Anonymous access granted, restrictions apply
Remote system type is UNIX.
Using binary mode to transfer files.
ftp> ls
421 Service not available, remote server timed out. Connection closed.
257 "/" is the current directory
ftp>
Am I doing something wrong?
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •The server has some restrictions on what you can and can't do in a session which aren't under our control. We recommend you simply use a GUI client to access it and you'll be fine.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi I'm getting the following error. There should be no password right???
wget -r ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/hg19
--16:07:41-- ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/hg19
Resolving ftp.broadinstitute.org... 69.173.80.251
Connecting to ftp.broadinstitute.org|69.173.80.251|:21... connected.
Logging in as gsapubftp-anonymous ...
Login incorrect.
Thanks,
MC
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •As mentioned previously, the ftp server doesn't respond well to shell access, so we recommend you simply use a GUI client to access it. Sorry for the inconvenience.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Thanks for the clarification Geraldine!
Thanks,
MC
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •I did use the gsapubftp-anonymous username but I don't see a bundle directory. However, I found distribution/gsa/gatk_resources.tgz. So I ended up this wget ftp://ftp.broadinstitute.org/distribution/gsa/gatk_resources.tgz It's downloading now, but Is it correct?
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •No, what you want is the bundle directory. It should be at the root of the dir you end up in when you log on.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi, I am using FileZilla, I connected using the username and blank password and started downloading the bundle/2.3/.
I am getting multiple failed transfers (>=43) and the rest successful. The reason for failed transfers is listed as "Incorrect password". Are there some files there that aren't meant to be downloadable or is something wrong?
Edit/Update: I selected the failed transfers and shoved them back into queue, second time around they worked. (╯°□°)╯︵ ┻━┻
- Spam
- Abuse
- Troll
1 • Off Topic Disagree Agree 1Like WTF •Our FTP server is a little temperamental, sorry... (loving the text meme btw!)
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
1 • Off Topic Disagree Agree 1Like WTF •Hi! I can't connect to ftp server. Well.. i can but i lost the connection after the first command. I would like to download the latest version of GATK-Lite, there is another way to do it? Thanks in advance, cheers!
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •You can download the source and compile: https://github.com/broadgsa/gatk
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Thank you Geraldine. I'm compiling using 'ant' and I have configurated build.xml build "public" scripts only.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi, I'm having trouble downloading files; I get "permission denied". e.g. I run
wget --ftp-user=gsapubftp-anonymous ftp://ftp.broadinstitute.org/bundle/2.3/b37/Mills_and_1000G_gold_standard.indels.b37.vcf.gz
And I see
--2013-03-06 17:07:07-- ftp://gsapubftp-anonymous@ftp.broadinstitute.org/bundle/2.3/b37/Mills_and_1000G_gold_standard.indels.b37.vcf.gz => `Mills_and_1000G_gold_standard.indels.b37.vcf.gz' Resolving ftp.broadinstitute.org... 69.173.80.251 Connecting to ftp.broadinstitute.org|69.173.80.251|:21... connected. Logging in as gsapubftp-anonymous ... Logged in! ==> SYST ... done. ==> PWD ... done. ==> TYPE I ... done. ==> CWD /bundle/2.3/b37 ... done. ==> SIZE Mills_and_1000G_gold_standard.indels.b37.vcf.gz ... 19868212 ==> PASV ... done. ==> RETR Mills_and_1000G_gold_standard.indels.b37.vcf.gz ... done. Mills_and_1000G_gold_standard.indels.b37.vcf.gz: Permission denied
Using a graphical client from my desktop, I seem to be able to download files, but with the above command I get 'permission denied'. What I am I doing wrong? I'd like to download the larger files directly to our compute cluster so using the graphical client is not a good option for anything except testing.
Thanks for any help!
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Unfortunately right now you cannot bypass using a GUI client. This is not under our direct control; I will try to look into a better solution with our IT infrastructure folks but I can't guarantee a solution will be forthcoming. Sorry for the inconvenience.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi @Clare
Quick follow-up on your question -- can you confirm that you have write permissions on the destination directory (local) from which you're opening the connection? Just want to rule out a client-side error.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Geraldine,
I could not connect to GSA FTP server. Could you check whether the server is shutdown or not?
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Geraldine
I can't access the ftp download server. I have tried cyberduck, filezilla, chrome browser, terminal. At all of themI get the following error.
Status: Resolving address of ftp.broadinstitute.org Status: Connecting to 69.173.80.251:21... Status: Connection established, waiting for welcome message... Response: 220 ProFTPD 1.3.3g Server (Broad Institute of MIT and Harvard) [69.173.80.251] Command: USER gsapubftp-anonymous Response: 331 Anonymous login ok, send your complete email address as your password Command: PASS ************** Response: 530 Login incorrect. Error: Critical error Error: Could not connect to server
Doesn't matter if I leave password blank as in the instructions on this site or if I put in my email address.
Any help?
Thanks
TS
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi folks,
We are experiencing issues with the FTP server. We're in touch with our IT support and trying to get service resumed as quickly as possible. We're very sorry for any inconvenience this may cause you.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •UPDATE: the FTP issues have been resolved and it is working normally again. Thanks for your patience!
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi,
I am trying to download hg19.fasta and dbsnp137 from the ftp server. After it downloads around 180mb, the connection is closed by sever stating 600sec idle timeout. I am using filezilla. I have tried multiple time but same issue continues. What could be the problem?
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hello,
The bundle directory holds example fasta files as well as hg19 and hg18 files. But what are the 'b' directories? They look like the same files as hg19 and hg18.
Joe White MEEI joseph_white@meei.harvard.edu
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Joe,
The b directories contain the Broad versions of the human genome reference. They're very similar but the chromosome names don't have the "chr" part prepended, and there are a few sequence differences. What's really important is to always use the resources that were generated with the same reference as was used to align your data. Most of our resources derive from b37 so if you haven't aligned your reads yet I'd recommend using b37, at least if you want to take advantage of our resources.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi @ashwinipatil,
Can you please post the full error message that you get?
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Thank you for the prompt reply. I get the following error:
Response: Proxy reply: HTTP/1.0 200 Connection established Response: 150 Opening BINARY mode data connection for ucsc.hg19.fasta.gz (948729977 bytes) Response: 421 Idle timeout (600 seconds): closing control connection Error: Connection closed by server
Response: 227 Entering Passive Mode (69,173,80,251,247,131). Command: REST 170561368 Response: 502 Command REST not allowed by policy. Error: File transfer failed Status: Starting download of /bundle/2.3/hg19/dbsnp_137.hg19.vcf.gz Error: Connection closed by server Error: File transfer failed after transferring 180,294,824 bytes in 900 seconds
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •This looks like a firewall issue, I'll ask our IT department to look into it. You may need to consult your own IT support as well, to ask if they can do anything about this part of the error: "502 Command REST not allowed by policy"
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi @ashwinipatil, can you try setting your FTP client to active mode? In Filezilla you can find this setting in the Preferences. See screenshot attached.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hello Geraldine,
I'm trying to remotely FTP to ftp.broadinstitute.org, via an ssh into a remote machine cluster.
The bundle is to be used on this cluster, and I'd like to save a lot of time on down-then-upload (the cluster is in a different country from where I am) by FTPing directly to BROAD, from the machine I'm ssh'ed into. It would save me days!
Do you think that terminal command "ftp ftp.broadinstitute.org" should work? I get a connection refused. Normal ftp from my own computer (using filezilla) works fine, of course.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi Laurent,
We've had reports that terminal FTP does not work for many users. We are looking into a cloud-based solution to replace the FTP, but for now I'm afraid you'll need to use a GUI such as FileZilla.
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •FYI I just had success using the command line program
wgetlike this:wget ftp://gsapubftp-anonymous:@ftp.broadinstitute.org/bundle/2.3/hg19/dbsnp_137.hg19.vcf.idx.gz
I first used the GUI tool fetch to log in and browse the FTP directory, then right clicked on the file I wanted and selected "copy fetch address" which gave me the above ftp address with the associated username. Works fine with wget which is a very easy-to-install command line tool.
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Hi, I'm trying to upload a bug report but I seem not to be able to transfer the file. I used FileZilla as recommended, and checked it is in active mode. I checked the usernam/password multiple times, but the connection itself is successfully established. However, I get an "access denied" when trying to transfer the file
Status: Resolving address of ftp.broadinstitute.org Status: Connecting to 69.173.80.251:21... Status: Connection established, waiting for welcome message... Response: 220 ProFTPD 1.3.3g Server (Broad Institute of MIT and Harvard) [69.173.80.251] Command: USER gsapubftp Response: 331 Password required for gsapubftp Command: PASS ******** Response: 230 Anonymous access granted, restrictions apply Command: OPTS UTF8 ON Response: 550 Access is denied. Status: Connected Status: Starting upload of /Users/fles/Documents/GATK/recalibrator_startOverStop.tar.gz Command: CWD / Response: 250 CWD command successful Command: PWD Response: 257 "/" is the current directory Status: Retrieving directory listing... Command: TYPE I Response: 200 Type set to I Command: PASV Response: 227 Entering Passive Mode (69,173,80,251,246,127). Command: MLSD Response: 550 Access is denied. Command: SIZE recalibrator_startOverStop.tar.gz Response: 550 Access is denied. Command: PASV Response: 227 Entering Passive Mode (69,173,80,251,239,42). Command: STOR recalibrator_startOverStop.tar.gz Response: 550 Access is denied. Error: Critical file transfer errorAny suggestions? thanks!
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •Sounds like a firewall issue -- have you tried uploading from a different network, eg from home?
Geraldine Van der Auwera, PhD
- Spam
- Abuse
- Troll
0 • Off Topic Disagree Agree Like WTF •