Skip to content

Latest commit

 

History

History
40 lines (31 loc) · 2.18 KB

data-transfer.md

File metadata and controls

40 lines (31 loc) · 2.18 KB

Data transfer

sdf-dtn.slac.stanford.edu is a load-balanced DNS name which points to a pool of dedicated data trasnfer nodes. It is open to everyone with a SDF account. Common tools/service like scp/sftp/rsync are available for casual data transfer. For serious large volume data transfer, two common services are available: bbcp and Globus.

bbcp is a high performance multi-stream data transfer tool. In its simplest form, the bbcp command line is similar to that of scp. A simple command using bbcp at SDF looks like this:

bbcp [email protected]:/tmp/myfile ./myfile

You may need to type your password for [email protected], unless you setup password less login to remote.univ.edu (e.g. ssh key).

To archive high performance, bbcp opens a additional TCP port. This sometime won't work if there is a firewall. The -Z option allows you to specify a range of TCP ports that are not blocked by firewall. The -z is another commonly used option to work with firewall. Type bbcp --help or go to the bbcp web page for more info.

Both source and destination must have the bbcp executable in $PATH. bbcp executable can be downloaded by following the link in the bbcp web page. If bbcp is not in $PATH, use the -S or -T option to specify the none standard location. Please carefully read the bbcp web page with regards to these options as they are not as intuitive as you may think. Also, sometimes a cut-n-paste of dash (-) from the web page end up with something that looks like a dash but not a dash. In that case, just replace it by a real dash.

Using the above command line as a example, if you copy bbcp to your home directory at remote.univ.edu, do this:

bbcp -S 'ssh -l %U %H ~/bbcp' [email protected]:/tmp/myfile ./myfile

Here we use option -S because remote.univ.edu is the data source. bbcp will substitute %U and %H by me and remote.univ.edu respectively.

More examples from NERSC.

SDF has a Globus endpoint slac#sdf. This service is available to everyone with a SDF account.