[maker-devel] NFSLock problem
Fields, Christopher J
cjfields at illinois.edu
Thu Oct 3 12:44:07 MDT 2013
I have a MAKER job running that seems to be stalled on a failed scaffold. It's running via MPI (MAKER v2.28, openMPI 1.6.3), that appears to have worked successfully for the most part. This is a run that only uses transcriptome and protein information in order to get a decent dseat of
The failed scaffold seems to be holding the job from completion. There does seem to be changes, but mainly they are on the NFSLock files:
./.NFSLock.gi_lock.NFSLock
./Zalbi.unplaced.scaf_datastore/2C/0F/KB913038.1/theVoid.KB913038.1
./Zalbi.unplaced.scaf_datastore/2C/0F/KB913038.1/theVoid.KB913038.1/.NFSLock.KB913038%2E1.281.282.junction.blastn.holdover.NFSLock
./Zalbi.unplaced.scaf_datastore/2C/0F/KB913038.1/theVoid.KB913038.1/.NFSLock.KB913038%2E1.282.start.blastn.holdover.NFSLock
./Zalbi.unplaced.scaf_datastore/2C/0F/KB913038.1/theVoid.KB913038.1/.NFSLock.KB913038%2E1.281.end.blastn.holdover.NFSLock
./Zalbi.unplaced.scaf_datastore/2C/0F/KB913038.1/theVoid.KB913038.1/.NFSLock.KB913038%2E1.282.start.blastn.holdover.NFSLock.STACK
./Zalbi.unplaced.scaf_datastore/2C/0F/KB913038.1/theVoid.KB913038.1/.NFSLock.KB913038%2E1.281.end.blastn.holdover.NFSLock.STACK
Everything else seems to have completed.
I have seen a few issues re: NFS locking problems, would this be related? Should I stop the job?
We're running GPFS for our NFS. Here's 'mount':
-system-specific-4.1$ mount
/dev/sda5 on / type ext4 (rw)
proc on /proc type proc (rw)
sysfs on /sys type sysfs (rw)
devpts on /dev/pts type devpts (rw,gid=5,mode=620)
tmpfs on /dev/shm type tmpfs (rw)
/dev/sda1 on /boot type ext4 (rw)
/dev/sdb1 on /export type ext4 (rw)
/dev/sda2 on /var type ext4 (rw)
tmpfs on /var/lib/ganglia/rrds type tmpfs (rw,size=6180842000,gid=99,uid=99)
none on /proc/sys/fs/binfmt_misc type binfmt_misc (rw)
sunrpc on /var/lib/nfs/rpc_pipefs type rpc_pipefs (rw)
nfsd on /proc/fs/nfsd type nfsd (rw)
/dev/IGBHOME0 on /home type gpfs (rw,mtime,dev=IGBHOME0)
128.174.124.79:/shares/group on /archive/group type nfs (rw,sync,hard,intr,retrans=10,timeo=300,rsize=65536,wsize=1048576,vers=3,proto=tcp,mountproto=tcp,addr=128.174.124.79)
128.174.124.79:/shares/CBC on /archive/CBC type nfs (rw,sync,hard,intr,retrans=10,timeo=300,rsize=65536,wsize=1048576,vers=3,proto=tcp,mountproto=tcp,addr=128.174.124.79)
chris
More information about the maker-devel
mailing list