Run the `capstart.sh` command to begin rotating parallel tcpdump captures on all nodes with external interfaces.
- The scripts `check-nfs.sh` and `check-nfs-datacapture.sh` must be copied onto the node and stored in `/root` so that cron can run them.
- Configure an SSH key on the node that can reach a specific node on the storage cluster as root. This is required to terminate the server-side tcpdump process by calling `capstop.sh`.
- Make sure the file `/tmp/nfsissuecaptured` does not exist.
- Run `echo 0 > /tmp/nfsmonitor.semaphore` to reset the event notification from previous captures.
- Configure `check-nfs.sh` to run every 10 minutes via cron.
- Configure `check-nfs-datacapture.sh` to run every 2 minutes via cron so that it catches events quickly.
- Launch a new tcpdump capture with `tcpdump -w /tmp/pcaps/$(date '+%Y-%m-%dT%H.%M.%S').$(hostname).em1.pcap -i em1 -s 640 -C 250 -W 100 -Z root`
  - It is recommended to launch this in a tmux session on the individual node, with additional panes running:
    - `tail -F /var/log/messages | grep CAPTURE`
    - `tail -F /var/log/cron -n 40`
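
The reset, cron, and capture steps above can be sketched as a small helper script. This is only an illustration, not one of the scripts named in the checklist. The function names and the `FLAG_FILE`, `SEMAPHORE_FILE`, and `SCRIPT_DIR` override variables are hypothetical. The sketch also assumes that the monitor script is named `check-nfs.sh`, as in its first mention, and that both scripts are run from cron without arguments.

```shell
#!/bin/sh
# Pre-flight sketch for the checklist above. The default paths come from the
# checklist; the variable names are hypothetical and exist only so the
# functions can be tried against scratch files.
FLAG_FILE="${FLAG_FILE:-/tmp/nfsissuecaptured}"
SEMAPHORE_FILE="${SEMAPHORE_FILE:-/tmp/nfsmonitor.semaphore}"
SCRIPT_DIR="${SCRIPT_DIR:-/root}"

# Clear state left over from a previous capture.
reset_capture_state() {
    rm -f "$FLAG_FILE"            # the flag file must not exist
    echo 0 > "$SEMAPHORE_FILE"    # reset the event notification
}

# Print crontab lines for the two intervals named in the checklist.
# Assumption: the scripts take no arguments when run from cron.
print_cron_entries() {
    printf '*/10 * * * * %s/check-nfs.sh\n' "$SCRIPT_DIR"
    printf '*/2 * * * * %s/check-nfs-datacapture.sh\n' "$SCRIPT_DIR"
}

# Print the capture command from the checklist, with its flags annotated:
#   -i em1   capture on interface em1
#   -s 640   keep at most 640 bytes of each packet
#   -C 250   start a new file after roughly 250 million bytes
#   -W 100   keep at most 100 files, overwriting the oldest
#   -Z root  run as root after opening the capture device
print_capture_command() {
    printf '%s\n' "tcpdump -w /tmp/pcaps/$(date '+%Y-%m-%dT%H.%M.%S').$(hostname).em1.pcap -i em1 -s 640 -C 250 -W 100 -Z root"
}
```

After sourcing the file, `reset_capture_state` performs the two reset steps, `print_cron_entries` prints lines that can be reviewed and pasted into `crontab -e`, and `print_capture_command` prints the tcpdump invocation for use inside the tmux session. Printing the cron lines and the capture command, rather than installing or running them, keeps the sketch free of side effects beyond the reset.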