User Tools

Site Tools


admin:debuggingcrash

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Next revision
Previous revision
admin:debuggingcrash [2018/10/03 11:36]
veera created
admin:debuggingcrash [2018/10/03 13:20] (current)
veera [Debugging with a binary with symbols]
Line 1: Line 1:
 ====== Debugging crashes and other problems on the Trisul Probe nodes ====== ====== Debugging crashes and other problems on the Trisul Probe nodes ======
  
-Trisul probes are running on hundreds of different networks and are generally very stable. ​ We also put them though fuzz tests and mangled packets to test the stability. Howeverit is possible that sometimes the Probes stop. This document describes how you can debug such events. ​+Despite our best efforts with testing ​, sometimes the Trisul-Probes ​can stop for a number of reasons. This document describes how you can debug such events. ​
  
 ===== Where to look ===== ===== Where to look =====
  
-The following ​four pieces of information ​will be needed ​+If you find Trisul Probe has crashed, you can collect the following information ​before restarting. ​ If nothing jumps out , you can contact [[https://​www.trisul.org/​support/​|Trisul Support]] and send us the logs. 
-  Trisul ​probe log files + 
-  ​- ​Version number of Trisul Probe +==== Log files ==== 
-  Platform (Ubuntu/CentOS version) + 
-  - dmesg output +The log files on the trisul probe are stored under ''/​usr/​local/​var/​log/​trisul-probe/​domain0/​probe0/​context0'' ​ The latest ''​ns-xxx.log''​ contains the most recent logs before the crash. You can also use the [[https://​www.trisul.org/​docs/​ref/​trisbashrc.html|trisbashrc bash aliases]] to navigate the directories as shown below: ​  
-  - Some information about the frequency ​of crashes+ 
 +<code bash> 
 +# add the helper aliases 
 +source /​usr/​local/​share/​trisul-probe/​trisbashrc ​ 
 + 
 +# to change to probe log dir 
 +cd.l 
 + 
 +# to tailf probe log file 
 +tailf.ns  
 + 
 +# the latest log file 
 +cd.l 
 +ls -lrt ns* 
 + 
 +</code> 
 + 
 +When contacting Trisul Support send us the latest log file ns-xxx.log 
 + 
 +==== dmesg : view messages from the kernel ==== 
 + 
 +Sometimes useful clues can be found in the kernel message log.  For example, when trisul is stopped by an OOM Out of Memory killer. ​ To view dmesg output 
 + 
 +<code bash> 
 + 
 +# to dump the kernel message log  
 +dmesg -T  
 + 
 +# to view trisul messages  
 +dmesg -T | grep trisul 
 + 
 +</​code>​ 
 + 
 +==== Software versions ==== 
 + 
 +To get probe version ​ ''​trisul ​--version''​ 
 + 
 +To get platform version ''​uname -a''​  
 + 
 +To get installed packages on probe 
 + 
 +<code bash> 
 +# on ubuntu 
 +dpkg -l | grep trisul 
 + 
 +# on centos 
 +rpm -qa | grep trisul 
 + 
 +</​code>​ 
 + 
 + 
 +===== Debugging with a binary with symbols ===== 
 + 
 +If we find nothing in the Trisul and Kernel log files, we (Trisul Support) will send you a special binary image with symbols. You can then run it under ''​gdb''​ to help pinpoint the location ​of the crash. This could be a bug 
 + 
 +<note tip>We recommend you run this under the '​screen'​ utility, so you can detach the terminal and log off.  
 +</​note>​ 
 + 
 + 
 +<​code>​ 
 + 
 +# install gdb 
 +apt install gdb 
 + 
 +# run the special trisul binary we send you like this  
 +gdb --args /​home/​mike3/​trisul -nodemon \ 
 +    /​usr/​local/​etc/​trisul-probe/​domain0/​probe0/​context0/​trisulProbeConfig.xml \ 
 +       -mode online_rxring 
 + 
 +(...wait...)  
 + 
 +# after it crashes ​- type bt on the gdb prompt 
 + 
 +gdb:  bt 
 + 
 +..output here.. send us this output 
 + 
 +</​code>​
  
  
admin/debuggingcrash.1538566571.txt.gz · Last modified: 2018/10/03 11:36 by veera