Hi,
On 2015-08-18 18:35, Bjoern Franke wrote:
Thanks for the hint, it did not work on the debian machines, but I got the systems running with crashkernel enabled. Now we got the first crash: https://p.rrbone.net/paste/nnNHrIJI#oHfBMOs2
We've seen these crashes on our Goettingen Freifunk gateways, too. There as well, batadv_frag_purge_orig was the smoking gun. However, I didn't report it, because:
- first and foremost, we were using the outdated legacy 2013.4 version
- it was most probably an issue with RCU lists
- and either disabling SMP or using a much more current kernel fixed it.
So I blamed a buggy RCU implementation in older kernels, plus maybe some misbehaviour in the old batman-adv codebase. The crashing kernel was the Debian wheezy one, which is pretty old, I'd say.
-hwh