On 08/20/2015 12:08 PM, Hans-Werner Hilse wrote:
Hi,
Am 2015-08-18 18:35, schrieb Bjoern Franke:
Thanks for the hint, it did not work on the debian machines, but I got the systems running with crashkernel enabled. Now we got the first crash: https://p.rrbone.net/paste/nnNHrIJI#oHfBMOs2
We've seen these on our Goettingen Freifunk gateways, too. There, too, batadv_frag_purge_orig was the smoking gun. However, I didn't report it, because:
- first and foremost, we were using the outdated legacy 2013.4 version
- it was most probably an issue with RCU lists
- and either disabling SMP or using a much more current kernel fixed it.
So I blamed a buggy RCU implementation in older kernels, plus maybe some ill behaviour in the old batman-adv codebase. The crashing kernel was the old debian-wheezy one - pretty old, I'd say.
-hwh
This is an independent bug (2013.4 uses a completely different fragmentation implementation) that has been reported in https://github.com/freifunk-gluon/batman-adv-legacy/issues/1 . Please don't bother the upstream BATMAN developers with batman-adv-legacy bugs.
Matthias