Repository : ssh://git@open-mesh.org/doc
On branches: backup-redmine/2017-07-13,master
commit ba8241a8d4b4fac8c4bf396a7ba235e1d1eabad7
Author: Marek Lindner <mareklindner@neomailbox.ch>
Date:   Tue Mar 24 16:55:01 2009 +0000
doc: open-mesh/The-olsr-story
open-mesh/The-olsr-story.textile | 267 +++++++++++++++++++++++++++++++++++++++
1 file changed, 267 insertions(+)
diff --git a/open-mesh/The-olsr-story.textile b/open-mesh/The-olsr-story.textile
new file mode 100644
index 00000000..77e3a6b9
--- /dev/null
+++ b/open-mesh/The-olsr-story.textile
@@ -0,0 +1,267 @@

The OLSR.ORG story

Proactive protocols (link state routing protocols) generate a lot of overhead because they have to keep topology information and routing tables in sync amongst all, or at least amongst adjacent, nodes. If the protocol does not manage to keep the routing tables synced, the payload is likely to spin in routing loops until the !TimeToLive (TTL) expires. Apart from the high traffic overhead and CPU load, this is the biggest issue for link state routing protocols. We were actively involved in the evolution of olsrd from olsr.org - actually, we were the people who made it functional. RFC3626 - the initial IETF draft of OLSR - does not work in real life. If you want to find out what its developers intended it to be and how it should work, I suggest reading RFC3626 after you have seen Andreas Tønnesen's presentation about RFC3626 on the OLSR.ORG website.

We heavily modified olsr over time. We disabled almost everything that the initial designers of OLSR thought was smart and replaced it with the LQ/ETX mechanism and the Fish Eye mechanism to update topology information.

What we did to improve olsr (in historical order):

* Test OLSR according to RFC3626 at the Wizards of OS III conference in 2004 - a mesh cloud with 25 nodes

Results:
Routing tables take a long time to build and no time to break down.
Routes flap.
Routing loops.
No throughput.
The gateway switches all the time - so stateful connections to the Internet break down all the time.

Conclusion:
The hysteresis mechanism frequently kicks Multi Point Relays (MPRs) out of the routing table --> the infrastructure to broadcast topology information breaks down all the time and MPRs have to be negotiated again...
Multipoint relay selection selects nodes far away to keep the number of necessary Multi Point Relays low --> links to MPRs are weak, so hysteresis kicks them out of the routing table more often than not.
Multipoint relay selection reduces protocol overhead and prevents topology information from being in sync --> routing loops.
Routes are unstable --> no throughput.
Routes selected on minimum hop count maximise packet loss --> no throughput.
Routing loops --> no throughput.
Dynamic gateway selection --> stateful connections get interrupted when a different gateway is selected.

What we did:
Disable hysteresis.
Disable MPRs - all nodes forward topology information.

Now almost everything that was meant to optimize Link State Routing was disabled - a simple proactive link state routing protocol with support for multiple interfaces was all that was left. We started to deploy OLSR in the Freifunk mesh in Berlin - rather, we should have named it LSR back then. But since the implementation came from olsr.org and everything could be switched on and off in the configuration file, we didn't think about starting a new project and renaming it. This later became a source of confusion and disappointment for all the people who tried olsr.org and had no idea what was going on in Berlin. If you use the standard configuration file that is shipped with olsr.org, olsrd will still behave according to RFC3626.
So if you want to see how miserably RFC3626 works - try it with the default configuration file.

* Deployment of OLSR (with 'optimizations' removed) in the Berlin Freifunk mesh cloud - 2004

Results:
Works much better than RFC3626, but still hardly usable.
Throughput very low and unstable.
The routing table doesn't break down anymore.
Dynamic gateway selection --> stateful connections get interrupted when a different gateway is selected.

Conclusion:
We knew that routes based on minimum hop count would likely have very low throughput.
Dynamic gateway selection is the tradeoff of automatic gateway selection by the protocol.

I knew from my first experience with Mobilemesh (another link state routing protocol that we tried at the very beginning of the Freifunk mesh) that minimum hop count sucks completely as an algorithm to select routes. So I started to think about routes that are chosen based on metrics measuring the quality/throughput of links. I decided to use a metric based on packet loss and found the idea of ETX (Expected Transmission Count) in a paper written at MIT. I didn't like the way they suggested implementing it (sending extra UDP packets), so I developed the idea of ETX/LQ together with Thomas Lopatic, who implemented the new ideas in olsrd. Rather than sending extra UDP packets, we could just keep track of missed or successfully received Hello packets, which are frequently broadcast by the LSR mechanism anyway. And we could send the information about the successfully received packets within the Hello messages - so with every "Hello" broadcast, neighbors immediately learn how their neighbor rates their own transmissions.
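The ETX/LQ idea boils down to a simple formula: if LQ is the fraction of a neighbour's Hellos we receive, and NLQ is the fraction of our Hellos the neighbour receives (reported back inside its own Hellos), then the expected number of transmissions on that link is 1 / (LQ x NLQ). A minimal sketch in Python - illustrative only, not olsrd source code, and the helper names are made up:

```python
# Illustrative sketch of the LQ/ETX metric - not olsrd source code.
# LQ is measured by counting the neighbour's Hello packets we receive;
# NLQ is what the neighbour reports back about our Hellos.

def link_etx(lq: float, nlq: float) -> float:
    """Expected Transmission Count of a single link.

    lq  - fraction of the neighbour's Hellos we received (0.0 .. 1.0)
    nlq - fraction of our Hellos the neighbour received, learned from
          the LQ information it embeds in its own Hello messages
    """
    if lq <= 0.0 or nlq <= 0.0:
        return float("inf")  # link unusable in at least one direction
    # The chance that a transmission succeeds in both directions is
    # lq * nlq, so the expected number of attempts is its inverse.
    return 1.0 / (lq * nlq)

def route_etx(links):
    """Route metric: the sum of the per-link ETX values; the route
    with the smallest sum is preferred."""
    return sum(link_etx(lq, nlq) for lq, nlq in links)

# A lossless link costs 1 expected transmission; a link losing half
# its packets in each direction costs 4.
print(link_etx(1.0, 1.0), link_etx(0.5, 0.5))
```

The shortest-path calculation then runs on these link costs instead of plain hop counts, so a three-hop route over clean links can beat a lossy one-hop route.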
This was a lot of work in the code of olsrd and Thomas did a cumbersome but really great job. I had the feeling that this would be a major milestone on the way to a well-working protocol. It was released as olsr-0.4.8 - we had a nice party and a big barrel of beer at the c-base to celebrate the moment :) There was one tradeoff, however: we had to break compatibility with RFC3626. But since RFC3626 wasn't usable in real life, we didn't bother much.

* Deployment of olsr-0.4.8 in the Freifunk mesh with the ETX/LQ mechanism

Results:
Probably bugs in the huge amount of new program code.
Good routing decisions on wireless links operating at the same speed, as long as the network is idle.
Throughput improved - but throughput is interrupted by routing loops as soon as heavy network load is introduced.
Payload runs for a while at high speed, then the traffic is interrupted and comes back after a while at slow speed - caused by routing loops.
Dynamic gateway selection --> stateful connections get interrupted when a different gateway is selected.

Conclusion: This was a major improvement, but...

Payload traffic in the mesh causes interference and alters LQ/ETX values - interference causes lost LQ messages, so the LQ/ETX values in topology messages deteriorate when payload traffic is introduced. If the protocol fails to update the link state information in time, the routing tables are not in sync - thus causing routing loops. Freifunk OLSR networks in other cities that had relatively low payload traffic compared to the capacity of their wireless links were quite happy with 0.4.8. But networks where links got saturated were still unstable. Now it became even more clear how stupid the idea of Multipoint Relays was.
Traffic causes interference and lost topology update messages. So the link quality values deteriorate compared to an idle mesh - and the information that tells the nodes in the mesh about the topology changes is likely to get lost on its way. MPRs reduce the redundancy that is desperately needed to keep routing tables in sync. And - even worse - the information about who is whose MPR is yet another piece of information that has to be synced. Another source of failure.

So we had to find a way to make sure that information about topology changes is updated in time to avoid routing loops. A perfect routing table that only works as long as the network is idle is quite useless... One viable solution in a small mesh would be to send topology control messages (TC messages) more often than Hellos - but we already had a mesh with more than 100 nodes, so the traffic caused by redundant TC messages would suffocate the network by introducing massive overhead. Then we had the idea of sending TC messages with different TTL (Time To Live) values. My hypothesis was that routing loops occur amongst adjacent nodes - so we would only have to update topology changes quickly and redundantly amongst adjacent nodes. We had to design an algorithm that would make sure that adjacent nodes have correct topology information - but the problem is that it seemingly would not work without massive overhead. The idea we came up with is to send TC messages to adjacent nodes very often, i.e. to the nodes that are likely to be involved in routing loops, without flooding the whole mesh with each sent TC message. We called it the Link Quality Fish Eye mechanism.

OLSR packets carry a Time To Live (TTL) that specifies the maximum number of hops that the packet is allowed to travel in the mesh.
The Link Quality Fish Eye mechanism generates TC messages not only with the default TTL of 255, but with different TTLs, namely 1, 2, 3, and 255, restricting the distribution of TC messages to nodes 1, 2, 3, and 255 hops away. A TC message with a TTL of 1 will just travel to all one-hop neighbours, a message with a TTL of 2 will in addition reach all two-hop neighbours, etc.

TC messages with small TTLs are sent more frequently than TC messages with higher TTLs, such that immediate neighbours are more up to date with respect to our links than the rest of the mesh. The following sequence of TTL values is used by olsrd:

255 3 2 1 2 1 1 3 2 1 2 1 1

Hence, a TC interval of 0.5 seconds leads to the following TC broadcast scheme:
* Out of 13 TC messages, all 13 are seen by one-hop neighbours (TTL 1, 2, 3, or 255), i.e. a one-hop neighbour sees a TC message every 0.5 seconds.
* Two-hop neighbours (TTL 2, 3, or 255) see 7 out of 13 TC messages, i.e. about one message per 0.9 seconds.
* Three-hop neighbours (TTL 3 or 255) see 3 out of 13 TC messages, i.e. about one message per 2.2 seconds.
* All other nodes in the mesh (TTL 255) see 1 out of 13 TC messages, i.e. one message per 6.5 seconds.

The sequence of TTL values is hard-coded in lq_packet.c and can easily be altered for further experiments. The implementation of the Link Quality Fish Eye mechanism took Thomas only a few minutes - and it was the second major improvement.

Thomas also introduced a new switch, called !LinkQualityDijkstraLimit. The slow CPUs of embedded routers have serious problems recalculating the routing tables in a mesh cloud with more than 100 nodes.
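The per-hop figures for the Fish Eye TTL schedule above follow directly from the hard-coded sequence: a node N hops away sees exactly the messages whose TTL is at least N. A short illustrative script (not olsrd code) that reproduces the counts:

```python
# TTL sequence as hard-coded in olsrd's lq_packet.c (quoted above).
TTL_SEQUENCE = [255, 3, 2, 1, 2, 1, 1, 3, 2, 1, 2, 1, 1]
TC_INTERVAL = 0.5  # seconds between two TC broadcasts

def messages_seen(hop_distance):
    """How many of the 13 TC messages reach a node this many hops away."""
    return sum(1 for ttl in TTL_SEQUENCE if ttl >= hop_distance)

def seconds_per_message(hop_distance):
    """Average time between TC messages arriving at that distance."""
    return len(TTL_SEQUENCE) * TC_INTERVAL / messages_seen(hop_distance)

for hops in (1, 2, 3, 4):
    print(hops, messages_seen(hops), round(seconds_per_message(hops), 1))
```

With the default TC interval of 0.5 seconds this yields 13, 7, 3, and 1 messages per cycle - i.e. the 0.5, 0.9, 2.2, and 6.5 second intervals listed above.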
Every incoming TC message would trigger another recalculation of the Dijkstra table - this would be far too often. !LinkQualityDijkstraLimit allows setting an interval for recalculating the Dijkstra table.

* Deployment of olsr-0.4.10

Results:
Now it is really working and usable :)
It's still not absolutely loop-free under heavy payload (sometimes it loops for 3-10 seconds).
Multihop links with 10 hops work and are stable as long as the wireless links work.
!LinkQualityDijkstraLimit allows running olsr even on a relatively slow CPU in a big mesh cloud - but the routing table becomes very, very static.
Gateway switching is still a constant annoyance if a mesh has more than one Internet gateway.

Conclusions:
Apart from the problems with gateway switching, it is now a well-behaved routing protocol.

But still... Thomas and I agreed that we could cope with the increasing size of the Freifunk networks only by making the protocol more and more static. So the Freifunk mesh protocol wouldn't exactly be suitable for mobile operation. What disenchanted me in particular was that we couldn't get entirely rid of routing loops. Link State Routing has significant design flaws. Why does every node calculate full routing paths to every node - if all it can do is decide which direct neighbor it chooses as the next hop? If a node has only a single neighbor to talk to a mesh of 500 nodes, it will calculate each and every route - but all it can do is select its only single-hop neighbor as the gateway to every destination... So all topology/route calculation is superfluous in this case.
What's more: what a node calculates based on stale information has nothing to do with the path a packet actually takes on a routing path of considerable length. This is actually a blessing for OLSR - because nodes closer to the destination have better knowledge of the topology. I have serious doubts that adding source routing to olsrd would be an improvement...

Synchronized link state information is impossible to achieve in a wireless network. No matter what you do, the topology keeps on changing while you are trying to sync every node's view of it, particularly when you are utilizing broadcast messages in an unreliable medium. (And unicast messages are naturally a no-no for a protocol that generates such massive protocol overhead.) Why let every node gather and calculate unnecessary information - the topology of the whole mesh - if all a node has to decide is which single-hop neighbor to choose as the next hop for each destination? Besides accelerated global warming, you gain routing loops because of the superfluous effort. Link State Routing thinks too much and is far too complex for its own good. Why do all this? We decided to come up with something better and simpler, something that doesn't loop. And a mechanism that lets the gateway client select the gateway, to get rid of the unbearable gateway-switching problem. Thomas had the idea for a name: B.A.T.M.A.N - Better Approach To Mobile Ad-Hoc Networking.

We both lost interest in OLSR development in spring 2006, and Thomas implemented a quick and dirty version of Batman-I in one night to see if the new algorithm was viable. It is - but that's a whole different story...

Written by Elektra and published at www.open-mesh.org

Copyleft:
(CC) Creative Commons Attribution-Noncommercial-Share Alike 3.0