git.stricted.de - GitHub/LineageOS/android_kernel_motorola

author	Eric Dumazet <edumazet@google.com>
	Mon, 6 Jan 2014 17:54:31 +0000 (09:54 -0800)
committer	David S. Miller <davem@davemloft.net>
	Mon, 6 Jan 2014 21:37:09 +0000 (16:37 -0500)
commit	8f646c922d55047ecd6c65ada49ead88ed0db61e
tree	a8298774a1675f87a3ca71725bc01d20b0e1a279	tree \| snapshot (tar.gz zip)
parent	996b175e39ed42ec2aa0c63b4a03cc500aa6269f	commit \| diff

vxlan: keep original skb ownership

Sathya Perla posted a patch trying to address following problem :

<quote>
The vxlan driver sets itself as the socket owner for all the TX flows
it encapsulates (using vxlan_set_owner()) and assigns it's own skb
destructor. This causes all tunneled traffic to land up on only one TXQ
as all encapsulated skbs refer to the vxlan socket and not the original
socket. Also, the vxlan skb destructor breaks some functionality for
tunneled traffic like wmem accounting and as TCP small queues and
FQ/pacing packet scheduler.
</quote>

I reworked Sathya patch and added some explanations.

vxlan_xmit() can avoid one skb_clone()/dev_kfree_skb() pair
and gain better drop monitor accuracy, by calling kfree_skb() when
appropriate.

The UDP socket used by vxlan to perform encapsulation of xmit packets
do not need to be alive while packets leave vxlan code. Its better
to keep original socket ownership to get proper feedback from qdisc and
NIC layers.

We use skb->sk to

A) control amount of bytes/packets queued on behalf of a socket, but
prior vxlan code did the skb->sk transfert without any limit/control
on vxlan socket sk_sndbuf.

B) security purposes (as selinux) or netfilter uses, and I do not think
anything is prepared to handle vxlan stacked case in this area.

By not changing ownership, vxlan tunnels behave like other tunnels.
As Stephen mentioned, we might do the same change in L2TP.

Reported-by: Sathya Perla <sathya.perla@emulex.com>
Signed-off-by: Eric Dumazet <edumazet@google.com>
Cc: Stephen Hemminger <stephen@networkplumber.org>
Signed-off-by: David S. Miller <davem@davemloft.net>