/openbmc/linux/include/net/ |
H A D | dst_ops.h | diff fc66f95c68b6d4535a0ea2ea15d5cf626e310956 Fri Oct 08 01:37:34 CDT 2010 Eric Dumazet <eric.dumazet@gmail.com> net dst: use a percpu_counter to track entries
struct dst_ops tracks number of allocated dst in an atomic_t field, subject to high cache line contention in stress workload.
Switch to a percpu_counter, to reduce number of time we need to dirty a central location. Place it on a separate cache line to avoid dirtying read only fields.
Stress test :
(Sending 160.000.000 UDP frames, IP route cache disabled, dual E5540 @2.53GHz, 32bit kernel, FIB_TRIE, SLUB/NUMA)
Before:
real 0m51.179s user 0m15.329s sys 10m15.942s
After:
real 0m45.570s user 0m15.525s sys 9m56.669s
With a small reordering of struct neighbour fields, subject of a following patch, (to separate refcnt from other read mostly fields)
real 0m41.841s user 0m15.261s sys 8m45.949s
Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
|
/openbmc/linux/net/ipv4/ |
H A D | xfrm4_policy.c | diff fc66f95c68b6d4535a0ea2ea15d5cf626e310956 Fri Oct 08 01:37:34 CDT 2010 Eric Dumazet <eric.dumazet@gmail.com> net dst: use a percpu_counter to track entries
struct dst_ops tracks number of allocated dst in an atomic_t field, subject to high cache line contention in stress workload.
Switch to a percpu_counter, to reduce number of time we need to dirty a central location. Place it on a separate cache line to avoid dirtying read only fields.
Stress test :
(Sending 160.000.000 UDP frames, IP route cache disabled, dual E5540 @2.53GHz, 32bit kernel, FIB_TRIE, SLUB/NUMA)
Before:
real 0m51.179s user 0m15.329s sys 10m15.942s
After:
real 0m45.570s user 0m15.525s sys 9m56.669s
With a small reordering of struct neighbour fields, subject of a following patch, (to separate refcnt from other read mostly fields)
real 0m41.841s user 0m15.261s sys 8m45.949s
Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
|
H A D | route.c | diff fc66f95c68b6d4535a0ea2ea15d5cf626e310956 Fri Oct 08 01:37:34 CDT 2010 Eric Dumazet <eric.dumazet@gmail.com> net dst: use a percpu_counter to track entries
struct dst_ops tracks number of allocated dst in an atomic_t field, subject to high cache line contention in stress workload.
Switch to a percpu_counter, to reduce number of time we need to dirty a central location. Place it on a separate cache line to avoid dirtying read only fields.
Stress test :
(Sending 160.000.000 UDP frames, IP route cache disabled, dual E5540 @2.53GHz, 32bit kernel, FIB_TRIE, SLUB/NUMA)
Before:
real 0m51.179s user 0m15.329s sys 10m15.942s
After:
real 0m45.570s user 0m15.525s sys 9m56.669s
With a small reordering of struct neighbour fields, subject of a following patch, (to separate refcnt from other read mostly fields)
real 0m41.841s user 0m15.261s sys 8m45.949s
Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
|
/openbmc/linux/net/core/ |
H A D | dst.c | diff fc66f95c68b6d4535a0ea2ea15d5cf626e310956 Fri Oct 08 01:37:34 CDT 2010 Eric Dumazet <eric.dumazet@gmail.com> net dst: use a percpu_counter to track entries
struct dst_ops tracks number of allocated dst in an atomic_t field, subject to high cache line contention in stress workload.
Switch to a percpu_counter, to reduce number of time we need to dirty a central location. Place it on a separate cache line to avoid dirtying read only fields.
Stress test :
(Sending 160.000.000 UDP frames, IP route cache disabled, dual E5540 @2.53GHz, 32bit kernel, FIB_TRIE, SLUB/NUMA)
Before:
real 0m51.179s user 0m15.329s sys 10m15.942s
After:
real 0m45.570s user 0m15.525s sys 9m56.669s
With a small reordering of struct neighbour fields, subject of a following patch, (to separate refcnt from other read mostly fields)
real 0m41.841s user 0m15.261s sys 8m45.949s
Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
|
/openbmc/linux/net/ipv6/ |
H A D | xfrm6_policy.c | diff fc66f95c68b6d4535a0ea2ea15d5cf626e310956 Fri Oct 08 01:37:34 CDT 2010 Eric Dumazet <eric.dumazet@gmail.com> net dst: use a percpu_counter to track entries
struct dst_ops tracks number of allocated dst in an atomic_t field, subject to high cache line contention in stress workload.
Switch to a percpu_counter, to reduce number of time we need to dirty a central location. Place it on a separate cache line to avoid dirtying read only fields.
Stress test :
(Sending 160.000.000 UDP frames, IP route cache disabled, dual E5540 @2.53GHz, 32bit kernel, FIB_TRIE, SLUB/NUMA)
Before:
real 0m51.179s user 0m15.329s sys 10m15.942s
After:
real 0m45.570s user 0m15.525s sys 9m56.669s
With a small reordering of struct neighbour fields, subject of a following patch, (to separate refcnt from other read mostly fields)
real 0m41.841s user 0m15.261s sys 8m45.949s
Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
|
H A D | route.c | diff fc66f95c68b6d4535a0ea2ea15d5cf626e310956 Fri Oct 08 01:37:34 CDT 2010 Eric Dumazet <eric.dumazet@gmail.com> net dst: use a percpu_counter to track entries
struct dst_ops tracks number of allocated dst in an atomic_t field, subject to high cache line contention in stress workload.
Switch to a percpu_counter, to reduce number of time we need to dirty a central location. Place it on a separate cache line to avoid dirtying read only fields.
Stress test :
(Sending 160.000.000 UDP frames, IP route cache disabled, dual E5540 @2.53GHz, 32bit kernel, FIB_TRIE, SLUB/NUMA)
Before:
real 0m51.179s user 0m15.329s sys 10m15.942s
After:
real 0m45.570s user 0m15.525s sys 9m56.669s
With a small reordering of struct neighbour fields, subject of a following patch, (to separate refcnt from other read mostly fields)
real 0m41.841s user 0m15.261s sys 8m45.949s
Signed-off-by: Eric Dumazet <eric.dumazet@gmail.com> Signed-off-by: David S. Miller <davem@davemloft.net>
|