netifd: change RPS/XPS handling to all CPUs and disable by default
author    Alan Swanson <reiver@improbability.net>
Fri, 15 Nov 2019 14:05:35 +0000 (14:05 +0000)
committer Mathias Kresin <dev@kresin.me>
Tue, 3 Mar 2020 21:43:08 +0000 (22:43 +0100)
The current implementation significantly lowers lantiq
performance [1][2] by using RPS with non-IRQ CPUs and XPS
with alternating CPUs.

The previous netifd implementation simply used all CPUs by
default (though this could be configured), and this patch
essentially reverts to that behaviour.

The only document suggesting the use of non-interrupt CPUs is
the Red Hat tuning guide [3], which notes that if the network
interrupt rate is extremely high, excluding the CPU that
handles network interrupts *may* also improve performance.
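
To illustrate the masks involved, here is a sketch of the
arithmetic, assuming a 4-CPU system with the NIC interrupt on
CPU 0 (variable names follow 20-smp-tune):

    NPROCS=4
    irq_cpu=0
    PROC_MASK="$(((1 << NPROCS) - 1))"      # 15 (0xf): CPUs 0-3
    irq_cpu_mask="$((1 << irq_cpu))"        # 1  (0x1): CPU 0
    # Old rps_cpus value, excluding the IRQ CPU:
    echo "$((PROC_MASK & ~irq_cpu_mask))"   # 14 (0xe): CPUs 1-3
    # New rps_cpus/xps_cpus value, all CPUs:
    echo "$PROC_MASK"                       # 15 (0xf): CPUs 0-3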

The original packet steering patches [4] advise that optimal
settings for the CPU mask seem to depend on the architecture
and cache hierarchy, so one size does not fit all. They also
advise that the processing overhead on a lightly loaded
server can cause performance degradation.

Ideally, proper IRQ balancing, either via the irqbalance
daemon or manually, is a better option.
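
For example, IRQ affinity can be adjusted by hand through
procfs; a minimal sketch (the IRQ number 63 and the CPU
choice are hypothetical):

    # Pin the NIC's IRQ to CPU 1 (affinity bitmask 0x2).
    echo 2 > /proc/irq/63/smp_affinity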

The kernel does not enable packet steering by default, so
also disable it in OpenWrt by default. (Though mvebu, with
its hardware scheduling issues [5], might want to enable
packet steering by default.)

Change the undocumented "default_ps" parameter to the clearer
"packet_steering" parameter. The old parameter was only ever
set in target/linux/mediatek/base-files/etc/uci-defaults/99-net-ps
and matched the default.
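
Users who still want packet steering (e.g. on mvebu) can opt
back in via UCI; a sketch, assuming the usual anonymous
globals section in /etc/config/network:

    uci set network.@globals[0].packet_steering=1
    uci commit network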

[1] https://forum.openwrt.org/t/18-06-4-speed-fix-for-bt-homehub-5a
[2] https://openwrt.ebilan.co.uk/viewtopic.php?f=7&t=1105
[3] https://access.redhat.com/documentation/en-us/red_hat_enterprise_linux/6/html/performance_tuning_guide/network-rps
[4] https://marc.info/?l=linux-netdev&m=125792239522685&w=2
[5] https://git.openwrt.org/?p=openwrt/openwrt.git;a=commitdiff;h=2e1f6f1682d3974d8ea52310e460f1bbe470390f

Fixes: #1852
Fixes: #2573
Signed-off-by: Alan Swanson <reiver@improbability.net>
package/network/config/netifd/files/etc/hotplug.d/net/20-smp-tune

index ab9a90418ed4f04731f74abaf3954da7e8f2154a..9d7aaae0ac4f0ae56faf953311c41f0100526bd0 100644
@@ -34,8 +34,8 @@ set_hex_val() {
        echo "$val" > "$file"
 }
 
-default_ps="$(uci get "network.@globals[0].default_ps")"
-[ -n "$default_ps" -a "$default_ps" != 1 ] && exit 0
+packet_steering="$(uci get "network.@globals[0].packet_steering")"
+[ "$packet_steering" != 1 ] && exit 0
 
 exec 512>/var/lock/smp_tune.lock
 flock 512 || exit 1
@@ -53,15 +53,10 @@ for dev in /sys/class/net/*; do
        irq_cpu_mask="$((1 << $irq_cpu))"
 
        for q in ${dev}/queues/rx-*; do
-               set_hex_val "$q/rps_cpus" "$(($PROC_MASK & ~$irq_cpu_mask))"
+               set_hex_val "$q/rps_cpus" "$PROC_MASK"
        done
 
-       ntxq="$(ls -d ${dev}/queues/tx-* | wc -l)"
-
-       idx=$(($irq_cpu + 1))
        for q in ${dev}/queues/tx-*; do
-               set_hex_val "$q/xps_cpus" "$((1 << $idx))"
-               let "idx = idx + 1"
-               [ "$idx" -ge "$NPROCS" ] && idx=0
+               set_hex_val "$q/xps_cpus" "$PROC_MASK"
        done
 done
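
As a follow-up check (not part of this patch), the masks the
script applies can be inspected in sysfs; eth0 and queue 0
stand in for any interface and queue:

    # With packet_steering=1 both should show PROC_MASK;
    # otherwise the kernel default of 0 is left in place.
    cat /sys/class/net/eth0/queues/rx-0/rps_cpus
    cat /sys/class/net/eth0/queues/tx-0/xps_cpus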