I managed to get to a failure state with the laptop on 5GHz and the phone on 2.4GHz. This allowed me to test which interfaces the ICMP packets were going through, and I could definitely detect the requests (no responses) on both wlan0
:
root@turris:~# tcpdump -i wlan0 -vv icmp
tcpdump: WARNING: wlan0: no IPv4 address assigned
tcpdump: listening on wlan0, link-type EN10MB (Ethernet), capture size 65535 bytes
19:39:51.314195 IP (tos 0x0, ttl 64, id 58277, offset 0, flags [none], proto ICMP (1), length 84)
192.168.1.160 > 192.168.1.206: ICMP echo request, id 59154, seq 41, length 64
19:39:52.314166 IP (tos 0x0, ttl 64, id 27457, offset 0, flags [none], proto ICMP (1), length 84)
192.168.1.160 > 192.168.1.206: ICMP echo request, id 59154, seq 42, length 64
and wlan1
:
root@turris:~# tcpdump -i wlan1 -vv icmp
tcpdump: WARNING: wlan1: no IPv4 address assigned
tcpdump: listening on wlan1, link-type EN10MB (Ethernet), capture size 65535 bytes
19:40:05.341652 IP (tos 0x0, ttl 64, id 41383, offset 0, flags [none], proto ICMP (1), length 84)
192.168.1.160 > 192.168.1.206: ICMP echo request, id 59154, seq 55, length 64
19:40:06.346173 IP (tos 0x0, ttl 64, id 54482, offset 0, flags [none], proto ICMP (1), length 84)
192.168.1.160 > 192.168.1.206: ICMP echo request, id 59154, seq 56, length 64
Interestingly enough, after a while of keeping the ping command running, responses just started coming through. As soon as I switched the phone to the 5GHz network, though, it started failing again. And then after a while it started to work again, with occasional packet loss.
When things work, the ping times are high and irregular, going from < 100ms to over 1000ms, whereas pinging the router directly (192.168.1.1) is always on the order of a few ms.
So, to summarise all of my testing so far:
- The problem affects both wireless bands
- The problem is intermittent, and only starts happening after the router has been running for a while
- It makes ping times high and irregular between wireless clients
- It sometimes causes total packet loss between wireless clients, in that packets simply do not show up on the router (
wlan0
or wlan1
) and the ping command reports “no route to host”
- It does not affect reachability or ping times between wireless clients and router
- It does not affect reachability or ping times between wireless clients and wired clients
- It does not affect reachability or ping times between wireless clients and WAN
All of this screams “bug” at me, but I’m at the end of my debugging skills to track things down. I suspect a driver or some firmware somewhere, but the kernel log is clean.
Could someone from CZ.NIC please comment on this issue? I’m happy to test as much as needed, but I need some guidance of where to look.