syscall recvmsg() seems to hang periodically

General support questions
sauteeed
Posts: 4
Joined: 2017/02/16 02:41:34

syscall recvmsg() seems to hang periodically

Postby sauteeed » 2017/02/16 03:03:12

I was running a BIND9 DNS server on CentOS 6. I found that sometimes the number of queries per second of the server suddenly became 0, and then this lasts a few seconds, usually over 10 seconds. However, the number of incoming DNS query requests never drops below 500 per second.

I added some debug logs to BIND9 source code, rebuilt and ran again, then I noticed the thread would wait for a few seconds before recvmsg() returns, periodically. While it hangs, the Recv-Q becomes full because new packets keep coming. The version is BIND-9.9.6, but unfortunately when I switch to BIND- 9.9.9-P4, the problem is still there.

I then tried another DNS server written in Golang, I noticed that the problem never came. The query rate keeps stable as it should be.

I think the ReadFromUDP() function of Golang that server used just wraps the syscall recvfrom(), and the recvfrom() there wouldn't block, either. But ReadFromUDP() seems to keep trying recvfrom(), while BIND9 is using epoll. When data's successfully read, ReadFromUDP() will then return. I'm not sure whether there is actually any difference between those two cases.

I'm using CentOS 6.4, and the kernel version is 2.6.32-358-el6.x86_64.

Has anyone ever encountered such kind of problem?

User avatar
TrevorH
Forum Moderator
Posts: 21161
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: syscall recvmsg() seems to hang periodically

Postby TrevorH » 2017/02/16 03:08:59

6.4 is nearly 4 years old. You need to yum update to 6.8 and the latest kernel 2.6.32-642.6.1.el6 along with all the other 2000 packages that were updated in the last 4 years.

We only ship bind 9.8.2
CentOS 5 died in March 2017 - migrate NOW!
Full time Geek, part time moderator. Use the FAQ Luke

sauteeed
Posts: 4
Joined: 2017/02/16 02:41:34

Re: syscall recvmsg() seems to hang periodically

Postby sauteeed » 2017/02/16 03:50:04

Thank you, TrevorH.

I'm going to try the latest CentOS. By the way, is there any known bug corresponding the problem I posted?

User avatar
TrevorH
Forum Moderator
Posts: 21161
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: syscall recvmsg() seems to hang periodically

Postby TrevorH » 2017/02/16 13:29:20

No idea but rpm -q --changelog kernel-2.6.32-642.13.1.el6.x86_64 has the changes for 2.6.32-358 starting on line 23832 so that means there've been around 23,000 changes to the kernel since the one you're running.
CentOS 5 died in March 2017 - migrate NOW!
Full time Geek, part time moderator. Use the FAQ Luke

sauteeed
Posts: 4
Joined: 2017/02/16 02:41:34

Re: syscall recvmsg() seems to hang periodically

Postby sauteeed » 2017/03/28 09:10:04

Hi, TrevorH, after more investigations, I now believe that it's caused by disk IO, not recvmsg().

I made a mistake by trusting the log messages produced by BIND9, and it then led me to a wrong direction. In fact, it was the fflush() call that blocked all the worker threads. I put logs near the recvmsg() call, so it looked like there was something wrong with the syscall.

I then googled a lot about that, and noticed someone said there was a bug regarding JBD2 process that could possibly make what I encountered happen.

Could you please tell me whether there's indeed such a bug that could make the IO performance very low? It should help me a lot to make a conclusion.

Thank you.
Last edited by sauteeed on 2017/03/28 09:21:48, edited 1 time in total.

User avatar
TrevorH
Forum Moderator
Posts: 21161
Joined: 2009/09/24 10:40:56
Location: Brighton, UK

Re: syscall recvmsg() seems to hang periodically

Postby TrevorH » 2017/03/28 09:17:09

No idea, you'll have to check bugs.centos.org and bugzilla.redhat.com for things matching your symptoms but before you attempt that, make sure you are running the latest kernel. CentOS 6.9's kernel is already in the CR repo which would take you to kernel-2.6.32-696.el6.x86_64 - there are over 25,000 lines in the rpm changelog between 2.6.32-358 and the latest so there are a _lot_ of bugs fixed between the two versions.
CentOS 5 died in March 2017 - migrate NOW!
Full time Geek, part time moderator. Use the FAQ Luke

sauteeed
Posts: 4
Joined: 2017/02/16 02:41:34

Re: syscall recvmsg() seems to hang periodically

Postby sauteeed » 2017/03/28 09:24:28

OK, thanks anyway.

Whoever
Posts: 964
Joined: 2013/09/06 03:12:10

Re: syscall recvmsg() seems to hang periodically

Postby Whoever » 2017/03/28 14:39:11

Are you sure that you don't have rate-limiting configured?