<div dir="ltr">I have a pair of Syslog-NG servers running 3.2.5-3. The hardware specs are - Quad Xeon E5-2680 (32 cores), 32GB RAM, and two 1TB SAS 7200 RPM disks in RAID-1.<div> </div><div style>OS is RHEL6.2 - Kernel 2.6.32-279.5.2. Filesystem is ext3.</div>
<div style><br></div><div style>Global options are set as:</div><div style><div>options {</div><div><span class="" style="white-space:pre">        </span>flush_lines (1000);</div><div><span class="" style="white-space:pre">        </span>time_reopen (10);</div>
<div><span class="" style="white-space:pre">        </span>log_fifo_size (1000);</div><div><span class="" style="white-space:pre">        </span>long_hostnames (off);</div><div><span class="" style="white-space:pre">        </span>use_dns (no);</div>
<div><span class="" style="white-space:pre">        </span>use_fqdn (no);</div><div><span class="" style="white-space:pre">        </span>create_dirs (yes);</div><div><span class="" style="white-space:pre">        </span>keep_hostname (yes);</div>
<div><span class="" style="white-space:pre">        </span>keep_timestamp(yes);</div><div><span class="" style="white-space:pre">        </span>dir_group("syslog");</div><div><span class="" style="white-space:pre">        </span>perm(0640);</div>
<div><span class="" style="white-space:pre">        </span>dir_perm(0750);</div><div><span class="" style="white-space:pre">        </span>group("syslog");</div><div>};</div><div><br></div><div style>I have already set TCP kernel buffers to 128MB max and set disk scheduler to "deadline".</div>
<div style><br></div><div style>But even under light disk IO load, from ~8-25MB, I see "1320811067 packets collapsed in receive queue due to low socket buffer". I had some other processes on the host writing to disk. Stopping them reduced the packet errors but this number still keeps incrementing.</div>
<div style><br></div><div style>To rule out other issues, I temporarily pointed my disk-based destinations to /dev/null and then packet losses/errors stopped. So either Syslog-NG isn't able to write to disk fast enough or there is an underlying OS/hardware issue.</div>
<div style><br></div><div style>Both hosts have the same issue. Any pointers in troubleshooting it will be appreciated.</div><div style><br></div><div style>TIA.</div><div style><br></div><div style><br></div></div></div>