WIth reference to the conn.log spec base/protocols/conn/main.zeek — Book of Zeek (git/master)
orig_bytes: The number of payload bytes the originator sent. For TCP this is taken from sequence numbers and might be inaccurate (e.g., due to large connections)
resp_bytes: The number of payload bytes the responder sent. See orig_bytes
Are these values inaccurate to the point that they should not be relied upon? Here is an example of a common conn.log entry for TCP traffic:
orig_bytes: 2,145,967,013
resp_bytes: 81,091,225
duration: 15.638
orig_ip_bytes: 104
resp_ip_bytes: 340
The orig_bytes and resp_bytes values appear ‘impossible’ due to the amount of bytes, the short duration, and in relation to the respective ip_bytes. Such entries are really messing with visualizations and analysis using orig_bytes and resp_bytes. Are others relying on orig_ip_bytes and resp_ip_bytes instead of orig_bytes and resp_bytes? Should orig_bytes ever be larger than orig_ip_bytes?
We are running Zeek 5.0.3 on a RPM/RedHat-based multi-node cluster with pf_ring, Myricom 10Gb NICs, OpenSearch, and Corelight’s zeek2es.
Any comments on the experiences of others with these records would be appreciated.
Ryan