Troubleshooting dropped TCP connections
We're trying to figure out why connections to this server are dropping. Looking at the capture it seems like both sides go silent at one point then 2 hours later the client tries to keep the connection alive and the server resets it. Looking at the capture, it would seem that either the server should realize it didn't get an acknowledgement for missing segments and resend them, and the client should continue sending duplicate acks until the server resends it. The capture I have was done on the client so maybe the server is retrying but they're MIA. Though there's other traffic on the connection that indicates data is still passed. When traffic from the server stops, is it possible it hit the limit of the congestion window? Any insight to the problem is appreciated.
Comments
We much prefer to analyze packets instead of pixels. Are you able to share the capture file on a public file share? Have a look at Tracewrangler if you need to anonymize the file.
If possible, a trace on both the client and server side would help the analysis.
Interesting little tool. Here's the anonymized packets in that stream. http://streaming2.thedavidcorrigan.co...
We're working on getting a capture from both sides. Right now we have the server owners, and the network team trying to figure this out and I've been bouncing ideas off both of them while we all try and figure it out. It's infrequent enough to be difficult to pin down but frequent enough to annoy a lot of people running big jobs.