Flush pending writes before reading from buffer #12
Force-pushed from 221a57b to db27c24.
Replace the hand-rolled SocketIO wrapper and `ZMTP.read_exact` with `IO::Stream::Buffered` from the io-stream gem. This brings read-ahead buffering (fewer syscalls for frame parsing), automatic TCP_NODELAY, and `read_exactly` for exact-byte reads.

Deleted:
- `TCP::SocketIO` class
- `ZMTP.read_exact` / `Mechanism` module
- `String#rsplit_host_port` monkey-patch
- TCP keepalive options (`tcp_keepalive_*`)

TCP throughput improved 20-28% from read-ahead buffering. Uses `minimum_write_size: 0` to work around socketry/io-stream#12.
Force-pushed from db27c24 to 308b84c.
When read-ahead pulls more data than requested, subsequent reads can complete from the buffer without calling fill_read_buffer. This skips the flush call inside fill_read_buffer, leaving pending writes unsent. In bidirectional protocols (e.g. ZMTP handshake), this causes deadlock: fiber A's write sits in the buffer while fiber B blocks waiting for it. Move the flush to the top of #read so it always runs, regardless of whether the read buffer already has enough data.
Force-pushed from 308b84c to d17a0c4.
Pull request overview
This PR fixes a deadlock in bidirectional/Async usage by ensuring buffered writes are flushed even when a `read(size)` can be satisfied entirely from the existing read-ahead buffer.

Changes:
- Update `IO::Stream::Readable#read` to flush pending writes even when the read buffer already contains enough data to satisfy the request.
- Add an Async regression test using `Socket.pair` to cover the "read-ahead leaves extra bytes in buffer" scenario.
Reviewed changes
Copilot reviewed 3 out of 3 changed files in this pull request and generated 4 comments.
| File | Description |
|---|---|
| `lib/io/stream/readable.rb` | Flushes pending writes on buffer-hit paths in `#read(size)` to prevent write starvation/deadlocks. |
| `test/io/stream/buffered.rb` | Adds a regression test using `Sus::Fixtures::Async::ReactorContext` to reproduce/guard the deadlock scenario. |
```ruby
# Don't read less than @minimum_read_size to avoid lots of small reads:
fill_read_buffer(read_size > @minimum_read_size ? read_size : @minimum_read_size)
if @finished or @read_buffer.bytesize >= size
```
`read(size)` now calls flush when `@finished` is true. That means a buffered read after EOF can raise (e.g. `EPIPE`/`IOError` from flushing pending writes) and prevent returning data that's already in `@read_buffer`, which is a behavior change from before. If the intent is only to fix the "buffer satisfied `size` so `fill_read_buffer` is skipped" case, consider only flushing when `@read_buffer.bytesize >= size` (or otherwise ensure EOF-buffered reads can still succeed without being interrupted by write-side errors).
```diff
-if @finished or @read_buffer.bytesize >= size
+if @read_buffer.bytesize >= size
```
```ruby
while true
	# Compute the amount of data we need to read from the underlying stream:
	read_size = size - @read_buffer.bytesize
	
	# Don't read less than @minimum_read_size to avoid lots of small reads:
	fill_read_buffer(read_size > @minimum_read_size ? read_size : @minimum_read_size)
	
	break if @finished or @read_buffer.bytesize >= size
end
```
This addresses the flush-skipped path for `#read`, but other APIs (e.g. `read_partial`, `peek`, `gets`/`read_until`) can also return purely from `@read_buffer` without calling `fill_read_buffer` (which is where flushing currently happens). That can leave the same "pending writes never flushed" hazard for those methods. Consider centralizing "flush before serving data from the read buffer" in a shared helper (or applying the same flush-on-buffer-hit behavior to the other read entry points) to keep semantics consistent.
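One way to sketch the centralization this comment suggests (illustrative only; `ToyBufferedStream` and `flush_before_read` are hypothetical names, not io-stream's actual code): every read-side entry point calls one shared helper before consulting the buffer, so a pure buffer hit can never skip the flush.

```ruby
require "socket"

# Hypothetical sketch of "flush before serving data from the read buffer"
# centralized in one private helper that every read entry point calls.
class ToyBufferedStream
	def initialize(io)
		@io = io
		@read_buffer = +""
		@write_buffer = +""
	end

	def write(data)
		@write_buffer << data
	end

	def flush
		return if @write_buffer.empty?
		@io.write(@write_buffer)
		@write_buffer.clear
	end

	def read(size)
		flush_before_read
		fill_read_buffer while @read_buffer.bytesize < size
		@read_buffer.slice!(0, size)
	end

	def peek(size)
		flush_before_read
		fill_read_buffer while @read_buffer.bytesize < size
		@read_buffer[0, size]
	end

	private

	# Shared helper: both entry points above call this before touching
	# @read_buffer, so pending writes are sent even on a pure buffer hit.
	def flush_before_read
		flush
	end

	def fill_read_buffer
		@read_buffer << @io.readpartial(4096)
	end
end
```

With this shape, adding a new read-side method cannot reintroduce the hazard as long as it goes through `flush_before_read` first.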
```ruby
task_a.wait
task_b.wait
```
As written, a regression will manifest as a deadlock and this example will hang indefinitely (waiting on task_a/task_b) rather than failing fast. Please wrap the waits in a bounded timeout (using the Async/Sus reactor context utilities) so CI gets a deterministic failure instead of a stuck test run.
```diff
-task_a.wait
-task_b.wait
+reactor.with_timeout(1) do
+	task_a.wait
+	task_b.wait
+end
```
```ruby
# flush. This test verifies that pending writes are still flushed.
task_a = reactor.async do
	client.write("A1")
	data = client.read_exactly(2)
```
`data = client.read_exactly(2)` assigns to `data` but the value is never used. Consider removing the assignment (or asserting on the returned value) to keep the test focused and avoid unused locals.
```diff
-data = client.read_exactly(2)
+client.read_exactly(2)
```
|
I merged this, but I'm having second thoughts about it. Implicit flushing adds overhead, and it also messes with interleaved reads and writes across fibers. Also, as Copilot pointed out, it would have to go in multiple methods. Honestly, it's probably better to terminate messages with an explicit flush.
|
For example, it's entirely possible to have this: It seems like a better direction would be to remove the implicit flush to reduce contention. |
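A sketch of the explicit-flush pattern recommended in the comments above: the writer flushes once per logical message, so correctness never depends on an implicit flush hidden inside the read path. `ToyStream` here is illustrative, not io-stream's API (though io-stream's buffered streams expose a similar `#flush`).

```ruby
require "socket"

# Minimal write-buffering stream: writes accumulate until an explicit flush.
class ToyStream
	def initialize(io)
		@io = io
		@write_buffer = +""
	end

	def write(data)
		@write_buffer << data
	end

	def flush
		@io.write(@write_buffer)
		@write_buffer.clear
	end
end

a, b = UNIXSocket.pair
stream = ToyStream.new(a)

# Build one logical message from several small writes, then flush once at
# the message boundary: a single syscall, and the peer is guaranteed to
# see the complete message without any read-side flushing.
stream.write("HELLO ")
stream.write("WORLD\n")
stream.flush

message = b.gets # "HELLO WORLD\n"
```

The message boundary is the natural flush point: it keeps syscalls batched without ever leaving a peer blocked on data that is still sitting in the write buffer.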
|
This was interesting to read. But I wanna apologize for wasting your time, probably. Me and Claude were a little too PR-happy. Turns out all we had to do was flush after queueing a message send, duh. No workaround needed. Huge perf wins. Sorry again. |
|
You don't need to apologise, I think what we should do is improve the documentation to cover the recommended patterns if not already. |
|
I wrote some documentation: https://socketry.github.io/io-stream/guides/high-performance-io/index |
|
Thanks a lot. Great writeup! In my omq gem I've actually implemented proper batch flushing in the meantime: the send pump waits for messages in the queue, greedily drains up to 64, then flushes, so latency is still good. 3-4x better throughput. It's nuts! And not janky!
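A stdlib sketch of the greedy-drain send pump described in the comment above, using `Thread` and `Queue` in place of Async fibers. All names here (`run_send_pump`, the `:close` sentinel, `batch_size`) are illustrative assumptions, not omq's actual API: the pump blocks for the first message, drains whatever else is already queued up to the batch cap without blocking, and writes (flushes) once per batch.

```ruby
# Batch-flushing send pump: one blocking wait, then a greedy non-blocking
# drain, then a single write per batch instead of one per message.
def run_send_pump(queue, io, batch_size: 64)
	Thread.new do
		closed = false
		until closed
			message = queue.pop # block until at least one message is available
			batch = []
			if message == :close
				closed = true
			else
				batch << message
				# Greedy drain: grab whatever is already queued, up to the cap:
				while batch.size < batch_size
					begin
						extra = queue.pop(true) # non-blocking pop
					rescue ThreadError
						break # queue momentarily empty; send what we have
					end
					if extra == :close
						closed = true
						break
					end
					batch << extra
				end
			end
			# One write (flush) per batch keeps syscall count low while the
			# blocking pop above keeps latency good when traffic is sparse:
			io.write(batch.join) unless batch.empty?
		end
	end
end
```

Because the drain never blocks, a lone message still goes out immediately; batching only kicks in when the queue is actually backed up, which is why throughput improves without hurting latency.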
Problem
When read-ahead pulls more data than a single `read`/`read_exactly` call needs, the excess stays in `@read_buffer`. A subsequent `read(size)` finds `@read_buffer.bytesize >= size` and returns directly from the buffer without entering the `fill_read_buffer` loop, which means `flush` is never called. Any data sitting in the write buffer is silently held back.
Deadlock scenario
In a bidirectional protocol (e.g. ZMTP handshake) where two fibers exchange data over a socket pair under Async:
1. Fiber A: `read_exactly(2)` → `fill_read_buffer` → flushes "A1", `sysread` yields (`:wait_readable`)
2. Fiber B: `read_exactly(2)` → `fill_read_buffer` → flushes "B1", `sysread` reads "A1" → returns
3. Fiber B: `read_exactly(2)` → `fill_read_buffer` → flushes "B2", `sysread` yields
4. Fiber A: `sysread` returns "B1B2" (4 bytes — read-ahead pulled both)
5. Fiber A: `read_exactly(2)` consumes "B1" (2 bytes); "B2" remains in buffer
6. Fiber A: `read_exactly(2)` finds `@read_buffer.bytesize >= size` → loop skipped → `flush` never called → "A2" never sent

Reproduction
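The repository's regression test reproduces this under Async; the sketch below is a dependency-free, single-threaded walk-through of the same state. `PreFixToyStream` is hypothetical (it mimics the pre-fix behavior, where flush only happens on the fill path), and fiber B's side is collapsed into data pre-written on the peer socket.

```ruby
require "socket"

# Toy stream with pre-fix flush placement: a buffer hit skips the flush.
class PreFixToyStream
	attr_reader :write_buffer

	def initialize(io)
		@io = io
		@read_buffer = +""
		@write_buffer = +""
	end

	def write(data)
		@write_buffer << data
	end

	def flush
		@io.write(@write_buffer)
		@write_buffer.clear
	end

	def read_exactly(size)
		# Pre-fix logic: a buffer hit returns immediately; flush is only
		# reached on the fill path below.
		return @read_buffer.slice!(0, size) if @read_buffer.bytesize >= size
		flush # in the real code this happens inside fill_read_buffer
		@read_buffer << @io.readpartial(4096)
		@read_buffer.slice!(0, size)
	end
end

a, b = UNIXSocket.pair
stream_a = PreFixToyStream.new(a)

b.write("B1B2")          # the peer already sent both messages
stream_a.write("A1")
stream_a.read_exactly(2) # flushes "A1"; read-ahead pulls all of "B1B2"
stream_a.write("A2")
stream_a.read_exactly(2) # buffer hit: returns "B2" and skips the flush

stream_a.write_buffer # => "A2", still unsent; a real peer would deadlock here
```

The final state is exactly step 6 above: the read succeeded from the buffer, but "A2" never left the write buffer.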
Fix
Call `flush` at the top of `#read` before checking the buffer, so pending writes are always sent before any read, even when the read buffer already has enough data from a previous read-ahead.

Test plan