Feature or enhancement
Proposal:
Code reading data in pure Python tends to make a buffer variable, call os.read() (which returns a separate, newly allocated buffer of data), then copy/append that data onto the pre-allocated buffer [0]. That creates unnecessary extra buffer objects as well as unnecessary copies. Provide os.readinto for directly filling a Buffer Protocol object.

os.readinto should closely mirror _Py_read, which underlies os.read, in order to get the same retry behavior as well as its well-tested cross-platform support.

Move simple cases that use os.read (ex. [0]) to the new API when it makes the code simpler and more efficient. Potentially adding readinto to more readable/writable file-like proxy objects or objects which transform the data (ex. Lib/_compression) is out of scope for this issue.
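A minimal sketch of the proposed call, assuming it lands as os.readinto(fd, buffer) and returns the number of bytes read; readinto_fallback is a hypothetical shim for interpreters that lack it:

```python
import os

def readinto_fallback(fd, buffer):
    # Emulate the proposed os.readinto: read into a temporary bytes
    # object, then copy (the real API avoids this extra allocation).
    view = memoryview(buffer).cast('B')
    data = os.read(fd, len(view))
    view[:len(data)] = data
    return len(data)

readinto = getattr(os, 'readinto', readinto_fallback)

# Fill a preallocated bytearray straight from a pipe.
r, w = os.pipe()
try:
    os.write(w, b'hello')
    buf = bytearray(16)
    n = readinto(r, buf)  # -> 5; buf[:5] == b'hello'
finally:
    os.close(r)
    os.close(w)
```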
[0]

cpython/Lib/subprocess.py, lines 1914 to 1921 in 298dda5:

```python
# Wait for exec to fail or succeed; possibly raising an
# exception (limited in size)
errpipe_data = bytearray()
while True:
    part = os.read(errpipe_read, 50000)
    errpipe_data += part
    if not part or len(errpipe_data) > 50000:
        break
```
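For illustration, the loop above could instead fill a preallocated buffer in place, the direction this issue proposes; drain_errpipe is a hypothetical helper, and the os.read branch is a fallback for Pythons without os.readinto:

```python
import os

def drain_errpipe(errpipe_read, limit=50000):
    # Preallocate limit + 1 bytes: filling past index `limit` means
    # the writer sent more than `limit` bytes, matching the size
    # check in the original loop.
    buf = bytearray(limit + 1)
    view = memoryview(buf)
    pos = 0
    while pos <= limit:
        if hasattr(os, 'readinto'):
            n = os.readinto(errpipe_read, view[pos:])
        else:
            chunk = os.read(errpipe_read, len(buf) - pos)
            n = len(chunk)
            view[pos:pos + n] = chunk
        if n == 0:  # EOF: writer closed the pipe
            break
        pos += n
    return bytes(buf[:pos])
```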
cpython/Lib/multiprocessing/forkserver.py, lines 384 to 392 in 298dda5:

```python
def read_signed(fd):
    data = b''
    length = SIGNED_STRUCT.size
    while len(data) < length:
        s = os.read(fd, length - len(data))
        if not s:
            raise EOFError('unexpected EOF')
        data += s
    return SIGNED_STRUCT.unpack(data)[0]
```
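A hedged sketch of the same function filling a preallocated buffer directly; SIGNED_STRUCT is assumed here to be struct.Struct('q'), and the os.read branch emulates os.readinto where it is unavailable:

```python
import os
import struct

SIGNED_STRUCT = struct.Struct('q')  # assumption: matches forkserver's struct

def read_signed_into(fd):
    # Fill a preallocated buffer in place instead of concatenating
    # bytes objects; uses os.readinto when available (the API this
    # issue proposes), else an os.read-plus-copy fallback.
    buf = bytearray(SIGNED_STRUCT.size)
    view = memoryview(buf)
    pos = 0
    while pos < len(buf):
        if hasattr(os, 'readinto'):
            n = os.readinto(fd, view[pos:])
        else:
            chunk = os.read(fd, len(buf) - pos)
            n = len(chunk)
            view[pos:pos + n] = chunk
        if n == 0:
            raise EOFError('unexpected EOF')
        pos += n
    return SIGNED_STRUCT.unpack(buf)[0]
```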
cpython/Lib/_pyio.py, lines 1695 to 1701 in 298dda5:

```python
def readinto(self, b):
    """Same as RawIOBase.readinto()."""
    m = memoryview(b).cast('B')
    data = self.read(len(m))
    n = len(data)
    m[:n] = data
    return n
```
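The memoryview(b).cast('B') step is what lets that default accept any writable buffer, not just bytearray. A self-contained sketch of the same copy-based pattern (copy_readinto and the lambda data source are illustrative):

```python
import array

def copy_readinto(read, b):
    # Mirrors the _pyio default above: flatten any writable buffer
    # to a byte view, read that many bytes, copy them in.
    m = memoryview(b).cast('B')
    data = read(len(m))
    n = len(data)
    m[:n] = data
    return n

# Works even when the target buffer has multi-byte items.
buf = array.array('i', [0, 0])
n = copy_readinto(lambda size: b'\x01' * size, buf)
```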
os.read loops to migrate

Well contained os.read loops

- multiprocessing.forkserver read_signed - @cmaloney - gh-129205: Update multiprocessing.forkserver to use os.readinto #129425
- [x] subprocess Popen._execute_child - @cmaloney - gh-129205: Use os.readinto() in subprocess errpipe_read #129498

os.read loop interleaved with other code

- _pyio FileIO.read, FileIO.readall, FileIO.readinto - see Reduce copies when reading files in pyio, match behavior of _io #129005 -- @cmaloney
- _pyrepl.unix_console UnixConsole.input_buffer -- fixed-length underlying buffer with "pos" / window on top.
- pty _copy. Operates around a "high water level" / attempts to keep a fixed-ish size buffer. Wraps os.read with a _read function.
- subprocess Popen.communicate. Note: this feels like something non-contiguous Py_buffer would be really good for, particularly in self.text_mode where currently all the bytes are "copied" into a contiguous bytes to then turn into text...
- tarfile _Stream._read and _Stream.__read. Note: builds _LowLevelFile around os.read, but other read methods are also available.

Has this already been discussed elsewhere?

No response given

Links to previous discussion of this feature:

#129005 (comment)

Linked PRs