
Empty read from gitdb.OStream.read() before EOF #120

Open
lordmauve opened this issue Apr 16, 2025 · 6 comments

Comments

@lordmauve

lordmauve commented Apr 16, 2025

I have code that relies on reading an object from a gitdb stream.

To do this I used a standard .read() loop (as with io.RawIOBase):

stream = db.stream(bytes.fromhex(sha))
while chunk := stream.read(4096):
    yield chunk

The behaviour I expected (given the duck-typing with RawIOBase) is to get b'' only at EOF:

If 0 bytes are returned, and size was not 0, this indicates end of file.

However stream.read(4096) can return empty chunks even before the end of the stream, so the loop exits early.

For the file where I saw this first, it is sensitive to the size parameter - it apparently occurs for 0 < size <= 4096.
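As a possible workaround, the loop can track how many bytes are still expected instead of treating an empty chunk as EOF. This is only a sketch, assuming stream.size reports the uncompressed object size (as it does for gitdb streams) and that the empty reads are transient:

stream = db.stream(bytes.fromhex(sha))
remaining = stream.size
while remaining > 0:
    chunk = stream.read(min(4096, remaining))
    if not chunk:
        # premature empty read as described above; try again rather than stopping
        continue
    remaining -= len(chunk)
    yield chunk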

Looking at the code, there is a condition to repeat a read if insufficient bytes were returned:

gitdb/gitdb/stream.py

Lines 316 to 317 in f36c0cc

if dcompdat and (len(dcompdat) - len(dat)) < size and self._br < self._s:
    dcompdat += self.read(size - len(dcompdat))

However the leading if dcompdat and means that the condition doesn't apply if zero bytes were read. Removing this part of the condition addresses the issue (but I understand from the comment that this is in order to support compressed_bytes_read()).
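Concretely, the condition would then read as follows. This is only a sketch of the edit described above, mirroring the two lines quoted from stream.py, and it may well interfere with the compressed_bytes_read() bookkeeping that the guard appears to exist for:

if (len(dcompdat) - len(dat)) < size and self._br < self._s:
    dcompdat += self.read(size - len(dcompdat))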

@lordmauve
Author

If I print all chunk sizes with

stream = db.stream(bytes.fromhex(sha))
sz = 0
while sz < stream.size:
    chunk = stream.read(4096)
    print(len(chunk))
    sz += len(chunk)

there's a spread of sizes:

$ python bad.py | sort -n | uniq -c
      1 0
      1 533
      1 4071
      1 4073
      1 4075
      2 4080
      1 4081
      1 4082
      1 4086
      2 4087
      2 4089
      1 4090
      1 4092
      1 4093
      1 4095
  45840 4096

which seems to refute the idea expressed in this comment that it will recursively read() until the requested size is filled:

gitdb/gitdb/stream.py

Lines 310 to 312 in f36c0cc

# it can happen, depending on the compression, that we get less bytes
# than ordered as it needs the final portion of the data as well.
# Recursively resolve that.

Removing if dcompdat and:

$ python bad.py | sort -n | uniq -c
      1 347
  45856 4096

@lordmauve changed the title from "Empty read from gitdb.OSteam.read() before EOF" to "Empty read from gitdb.OStream.read() before EOF" on Apr 16, 2025
@Byron
Member

Byron commented Apr 16, 2025

Thanks for reporting!

I don't think, however, that the implementation can be trusted and it's better to use the git command wrappers provided in GitPython.

Getting a chunk of size 0 in the middle is certainly unexpected, but maybe if that's fixed it will be suitable for consumption nonetheless?

@lordmauve
Author

it's better to use the git command wrappers provided in GitPython.

GitCmdObjectDB? I've found previously that git cat-file --batch is 16 times slower than gitdb, which is substantial when our monorepo is 200GB. But I do agree that the implementation can't be trusted: I tried using a gitdb instance in threads (but with independent reads) and saw corrupted data. I've also found that the best interface to git is generally to wrap git commands. Direct object DB access is the one case where that isn't fast enough, and for that I've been using gitdb cautiously.
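For context, the backend is selectable via GitPython's odbt parameter. A minimal sketch from memory (the import paths and the odbt keyword are assumptions; check the GitPython docs for your version, and the repository path is a placeholder):

from git import Repo
from git.db import GitCmdObjectDB   # shells out to git cat-file --batch
from gitdb import GitDB             # pure-Python object database access

# Choose the object database backend when opening the repository.
repo_cmd = Repo("/path/to/monorepo", odbt=GitCmdObjectDB)
repo_gitdb = Repo("/path/to/monorepo", odbt=GitDB)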

For the current application I just need to re-hash previously unseen trees/blobs using a different hashing scheme to git's, and it has been working OK. Maybe I should re-run the git checksums as well, as a sanity check; it would probably still be faster than a git pipe, and then I could debug any issues I detect.
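Such a sanity check could recompute git's own object id while streaming and compare it with the stored one. A minimal sketch, assuming the gitdb OStream exposes binsha, type (e.g. b'blob') and size, and relying on git hashing "<type> <size>", a NUL byte, then the raw content with SHA-1:

import hashlib

def matches_git_sha(stream):
    # Header git hashes before the content: b"<type> <size>\0"
    h = hashlib.sha1()
    h.update(stream.type + b" " + str(stream.size).encode() + b"\0")
    remaining = stream.size
    while remaining > 0:
        chunk = stream.read(min(65536, remaining))
        if not chunk:
            # tolerate the premature empty reads described in this issue
            continue
        h.update(chunk)
        remaining -= len(chunk)
    return h.digest() == stream.binsha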

@Byron
Member

Byron commented Apr 18, 2025

I see. In this case I'd recommend using pygit2 instead if it must be python, or go straight to Rust and gitoxide (or git2).

@lordmauve
Author

Ah, maybe I should try pygit2. We also had a terrible time with libgit2 in a different application, so we swore off it. But that may have been more about the bindings (node-git).

I am happy using Rust in CLI tools but our internal auth stack is not available in Rust, and the two services where we use/could use gitdb would not be cost-effective to rewrite in Rust.

@Byron
Member

Byron commented Apr 18, 2025

I was thinking more along the lines of a little CLI that performs a specific task, which the main application shells out to. Alternatively, one could do the same but generate bindings.
Ultimately, if GitDB works (with some additional protections), then why not use it. But with a 200GB repository it seems like one would want to go native.
