Description
In git.util.IterableObj
, the list_items
method recommends that iter_items
be used instead, and begins to explain why, but it appears the actual explanation was never written:
Lines 1255 to 1256 in 4023f28
The deprecated git.util.Iterable
class has the same note in its list_items
docstring:
Lines 1218 to 1219 in 4023f28
In both cases, the subsequent non-blank line in the docstring documents what the list_items
method itself returns, so the continuation truly is missing. The intended continuation does not seem to me to be something that can be inferred from the docstring as a whole. For example, here's the whole IterableObj.list_items
docstring:
Lines 1250 to 1258 in 4023f28
It may seem odd that, unlike #1712, I did not notice this when working on #1725. But I think the reason is not that the docstrings had made sense to me at that time, but instead that I had noticed the more minor (really, almost trivial) issue that perhaps the first paragraph should be split so that only its first sentence would be its summary line, decided to return to it later to consider that further, and then forgot about it.
The text "Favor the iter_items method as it will" appears to have been present, and its continuation absent, for as long as the surrounding code has been in GitPython. It was introduced in f4fa1cb along with the Iterable
class itself.
In general, it may sometimes be preferable to obtain an iterator rather than a list or other sequence because it may not be necessary to materialize a collection just to iterate over it, and because unnecessary materialization can sometimes increase space usage. On the other hand, materialization guards against mutation of the original collection during iteration. But these are completely general ideas, not informed by the list_items
docstrings nor even by any consideration specific to GitPython.
My guess is that the docstring intended to say something more specific, or at least to identify which general benefit of iter_items
serves to recommend it. So I don't think I could propose a specific improvement to that documentation without insight into what was originally intended.
Activity
Byron commentedon Dec 21, 2023
Thanks for bringing this up!
Solely on the basis of 'knowing myself a little' I'd think that the continuation would be a general listing of advantages of iterators. If
iter_items
is indeed implemented as iterator, it should be worth following up on finishing the note as the latency of collecting all items could be high, depending on the item at hand. Nowadays I'd argue thatlist_items()
shouldn't exist in the first place, but it's too late for that now.Improve self-documentation of IterableObj and related classes
Improve self-documentation of IterableObj and related classes
EliahKagan commentedon Dec 22, 2023
I would say it is implemented as an iterator. The more detailed picture is:
git.util.Iterable
anymore, it's code outside GitPython.iter_items
is abstract ingit.util.IterableObj
, which introduces it.iter_items
explicitly raisesNotImplementedError
inPushInfo
andFetchInfo
. Those concrete classes declare themselves to implementIterableObj
yet, in effect, decline to implement this abstract method. However, because those classes don't implementlist_items
, whose base-class implementation usesiter_items
, this is not a reason to avoid recommendingiter_items
overlist_items
in the base class docstrings.IterableObj
have concrete implementations ofiter_items
that return iterators: either their own, or those introduced by an intermediate base class. I don't think I missed anything while looking into this.iter_items
method in the codebase belongs toSymbolicReference
, and it returns an iterator.SymbolicReference
does not inherit fromIterableObj
, nor from any other classes, except the implicit inheritance fromobject
(not to be confused with GitPython'sObject
). I wonder if it would be beneficial to explain its relationship further in its class docstring; the description "Special case of a reference that is symbolic" might lead readers to think it is a subclass of some class representing a more general case, or that it is intended to be a subclass ofReference
(which does implementIterableObj
). But if I understand correctly, this is independent of any considerations in howIterableObj
and its methods should be documented.I've opened #1780 to propose some
IterableObj
anditer_items
related improvements, including filling in the missing information about whyiter_items
should be preferred tolist_items
along the lines of what you've said here.