-
-
Notifications
You must be signed in to change notification settings - Fork 34.5k
bpo-33234 Improve list() pre-sizing for inputs with known lengths (no __length_hint__) #9846
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 1 commit
7602879
39a8757
ab450b3
717ca67
b678442
b40bd62
3837486
b6ff7ce
e7f8c46
0a6c8ba
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
The list() constructor isn't taking full advantage of known input lengths or length hints. This commit makes the constructor pre-size and not over-allocate when the input size is known or can be reasonably estimated. A new test is added to test_list.py and a test needed to be modified in test_descr.py as it assumed that there is only one call to __length_hint__() in the list constructor.
- Loading branch information
There are no files selected for viewing
| Original file line number | Diff line number | Diff line change |
|---|---|---|
| @@ -0,0 +1,2 @@ | ||
| The list constructor will pre-size and not over-allocate when the input size | ||
| is known or can be reasonably estimated. | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. "or can be reasonably estimated" I don't understand this part. |
||
| Original file line number | Diff line number | Diff line change |
|---|---|---|
|
|
@@ -76,6 +76,32 @@ list_resize(PyListObject *self, Py_ssize_t newsize) | |
| return 0; | ||
| } | ||
|
|
||
| static int | ||
| list_preallocate_exact(PyListObject *self, Py_ssize_t size) | ||
| { | ||
| PyObject **items; | ||
| size_t allocated, num_allocated_bytes; | ||
|
|
||
| allocated = (size_t)size; | ||
| if (allocated > (size_t)PY_SSIZE_T_MAX / sizeof(PyObject *)) { | ||
| PyErr_NoMemory(); | ||
| return -1; | ||
| } | ||
|
|
||
| if (size == 0) { | ||
| allocated = 0; | ||
| } | ||
| num_allocated_bytes = allocated * sizeof(PyObject *); | ||
| items = (PyObject **)PyMem_Malloc(num_allocated_bytes); | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Would not be easier to use |
||
| if (items == NULL) { | ||
| PyErr_NoMemory(); | ||
| return -1; | ||
| } | ||
| self->ob_item = items; | ||
|
vstinner marked this conversation as resolved.
|
||
| self->allocated = allocated; | ||
| return 0; | ||
| } | ||
|
|
||
| /* Debug statistic to compare allocations with reuse through the free list */ | ||
| #undef SHOW_ALLOC_COUNT | ||
| #ifdef SHOW_ALLOC_COUNT | ||
|
|
@@ -2649,6 +2675,13 @@ list___init___impl(PyListObject *self, PyObject *iterable) | |
| (void)_list_clear(self); | ||
| } | ||
| if (iterable != NULL) { | ||
| Py_ssize_t iter_len = PyObject_LengthHint(iterable, 0); | ||
| if (iter_len == -1){ | ||
|
Member
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. nitpick: ...) { (add a space) |
||
| PyErr_Clear(); | ||
| } | ||
| if (iter_len > 0 && list_preallocate_exact(self, iter_len)) { | ||
| return -1; | ||
| } | ||
| PyObject *rv = list_extend(self, iterable); | ||
| if (rv == NULL) | ||
| return -1; | ||
|
|
||
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
length, not size.