Do Python coders have a bias towards list over tuple? [closed]

Question

Closed. This question is opinion-based. It is not currently accepting answers.

Want to improve this question? Update the question so it can be answered with facts and citations by editing this post.

Closed 11 days ago.

The community reviewed whether to reopen this question 11 days ago and left it closed:

Original close reason(s) were not resolved

Improve this question

Basic Facts

Lists are mutable (supporting inserts, appending etc.), Tuples are not
Tuples are more memory efficient, and faster to iterate over

So it would seem their use-cases are clear. Functionally speaking, lists offer a superset of operations, tuples are more performant at what they do.

Observation

Most arrays that my team creates in the course of a program, are in fact, perfectly fine as immutable. We iterate over them, apply map, reduce, filter on them, may be insert into a database from them etc. all without insertion, popping or appending on the array-like-structure.

Question

Yet, a list seems to be not only the default (and only) choice among my developers, but seems even favoured by many library APIs to pass data around (like polars, tensorflow etc. which I use heavily).

And not even like using Tuples require some special skill, knowledge or understanding another-data-structure, it's really the same in terms of necessary syntax to subscript, or iterate.

What am I missing in the reasoning here?

I don't think the performance difference matters in python. Well I've never seen it make a difference beyond micro-optimization. Tuples are hashable though. — qwr, Commented Jul 8 at 2:23
List comprehensions have a dedicated syntax, for one. (Also, even though there’s nothing at the language semantics level to dictate this – or there wasn’t before typing, anyway – tuples feel heterogeneous and lists homogeneous, which influences writing for readability.) — Ry-, Commented Jul 8 at 2:24
I like your question. I learned from reading the answers. I think rephrased it could be made not "opinion-based". I personally default to list mostly because I usually don't know exactly every way I'll use some sequence later, and may decide to mutate it for some purpose, so I only use tuples when I know I definitely won't need to mutate. — Joe, Commented Jul 8 at 4:13

user2357112 · Accepted Answer · 2024-07-08 02:33:08Z

5

It's not a matter of bias. By convention, lists are used for homogeneous data, and tuples are used for heterogeneous data, unless a requirement for mutability or hashability forces the opposite.

You can see this convention stated in places like the documentation for built-in types, where lists are described as

mutable sequences, typically used to store collections of homogeneous items

and tuples are described as

immutable sequences, typically used to store collections of heterogeneous data

So for example, if seq[2] represents "the third thing" and seq[3] represents "the fourth thing", you use a list, while if seq[2] represents "income" and seq[3] represents "birthday", you use a tuple.

edited Jul 8 at 2:33

answered Jul 8 at 2:25

user2357112

272k29 gold badges467 silver badges542 bronze badges

Thanks, this convention was something I was not aware of, as both of them support heterogeneous data. For homogeneous data though, I have long delegated to libraries like polars and tensorflow (where possible, depending on the types available), which even store the elements in contiguous memory locations (making them more performant than python native data types).
– Della
Commented Jul 8 at 3:25

Add a comment |

Tim Peters · Accepted Answer · 2024-07-08 03:45:42Z

I've been using Python since the start, and the intended use cases between lists and tuples have always been a bit fuzzy. Year after year, though, I tend to use tuples more.

While I have no compelling way to argue this case, I always thought a lot of it came down to dislike of the syntax in the first simple cases a programmer tried. Parentheses are "overused" in Python's syntax.

x = ()

looks more like a syntax error at first ("which function did they intend to call?"), while

y = 42,

still looks like a syntax error to my eyes ;-)

The corresponding cases for lists are self-evident at first sight:

x = []
y = [42]

"Readability counts", and first impressions are hard to shake off.

EDIT: BTW, there's another underappreciated reason to use tuples for "very large" sequences, when possible: when CPython's cyclic gc runs and determines that a tuple, and all its components, are immutable "all the way down", the entire tuple is exempted from being scanned in future runs of cyclic gc (it's been proved that it can never become part of a cycle). The same isn't true of lists. Even if a list is immutable "all the way down", there's nothing to stop the programmer from doing, e.g., L[0] = L next, making it part of a cycle.

Exempting large sequences from being scanned by cyclic gc can save lots of cycles in long-running programs. For that reason, e.g., I routinely create tuples with millions of ints rather than use lists.

Example:

>>> import gc
>>> t = tuple(range(100))
>>> gc.is_tracked(t)
True
>>> gc.collect()
0
>>> gc.is_tracked(t) # gc determined `t` can never be in a cycle
False

The requirement for untracking a tuple isn't quite immutability "all the way down". For example, a tuple of frozensets of ints won't get untracked, even though it's immutable "all the way down". All the tuple's elements have to be untracked tuples, or objects that don't support GC tracking at all. — user2357112, Commented Jul 8 at 4:20
On the other hand, a tuple of bytearrays will get untracked, even though it isn't immutable "all the way down", because bytearrays don't support GC tracking. — user2357112, Commented Jul 8 at 4:23
The point to giving an example was to show people how they can use gc.is_tracked() to check for themselves. There is, of course, no guarantee that the exact "rules" won't change from one release to the next. — Tim Peters, Commented Jul 8 at 4:23

Collectives™ on Stack Overflow

Do Python coders have a bias towards list over tuple? [closed]

Basic Facts

Observation

Question

2 Answers 2

Not the answer you're looking for? Browse other questions tagged
python
list
performance
data-structures
iterable-unpacking
or ask your own question.

Hot Network Questions

Collectives™ on Stack Overflow

Basic Facts

Observation

Question

2 Answers 2

Not the answer you're looking for? Browse other questions tagged pythonlistperformancedata-structuresiterable-unpacking or ask your own question.

Related

Not the answer you're looking for? Browse other questions tagged
python
list
performance
data-structures
iterable-unpacking
or ask your own question.