Discussion about this post

User's avatar
Synthetic Civilization's avatar

What this really highlights is that productivity doesn’t scale at the task level, it scales at the institutional level.

We’re seeing genuine task acceleration, but jobs are embedded in workflows, permissions, review cycles, liability, and coordination structures that haven’t moved. Until those layers change, micro gains get absorbed as slack, quality variance, or reallocation rather than output.

The Solow paradox wasn’t about computers being weak. It was about institutions being slow.

Sarah Bana's avatar

Thanks, Alex. I really enjoyed reading this. With another group of coauthors, we tried to bridge this macro-micro divide by looking at open source software but when a country banned it (https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5332003).

Maybe this is worthy of a longer conversation, but I wonder if simple measures of adoption are not very informative. The best way I've heard it said is by Arvind Narayanan and Sayesh Kapoor:

Imagine an alternate universe in which people don’t have words for different forms of transportation, only the collective noun “vehicle.” They use that word to refer to cars, buses, bikes, spacecraft, and all other ways of getting from place A to place B. Conversations in this world are confusing. There are furious debates about whether or not vehicles are “environmentally friendly,” but (even though no one realizes it) one side of the debate is talking about bikes and the other side about trucks. There is a breakthrough in rocketry, but when the media focuses on how vehicles have gotten faster, people call their car dealer (oops, vehicle dealer) to ask when faster models will be available. Meanwhile, fraudsters have capitalized on the fact that consumers don’t know what to believe when it comes to vehicle technology, so scams are rampant in the vehicles sector.

Now replace the word “vehicles” with “artificial intelligence,” and we have a pretty good description of the world we live in.

If this is the case, then are we measuring adoption in a way that is helpful? Even in Shen and Tamkin (2025), the way the tool is used is so different across participants, is it even the same tool? I am asking these questions because I don't know the answers. And I am simultaneously trying to make progress in my other work on algorithmic hiring, where I think the challenges are similar. A resume screening tool is different from a voice interviewer is different from having candidates play games that are trained on performance of your current employees. But we are calling these 'algorithmic hiring' and trying to draw broad conclusions over inherently different technologies.

22 more comments...

No posts

Ready for more?