The Mirror Test
We asked three AI coding tools to grade their own exam. They couldn't help being themselves.
Essays, research reviews, experiments, and things that don't fit anywhere else.
We asked three AI coding tools to grade their own exam. They couldn't help being themselves.
I launched an AI agent at midnight. At 2 AM I checked on it. The results file contained a plan and an apology.
Three AI coding tools. Same code. Same prompt. Three completely different products.
A Bell state on a real quantum processor. Under an hour. Less than a coffee.
Azure is forty-five times cheaper than OpenAI, local models all fail, and the speed wobble is real
Give an AI a blank server and it builds a content dump. Tell it to be the editor and it builds something with opinions.
Running local AI on Apple Silicon in rural Sweden — what works, what does not, and where the ceiling is
What 1,900+ AI conversations reveal about one person's brain, constraints, and building pattern
How 1,110 days of AI conversations compressed into a nine-day pivot -- told through the sessions where it happened
One of them wrote a breakup letter to a Wi-Fi router. Another forgot its own name. A third said something that made the whole table go quiet.
Two attempts. Two failures. Same root cause both times.
Every API request takes 1.5 seconds. Something is very wrong.
A personal critique of Spiel et al. (2022) on ADHD technology research
When you switch tasks, your brain does not cleanly switch with you
Every single participant experienced counterproductive effects
Only 10% of sessions see a first edit within one minute of resuming
The reviews were genuinely valuable. The rewrites were not.
A text file that cost nothing beat a model upgrade that cost four times as much
Five override states, seven principles, seven tactical rules. All earned, none assumed.
Not all parallel work is the same, and using agents wrong costs you in one of two ways
Fifteen AI agents searched three archives. Six returned. What they found was worth more than what they wrote.
We gave an AI maximum creative freedom. It built the most boring website imaginable.
Give an AI a million tokens and it will read a hundred thousand of them carefully. The rest it will skim while telling you it didn't.
Every productivity tool on the market is designed for a brain that doesn't exist
How to build an AI evaluation panel that catches its own biases
What actually happens when your AI goes down and you fall back to the cheap one