How do I practice energy cost of context switching?

Track your context switches for one full workday. Keep a running log — a notebook beside your keyboard or a simple text file — and every time you shift from one task, application, or cognitive mode to another, note three things: the time, what you switched from, and what you switched to. Do not.

Why does energy cost of context switching fail?

Treating all context switching as equally harmful and attempting to eliminate it entirely. Not every switch costs the same. Moving from writing a report to checking a quick factual reference within that report is a micro-switch with near-zero residue — the cognitive frame stays intact. Moving from.

How to ThinkIn the Age of AI

The energy cost of context switching

~16 min read·sovereignty·

sovereignty energy context-switching attention-residue batching deep-work focus

Core Primitive

Every context switch depletes energy — batch similar tasks to conserve it.

The invisible tax on your day

You know about time management. You schedule your tasks. You protect your peak window (Peak energy for peak work). You maintain a priority stack (The priority stack) and work from the top. And yet, at the end of the day, you often feel more depleted than your activity log can explain. You did four hours of "deep work," but the output feels like two. You had energy this morning — your rhythm map (Energy follows ultradian rhythms) confirmed it — and yet by noon the tank was empty.

The missing variable is not time. It is transitions.

Every time you shift from one cognitive task to another — from writing a report to answering an email, from strategic planning to a team check-in, from code to a Slack thread and back to code — your brain does not perform a clean swap. It performs a messy, expensive, neurologically demanding operation that consumes working memory, depletes executive function, and leaves behind traces of the previous task that contaminate the new one. This is not a metaphor. It is a measurable cognitive phenomenon with a research literature spanning three decades, and it explains a significant portion of the energy drain that most knowledge workers attribute to "a long day" when the real culprit is a fragmented one.

The previous lessons in this phase examined where energy comes from and where it goes across different dimensions — physical (Recovery is not laziness through Nutrition affects cognitive energy directly), social (Social energy management), and the broader rhythm patterns (Energy follows ultradian rhythms) that determine when you have the most. This lesson examines one of the most potent and least visible ways that energy is wasted: the switching tax levied every time your cognitive system tears down one task context and builds another.

What actually happens in your brain when you switch

The intuitive model of task-switching is a toggle: you are on Task A, you flip a switch, now you are on Task B. Fast, clean, costless. The neuroscience reveals something far more expensive.

When you engage with a complex task, your prefrontal cortex constructs what cognitive scientists call a "task set" — a configuration of goals, rules, stimuli-response mappings, and active memory contents specific to that task. Writing a strategy document requires one task set: the vocabulary of the domain, the logical structure of the argument, the intended audience, the current position within the document. Responding to an email about a budget issue requires an entirely different task set: different facts, different audience, different emotional register, different decision criteria.

When you switch tasks, two things must happen. First, your brain must inhibit the current task set — actively suppress the goals, rules, and representations that were serving Task A. Second, it must activate the new task set — load the goals, rules, and representations required for Task B. Joshua Rubinstein, David Meyer, and Jeffrey Evans documented this dual process in their 2001 paper in the Journal of Experimental Psychology: Human Perception and Performance. They called the two stages "goal shifting" (deciding to switch) and "rule activation" (reconfiguring the cognitive system for the new task). Both stages take time and consume cognitive resources. Neither is instantaneous.

Their experiments showed that switching costs increase with the complexity of the tasks being switched between. Simple tasks — sorting cards by color versus shape — incur switching costs measurable in hundreds of milliseconds. Complex tasks — the kind that constitute knowledge work — incur costs measurable in minutes. And critically, the costs are not just time costs. They are resource costs. Each switch consumes a portion of the executive function that your prefrontal cortex uses for planning, decision-making, and sustained attention. Executive function is a depletable resource, as Roy Baumeister's ego depletion research demonstrated (though the effect sizes have been debated, the directional finding — that effortful cognitive control draws on a limited pool — is consistently supported by meta-analyses). Every switch draws from that pool. The more switches, the less remains for the actual work.

Attention residue: the ghost of the task you left

The switching cost would be bad enough if the transition were clean — if, once you finished the switch, you were fully present in the new task. But it is not clean. This is the contribution of Sophie Leroy, whose research at the University of Washington Bothell has fundamentally changed how we understand the relationship between task-switching and cognitive performance.

Leroy introduced the concept of "attention residue" in a 2009 paper in Organizational Behavior and Human Decision Processes. Her core finding: when people transition from Task A to Task B before Task A is complete, their cognitive performance on Task B is significantly degraded — not because they lack the ability to do Task B, but because part of their attention remains stuck on the unfinished Task A. The unresolved questions, the partial solutions, the unfinished threads of Task A persist as active cognitive processes that compete with Task B for working memory resources.

The effect is strongest when Task A is left incomplete. If you finish writing a section before switching to email, the residue is minimal — the task set has been cleanly closed. But if you switch mid-paragraph, mid-thought, mid-problem, the open loop remains active. Your brain, following the Zeigarnik effect (Bluma Zeigarnik's 1927 finding that uncompleted tasks are remembered better than completed ones), keeps the unfinished task in an active, accessible state — precisely because it is unfinished. That accessibility is not free. It consumes the same working memory resources you need for the new task.

Leroy's experiments controlled for motivation and ability. Participants were told to focus entirely on the new task. They wanted to focus. They tried to focus. It did not matter. The residue persisted regardless of intention. Knowing about attention residue does not prevent it, just as knowing about optical illusions does not make them disappear. The effect is structural, not motivational. Your prefrontal cortex cannot simply decide to release the unfinished task. The release mechanism is completion, not willpower.

This is why the common advice to "just focus" on the task at hand is neurologically naive. Focus is not a willpower problem. It is a state that depends on having a clean cognitive workspace — one that is not contaminated by the half-processed remnants of the previous three things you glanced at.

The twenty-three-minute number

Gloria Mark, a professor of informatics at UC Irvine, has spent over two decades studying attention and distraction in the workplace. Her research, synthesized in her 2023 book Attention Span, includes extensive observational studies of knowledge workers in their natural work environments — not laboratory settings but actual offices, with actual email, actual colleagues, and actual interruptions.

Mark's most widely cited finding is that, after an interruption, it takes an average of twenty-three minutes and fifteen seconds to return to the original task. This number is frequently misunderstood. It does not mean that you stare blankly at your screen for twenty-three minutes after checking a Slack message. It means that after an interruption, people typically do not return directly to the interrupted task. They visit, on average, two other tasks before coming back. The twenty-three minutes includes the time spent on those intermediate tasks plus the reorientation time once they finally return.

The energy implication is critical: those twenty-three minutes are not simply "lost time." Each of the intermediate tasks incurs its own switching cost. A single interruption does not create one switch — it creates a cascade of switches. You leave the strategy document to check Slack. While in Slack, you see a message about tomorrow's meeting. You open your calendar to check the time. While in the calendar, you notice a conflicting appointment. You email a colleague about the conflict. You return to Slack. You remember the strategy document. You return to it. That is five context switches triggered by one interruption. Each one extracted a tax from your executive function. Each one left a trace of attention residue.

Mark's research also revealed that people compensate for interruptions by working faster after returning to the interrupted task — but at a cost. The compensatory acceleration is associated with higher stress, higher frustration, higher mental workload, and lower quality of work. You get back to roughly the same output level, but the energy expenditure is dramatically higher. It is the cognitive equivalent of sprinting to make up for a delay rather than running at a sustainable pace. You arrive at the same destination, but you arrive exhausted.

The distinction between switching and multitasking

Context switching and multitasking are often conflated. They are different phenomena with different costs.

Multitasking — performing two demanding tasks simultaneously — is, for all practical purposes, impossible. The cognitive science is unambiguous on this point. What people call "multitasking" is rapid task-switching: alternating between tasks so quickly that it feels simultaneous but is actually serial. David Meyer summarized it in an interview for the American Psychological Association: "There is no such thing as true multitasking for complex cognitive tasks. There is only rapid switching, and rapid switching is expensive."

Cal Newport, in Deep Work (2016) and A World Without Email (2021), extended this laboratory finding into an argument about knowledge work culture. Modern knowledge work environments, Newport argues, are designed for maximal context switching. Email, instant messaging, open-plan offices, and the expectation of constant responsiveness create conditions where the average knowledge worker switches tasks every three to five minutes — not because the work requires it, but because the communication infrastructure demands it. The result is that deep, focused work becomes structurally impossible during normal working hours, pushed to early mornings, late evenings, or weekends — precisely the hours that should be reserved for recovery (Recovery is not laziness).

The energy cost compounds. Each switch is not merely additive — it is slightly super-additive, because each switch begins from a more depleted executive function baseline than the last. Your first switch of the day is relatively cheap. Your fortieth is substantially more expensive per unit of cognitive reconfiguration, because the cognitive system executing the switch has been degraded by the previous thirty-nine. This is why the afternoon feels so much harder than the morning, even when the tasks are identical. It is not just circadian decline. It is accumulated switching damage.

The batching solution

If context switching is the disease, batching is the treatment. The principle is straightforward: instead of interleaving different types of cognitive work throughout the day, group similar tasks together into unbroken blocks. Write for two hours. Then communicate for an hour. Then administrate for an hour. Within each block, the cognitive task set stays loaded. You eliminate the tear-down-and-rebuild cycle that switching demands.

The concept is not new. Peter Drucker, in The Effective Executive (1967), argued that effective knowledge workers consolidate discretionary time into large blocks and protect those blocks from interruption. Newport formalized this as "time blocking" — assigning every hour of the workday to a specific category of work and respecting those assignments as commitments.

But the energy framing adds something that the time management framing misses. Batching is not just more time-efficient. It is more energy-efficient. Two hours of unbroken writing does not merely produce more words than four thirty-minute writing sessions separated by email checks. It produces those words at a lower total energy cost, because the cognitive system loads the writing task set once and runs it continuously, rather than loading it four times with all the associated costs of inhibition, activation, and residue management.

The research supports this. Mark's studies show that uninterrupted work blocks produce both higher output quality and lower subjective stress compared to the same total duration of work fragmented by switches. Csikszentmihalyi's flow research reinforces the point: flow states — periods of effortless, high-performance focus — require sustained engagement with a single task. Flow cannot develop in a context-switching environment, because the minimum time to enter flow is approximately fifteen to twenty minutes of unbroken focus, and most workers do not sustain single-task attention for that long before the next interruption arrives.

Three levels of batching

Effective batching operates at three time scales, and each one reduces a different category of switching cost.

Within the day: block scheduling. This is the most commonly discussed level. You divide your workday into blocks, each dedicated to a single type of cognitive work. The structure might look like this: 8:30 to 11:00 for deep work (your peak window, loaded with your stack's top item). 11:00 to 12:00 for communication (email, Slack, responses). 12:00 to 1:00 for lunch and recovery. 1:00 to 2:30 for collaborative work (meetings, pair work). 2:30 to 3:00 for communication batch two. 3:00 to 4:30 for administrative and planning work. Within each block, you stay in one cognitive mode. Between blocks, you accept the switch as a planned transition with a deliberate break.

The key distinction from generic time blocking is the energy rationale. You are not just scheduling tasks for organizational clarity. You are sequencing cognitive modes to minimize the total number of task-set reconfigurations your prefrontal cortex must perform. The difference between ten planned switches and forty unplanned ones is not merely organizational. It is neurological.

Within the week: theme days. Paul Graham's 2009 essay "Maker's Schedule, Manager's Schedule" identified the fundamental tension: makers (writers, programmers, designers) need large, unbroken blocks; managers need many short interactions. Most knowledge workers are both. The solution is temporal separation — not just within the day but across the week.

Theme days push batching to the daily level. Monday and Wednesday might be "maker days" — no meetings, minimal communication, deep work dominant. Tuesday and Thursday might be "manager days" — meetings, calls, collaboration, administrative work. Friday might be hybrid — morning review and planning, afternoon loose. The advantage is that on maker days, you do not merely protect a block from interruption. You eliminate the entire cognitive mode of reactiveness for the full day. No part of your brain is monitoring for messages, anticipating meeting transitions, or preparing for a context shift. The cognitive landscape is uniform.

Within the task: completion over progress. The subtlest level of batching happens within individual tasks. Leroy's research shows that attention residue is strongest when tasks are left incomplete. This means that stopping a task at a natural completion point — the end of a section, the resolution of a sub-problem, the shipment of a deliverable — generates less residue than stopping mid-thought. When you must leave a task before completion, the next best strategy is what Hemingway reportedly practiced: stop in the middle of a sentence, so you know exactly where to pick up. The cognitive re-entry cost is lower when the resumption point is unambiguous.

This connects directly to your priority stack (The priority stack). The stack's "pop" operation — removing a completed item — is a clean transition. The "swap" operation — suspending a blocked item — is a dirtier transition that generates more residue. The stack's discipline of working the top item to completion before moving to the next is not just a priority management practice. It is an energy conservation practice.

The self-interruption problem

External interruptions get the blame, but research consistently shows that self-interruptions are at least as costly — and more frequent.

Mark's observational studies found that roughly 44 percent of task switches in knowledge work are self-initiated. The person was not interrupted by a colleague, a notification, or a phone call. They interrupted themselves — checking email without a prompt, opening a browser tab, switching to a lower-priority task because the current one hit a point of difficulty or boredom.

The neurological driver is straightforward. When a task becomes difficult, your brain's reward system protests. The prefrontal cortex is exerting effortful control to maintain focus on a demanding, low-immediate-reward task. Meanwhile, the limbic system is offering a constant stream of easier, higher-immediate-reward alternatives: check the inbox (variable reward schedule, like a slot machine), open social media (social comparison and novelty rewards), switch to an easier task (completion reward with lower effort). The self-interruption is not a failure of character. It is a predictable outcome of a conflict between two neural systems — one optimizing for long-term value and one optimizing for immediate reward.

The energy implication is that self-interruptions are doubly expensive. They carry the same switching costs as external interruptions — task-set inhibition, activation, and residue. And they carry an additional cost: the cognitive effort of the interrupted self-regulation cycle. You were exerting effortful control. You lost the fight. Now you must re-exert effortful control to return to the task, starting from a more depleted baseline.

The batching solution addresses self-interruptions by removing the temptation rather than relying on willpower to resist it. During your deep work block, email is closed — not minimized, not on silent, closed. Your phone is in another room, not face-down on the desk. Slack is logged out. Browser tabs unrelated to the current task are closed. The environment is configured so that the path of least resistance is staying on task, because the competing options have been structurally removed.

This is environment design, not discipline. It is the same principle that makes it easier to eat well when your kitchen contains only healthy food. You are not resisting the cookie. The cookie is not there.

Why peak energy and switching costs interact

This lesson connects to Peak energy for peak work (peak energy for peak work) through a mechanism that is easy to miss: context switching does not just waste time. It degrades the quality of whatever energy you have left.

Your peak cognitive window is a finite resource. If you enter that window and immediately begin switching — checking email before starting the deep work block, responding to a message five minutes in, glancing at a notification ten minutes later — you are consuming peak energy on switching costs. The prefrontal cortex resources that should be fueling your best analytical thinking are instead being spent on task-set reconfiguration. You are burning premium fuel in the switching engine instead of the performance engine.

The practical consequence: a two-hour peak window with six interruptions does not deliver two hours of peak-quality work minus the interruption time. It delivers a fragmented experience where you never fully enter the deep cognitive state that makes the peak window valuable in the first place. The fifteen to twenty minutes required to reach flow (Csikszentmihalyi) means that any interruption arriving within that window resets the clock. Six interruptions in two hours means you never reach flow at all. The peak window is consumed without producing peak work.

This is why the connection between priority stacking (The priority stack) and peak protection (Peak energy for peak work) matters so much. When you work your stack's top item during your peak window with no switching, you get the compounding benefit: the most important work, receiving the highest energy, with the lowest switching overhead. Remove any one of those three conditions and the system degrades.

AI as a switching-cost auditor

The challenge with context switching is that you cannot feel the cost in real time. Each individual switch feels trivial — a two-second glance at a notification, a thirty-second email reply. The accumulated cost across a day is invisible from inside the day. You only detect it in the outcome: less output than expected, more fatigue than the tasks warrant, a quality gap between your best focused work and your average fragmented work.

An AI configured to monitor your switching patterns can surface what subjective experience obscures. It can track application switches, tab changes, and communication interruptions throughout the day and produce a switching profile: total switches, average time between switches, longest unbroken blocks, and the correlation between fragmented periods and energy ratings from your daily audit (Energy auditing).

More usefully, an AI can serve as a switching gatekeeper during deep work blocks. Instead of closing all communication channels — which creates anxiety for people who fear missing something urgent — the AI monitors incoming messages and applies a triage filter. Genuinely urgent items (defined by your criteria, not the sender's) break through. Everything else is held and delivered in batch at the next communication window. This gives you the cognitive security of knowing nothing urgent is being missed, without the switching cost of monitoring it yourself.

The pattern detection over weeks reveals your personal switching vulnerabilities. Which days have the highest switch counts? What triggers the self-interruption cascade? Is there a time of day when your self-interruption rate spikes (probably when your ultradian rhythm dips)? Which communication channels generate the most switches per unit of useful information received? The data transforms switching from an invisible, ambient cost into a visible, manageable one — the same transformation that the energy audit (Energy auditing) performed for energy as a whole.

From switching costs to energy leaks

You have now mapped the energy landscape across multiple dimensions: physical foundations, social dynamics, temporal rhythms, peak-window matching, and — with this lesson — transition costs. Each dimension reveals a different mechanism by which energy is consumed, conserved, or wasted.

The next lesson (Energy leaks) addresses a subtler and more insidious form of energy drain: energy leaks. Where context switching is a visible, active cost — you can point to the moment of the switch — energy leaks are background processes. They are the unresolved issues, unfulfilled commitments, and open loops that drain energy continuously even when you are not thinking about them. The unpaid bill. The difficult conversation you are avoiding. The project you committed to but have not started. Each one runs as a background process in your cognitive system, consuming resources below the level of conscious awareness.

But before you turn to leaks, consolidate what this lesson teaches. Your day contains a finite number of task-set reconfigurations that your prefrontal cortex can perform before executive function degrades to the point where focused work becomes impossible. Every switch — planned or unplanned, external or self-initiated — draws from that budget. The question is not how to eliminate all switching. Some transitions are necessary, and the planned transitions between batched blocks carry modest costs. The question is how to eliminate the unnecessary switches — the reflexive email checks, the notification-triggered task jumps, the ambient fragmentation that turns a potentially focused day into a cognitive demolition derby.

Batch similar tasks. Protect your deep work blocks from interruption. Design your environment to make staying on task the path of least resistance. Work your priority stack from the top, pushing items to completion before swapping. Schedule your reactive work into designated windows. And measure. Track your switches for one day. Count them. See the number. Then redesign the structure so the number drops.

The energy you save is not abstract. It is the difference between ending the day depleted and ending the day with reserves. It is the difference between peak work that reaches flow and peak work that never escapes the switching tax. It is the difference between a day that was long and a day that was deep.

Every context switch costs energy. You cannot eliminate the cost. But you can stop paying it forty times when ten would do.

Practice

Track Context Switches in a Google Sheet Energy Audit

You'll create a real-time tracking system in Google Sheets to log every context switch during your workday, then analyze patterns to identify your biggest energy drains.

10 minutes setup + full workday trackingIntermediate

Method: Energy AuditingTool: Google Sheets

1Open Google Sheets and create a new spreadsheet titled 'Context Switch Audit - [Today's Date]'. Create five column headers: Time, Switched From, Switched To, Switch Type (dropdown), and Notes.
2Set up data validation for the Switch Type column by selecting the column, clicking Data > Data validation, and creating a dropdown with three options: 'Planned Transition', 'External Interruption', and 'Self-Interruption'. Keep this sheet open in a browser tab throughout your workday.
3Every time you switch tasks or applications, immediately log the entry in Google Sheets: record the current time, what you were doing, what you're switching to, select the switch type from the dropdown, and optionally note how long it took to reorient (e.g., '30 seconds to remember where I was'). Do not modify your behavior — just observe and record.
4At the end of your workday, create a summary section below your log. Use the COUNTA function to count total switches, manually identify and highlight your three longest unbroken work stretches and three most fragmented hours, then use COUNTIF to calculate how many switches were each type (e.g., '=COUNTIF(D:D,"Self-Interruption")').
5Calculate your self-interruption ratio by dividing self-interruptions by total switches, then review your fragmented hours and estimate total reorientation time lost. Write a brief reflection in a Notes row about which switch types cost you the most energy and which unbroken stretches felt most productive.

Frequently Asked Questions

Common questions about this lesson

Loading lessons

Preparing the next section of the lesson graph.