Before you evaluate tools, build business cases, or lead pilots, you need a clear mental model of what AI actually does. This lesson gives you a framework that cuts through the hype and lets you make sharper decisions about when AI is the right answer — and when it isn't.
Every vendor pitching you an AI tool wants you to believe it can do everything. Every breathless news article makes AI sound like it will either save civilization or end it. Neither is useful to you as a manager trying to run a team and hit quarterly goals.
What you need is not enthusiasm or anxiety — you need a working mental model. A way of thinking about what AI does that is accurate enough to make good decisions, practical enough to apply quickly, and durable enough to use as AI continues to evolve.
Here is one that holds up: AI is a pattern-completion engine. It learns patterns from enormous amounts of existing data and uses those patterns to complete new inputs. That's it. When you type a prompt into ChatGPT, you are providing the beginning of a pattern, and the AI is completing it based on what it has seen in training data.
This simple model explains both AI's capabilities and its failures — and it will help you evaluate any AI claim in about 30 seconds.
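To see what "pattern completion" means in the most literal sense, here is a toy sketch (not how any real product is built — modern models are vastly more sophisticated) of a program that completes a prompt by always choosing the continuation it saw most often in its training text. The training sentences are invented for illustration:

```python
from collections import defaultdict

# Toy pattern-completion: count which word follows which,
# then "complete" a prompt with the most frequent continuation.
training_text = (
    "the report is due friday . the report is due friday . "
    "the meeting is on monday ."
).split()

# Count how often each word follows each other word.
counts = defaultdict(lambda: defaultdict(int))
for prev, nxt in zip(training_text, training_text[1:]):
    counts[prev][nxt] += 1

def complete(prompt, steps=4):
    words = prompt.split()
    for _ in range(steps):
        followers = counts.get(words[-1])
        if not followers:
            break
        # No reasoning here -- just the statistically likeliest next word.
        words.append(max(followers, key=followers.get))
    return " ".join(words)

print(complete("the report"))   # -> "the report is due friday ."
print(complete("the meeting"))  # -> "the meeting is due friday ."
```

Note the second completion: the training text says the meeting is on Monday, but "is due" is the more frequent pattern, so the model confidently outputs "the meeting is due friday." That is a miniature version of the failure modes described later in this lesson — fluent, confident, and wrong.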
Everything you will ever be pitched by an AI vendor, everything your team will ever ask about, falls into one of four categories. Understanding these categories is the single most useful framework a non-technical manager can have.
Within those task types, AI performs best when a few conditions hold — conditions that follow directly from the pattern-completion model: the work closely resembles material that exists in abundance in training data, the output is easy for a human to check, and occasional errors are cheap to catch and correct.
Understanding AI's failure modes is as important as understanding its strengths. Here is where managers consistently get burned:
AI is pattern completion, not reasoning in the human sense. Ask it to analyze a situation it has not seen in training data — a truly novel business problem, an unusual regulatory situation, an unprecedented organizational challenge — and it will produce something that sounds confident but may be nonsense. It completes the pattern of "what a confident answer looks like" rather than actually working through the problem.
AI hallucinates. This is not a bug that will be fixed — it is a property of how the technology works. AI will invent specific facts, fabricate citations to papers that do not exist, and state incorrect statistics with complete confidence. Any AI output that includes specific numbers, named sources, or specific dates must be independently verified.
AI only knows what you tell it. It does not know your organization's history, your team's dynamics, the political sensitivities of a particular stakeholder, or the unwritten rules that govern how decisions get made in your context. When AI gives you advice on complex organizational situations, it is working without this context — and the gaps show.
AI is very good at producing competent, conventional work quickly. It is not good at genuine originality. The insights that come from combining two things nobody has combined before, the creative leaps that change how an industry thinks, the contrarian position that turns out to be right — these require human judgment. AI can assist with execution but rarely with the generative insight.
You will be pitched a lot of AI tools. Here is a five-question filter that surfaces the most important information in ten minutes:
| Question | What a Good Answer Sounds Like | Red Flag Answer |
|---|---|---|
| What specific task does this do? | "It classifies incoming support tickets into 12 categories and routes them to the right team." | "It uses AI to transform your entire operations." (No specifics.) |
| How accurate is it? | Specific accuracy rate with test methodology. "92% accuracy on a held-out test set of 10,000 tickets." | "It's very accurate" or "it gets better over time." (No numbers.) |
| What happens when it's wrong? | Clear description of failure modes and how humans catch and correct errors. | "It rarely makes mistakes." (Evasion — all AI makes mistakes.) |
| Can you show me a live demo on our actual data? | Yes. Demo on realistic data similar to yours. | "We'll set that up in phase 2." (They know it won't perform on real data.) |
| What do customers who have been using this for a year say? | Specific reference customers you can call, with measurable outcomes they've achieved. | "We're still early in rollout." (No proven track record.) |
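To make the accuracy question concrete: "accuracy on a held-out test set" simply means the tool is scored on examples it never saw during training, and accuracy is correct predictions divided by total. A minimal sketch of the arithmetic (the ticket categories below are invented for illustration):

```python
# Accuracy on a held-out test set: correct predictions / total.
# These labels are invented for illustration only.
predictions = ["billing", "billing", "tech", "refund", "tech"]
actual      = ["billing", "refund",  "tech", "refund", "tech"]

correct = sum(p == a for p, a in zip(predictions, actual))
accuracy = correct / len(actual)
print(f"{accuracy:.0%}")  # 4 of 5 correct -> 80%
```

This is also why a vendor must describe the test set: 92% accuracy measured on data unlike yours tells you very little about how the tool will perform on your tickets.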
Think about the work your team does on a weekly basis. List 10 recurring tasks — the things that happen every week or month. Then make two assessments for each one: how well it fits AI's strengths described above, and how costly an error would be if one slipped through. Rate each task "Strong candidate," "Possible," or "Poor fit."
For each task you rate "Strong candidate" or "Possible," note the one specific condition that makes it suitable — or the one concern that needs to be addressed before deployment.
Keep this list. You will use it in Day 2 to frame your tool evaluation and in Day 3 to build a business case.