Curriculum conceptLast verified June 16, 2026

Machine learning: the predictive AI you already run

Machine learning is software that learns patterns from data instead of following hand-written rules — and most "AI" already inside companies is the predictive kind, distinct from the generative wave in the news.

Working and Deep are Pro — free during launch.

Most of the artificial intelligence already running inside large organizations is not the chatbot kind in the news. It is machine learning: software that learns patterns from historical data instead of following rules a programmer wrote by hand. The credit-card transaction that gets flagged as suspicious, the demand forecast that tells a warehouse how many units to stock, the resume screener that ranks applicants, the churn model that predicts which customers are about to cancel, the dynamic price on an airline seat, the spam filter on corporate email, the recommendation that surfaces the next product or the next show, the fraud score on a loan application, the predictive-maintenance alert on a factory motor, the credit risk model under an underwriting decision, the routing that decides which warehouse ships your order, the anomaly detector watching the network for intrusions, the medical-imaging triage tool flagging a scan for a radiologist's attention. These are the workhorses of applied AI, and almost all of them are some flavor of machine learning. They have been quietly embedded in operations for a decade or more, long before the generative wave arrived.

The core idea is a shift in where the intelligence comes from. In conventional software, a person decides the logic: if a transaction is over a threshold and from a new location, flag it. That works until the rules multiply past what anyone can maintain, and it never adapts on its own. Machine learning inverts this. You show a system thousands or millions of past examples that are already labeled with the right answer — these transactions were fraud, those were legitimate — and the system adjusts its own internal settings until it can separate the two. The output is a statistical model: a mathematical function that takes new inputs it has never seen and produces a prediction, usually a number between zero and one that reads as a probability or a confidence. Nobody hand-wrote the rule. The data, in effect, wrote it. The term itself dates to 1959, when IBM researcher Arthur Samuel built a checkers program that improved by playing against itself, and the field has matured for decades since.

It helps to know the three broad families, because they answer different questions. Supervised learning, by far the most common in business, learns from labeled examples to predict a known target: will this customer churn, is this transaction fraud, what will demand be next quarter. It needs a ground-truth answer for each training example, which is why labeling data is often the expensive, unglamorous bottleneck. Unsupervised learning finds structure in data with no labels at all — grouping customers into natural segments, or spotting transactions that do not look like any normal pattern. Reinforcement learning learns by trial and error against a reward signal, the way a system learns to play a game or tune a logistics route, and it powers some of the post-training that shapes today's chat models. The distinction worth carrying into a meeting is that predictive machine learning answers narrow, measurable questions, while the generative models discussed elsewhere in this curriculum produce open-ended new content. They share mathematical ancestry but solve different problems, and they fail in different ways.

The reason this distinction matters to a leader is governance and budget, not trivia. Predictive models are typically narrow, measurable, and auditable: a fraud model has a precision and a recall you can put on a dashboard, a forecast has an error you can track against reality, and a credit model is already subject to fair-lending scrutiny in most jurisdictions. That makes them easier to govern than open-ended generative systems — but it does not make them safe by default. A model is only as good as the data it learned from. If historical hiring data encoded a bias, a model trained on it will reproduce and often amplify that bias while wearing the costume of mathematical objectivity, a failure mode the fairness concept in this curriculum treats in depth. And a model can be too good at its training data: overfitting means it has memorized the past, including its noise, and performs worse on genuinely new cases than a simpler model would. The numbers on a slide can look excellent while the system is quietly brittle.

There is also a slower failure that catches organizations off guard: the world the model learned from drifts away from the world it now operates in. A demand forecast tuned on pre-pandemic buying collapses when behavior shifts; a fraud model trained on last year's schemes misses this year's; a credit model calibrated in a low-rate environment misreads a high-rate one. The model does not announce this. Its accuracy decays silently while it keeps producing confident-looking outputs, which is why mature operations monitor live performance and retrain on a schedule rather than treating a deployed model as finished. The drift concept in this curriculum is devoted to exactly this problem, because it is the single most common reason a system that passed every test at launch becomes a liability a year later.

For an executive, the practical posture is to recognize what you already own before chasing what is new. The questions that travel well into any review are roughly these: what data did this model learn from, and does that data still resemble the world we operate in today; what exactly is it predicting, and what does it cost us when it is wrong in each direction; who is accountable for monitoring it, and when was it last retrained; can we explain a given decision to a regulator, a customer, or a court if asked. None of these require knowing the mathematics. They require treating a predictive model as a living component with an owner and a maintenance schedule, not a one-time purchase. The generative tools dominating the conversation are real and important, and the rest of this curriculum covers them in depth — but the machine learning already woven through your operations is where the most decisions are quietly being made today, and it deserves the same scrutiny.

Citations

Where this comes from.

Related concepts