Everyone is talking about ChatGPT, Claude, and the latest AI models. If you've spent any time around AI recently, you've probably used one, or at least heard of them. They feel like magic: you type something, they respond like a person.

But here's what most people don't realize: those tools are not where you start if you actually want to understand AI. They're the top floor of a tall building. And NLP (Natural Language Processing) is one of the most important floors, the foundation.

This post is about that foundation. And if you're serious about learning AI, reading it puts you ahead of most people who are currently trying to learn by poking at ChatGPT and hoping something sticks.

What Is Natural Language Processing?

NLP stands for Natural Language Processing.

Break it down:

  • Natural Language: the everyday language humans use, English, Spanish, Korean, any spoken or written language.
  • Processing: getting a computer to read, analyze, and make sense of that language.

Together, NLP is the branch of AI dedicated to teaching machines to understand human language, not just match keywords, but grasp meaning, context, intent and reason over it.

When you type a question into Google and it knows what you meant even if you phrased it badly, that's NLP. When Gmail suggests how to finish your sentence, NLP. When a spam filter catches a suspicious email you never asked it to look for, still NLP.

You've been benefiting from it for years without knowing its name.

NLP Came Before ChatGPT, and That Matters

Here's the thing nobody tells you when you start learning AI. Before we go further, there's something worth pausing on, because it almost never gets mentioned. NLP goes back to the 1950s. The chatbots and LLMs everyone talks about today only emerged in the 2020s. That's roughly 70 years of research, breakthroughs, and real-world application that most people skip entirely. To me, that's a tragedy.

Generative AI vs NLP New vs Old

Google's search engine has been running reliably for over two decades. The reason? NLP. Not LLMs. To this day, NLP is what powers search, not the large language models that dominate the headlines. That alone should tell you something about how foundational and battle-tested this field really is.

ChatGPT, Claude, Gemini, these are called large language models, or LLMs. They are extraordinary tools. But they are also the most advanced, most complex expression of decades of AI research. Jumping straight to them as your entry point into AI is like deciding to learn cooking by staging at a three-Michelin-star restaurant.

You might manage to copy one dish by following instructions closely. But the moment something goes wrong (or you need to adapt) you'll be completely lost. You don't know why anything works. You just know that sometimes it does.

NLP is where the real understanding begins. It covers the concepts, techniques, and mental models that LLMs are built on top of. How does a machine understand the meaning of a word? How does it know that "bank" means something different in "river bank" versus "bank account"? How does it process a sentence, not just a list of words? These questions are NLP questions. And until you can answer them, you're working with AI on borrowed time.

NLP in AI and Machine Learning: How They Connect

NLP sits inside a larger family of technologies. Here's a simple way to think about how they relate:

TermWhat It Means
AI (Artificial Intelligence)The broad field of building machines that do intelligent things
Machine Learning (ML)A subset of AI where machines learn from data rather than explicit rules
NLP (Natural Language Processing)A subset of AI focused specifically on understanding language
LLM (Large Language Model)A type of NLP model trained on vast amounts of text, what powers ChatGPT, Claude, etc.

Think of nesting dolls: AI contains machine learning, machine learning contains NLP, and NLP is what powers the LLMs everyone talks about today.

NLP in machine learning specifically refers to how modern NLP works, not by writing rules ("if the sentence contains 'not', reverse the sentiment"), but by training models on enormous datasets and letting them learn the patterns themselves. This shift is why language AI improved so dramatically over the past decade.

How NLP Actually Works: A Basic Example

You don't need to write code to understand this. Here's an example of what happens under the hood when an NLP system reads a sentence.

Tokenization

The first step is breaking text into smaller units called tokens, usually words or parts of words.

"I love pizza" → ["I", "love", "pizza"]

This gives the system something concrete to analyze, piece by piece.

Parsing: Structure and Grammar

Next, the system looks at how words relate to each other. Which word is the subject? What's the verb? What is being modified? Understanding sentence structure is what separates language processing from simple keyword matching.

Meaning and Context

This is where it gets interesting. Words change meaning depending on context. "Sick" means ill in one sentence and impressive in another. NLP models learn to use surrounding words and context to resolve that ambiguity, something humans do automatically, but machines have to learn explicitly.

Output

Once the system understands the input, it produces a result: a classification, a translation, a summary, an answer. This output is the visible product of everything happening in the steps before it.

Important Note: This is just an orientative example, there are tons of NLP techniques from older to most modern powered with transformers (without them being LLMs) in future blogs we will cover this techniques such as: text preprocessing, normalization, morphology, tokenization, n-grams, corpus analysis techniques, syntactic structures, retrievers... yes, there is 70 years of amazing stuffs stacked.

Core NLP Tasks (and Where You've Already Seen Them)

NLP is not one single thing, it can be divided in different language tasks to achieve various purposes. Here are the most important ones:

Sentiment Analysis Reads text and determines its emotional tone, positive, negative, or neutral. Used everywhere from product review analysis to monitoring brand mentions on social media.

Named Entity Recognition (NER) Identifies specific entities in text: names of people, companies, locations, dates. A system reading a news article can automatically extract that "Apple" is a company, "Tim Cook" is a person, and "Cupertino" is a location.

Text Classification Assigns a category to a piece of text. Spam filters use it. Customer support routing uses it. Any time content needs to be sorted automatically, classification is likely involved.

Text Summarization Compresses a long document into a shorter version while preserving the key information. This is the same capability that lets AI tools summarize a 30-page report in seconds.

Machine Translation Converts text from one language to another, the backbone of tools like Google Translate.

Question Answering Enables a system to find and return an answer to a natural language question from a document or knowledge base.

These aren't just academic exercises. Every one of them is in active production use right now, in enterprise software, consumer apps, and increasingly in AI-powered pipelines built on top of NLP foundations.

Why NLP Is Still Extremely Relevant Today

There's a tendency in tech to treat everything that predates the previous model as obsolete. That's a mistake, especially with NLP.

LLMs are powerful, but they are not always the right tool. They are slow, expensive to run, and often overkill for tasks that a targeted NLP model handles efficiently and reliably. Sentiment analysis on ten thousand customer reviews? A classification pipeline for support tickets? Entity extraction from legal documents? These are problems NLP solves cleanly, at scale, right now.

Practitioners who understand NLP can build production-grade systems that are fast, cost-effective, and explainable. They can also work with LLMs far more effectively, because they understand what those models are actually doing internally.

The people who dismiss NLP as "the old stuff" are usually the same ones who hit a wall six months into their AI journey, because they built on top of a tool they never truly understood.

Real-World Applications of NLP

To make this concrete, here's where NLP is doing serious work across industries today:

  • Healthcare: Extracting structured information from clinical notes and medical records so doctors spend less time on documentation.
  • Finance: Scanning earnings reports, news, and filings for signals that affect investment decisions.
  • Customer service: Routing and prioritizing support tickets, powering chatbots that handle the majority of volume before a human is needed.
  • Legal: Reviewing contracts and flagging clauses that don't match standard terms.
  • E-commerce: Analyzing product reviews to surface quality issues or common customer complaints.
  • Media & publishing: Automatically tagging, categorizing, and summarizing content at scale.

None of these require a frontier LLM. They require solid NLP, and someone who knows how to apply it.

Key Takeaways

  • NLP stands for Natural Language Processing, the field of AI focused on teaching computers to understand human language.
  • It is the foundation that LLMs like ChatGPT and Claude are built on. Skipping it means building on ground you don't understand.
  • Modern NLP is powered by machine learning: models trained on data, not hand-written rules.
  • Core techniques (sentiment analysis, classification, NER, summarization, translation) are in active production use across industries.
  • Mastering NLP gives you a genuine edge: over those who only know how to prompt, and over those who think foundations don't matter.

Frequently Asked Questions

What does NLP stand for?
NLP stands for Natural Language Processing, the field of AI that deals with understanding and generating human language.
What is NLP in machine learning?
In machine learning, NLP refers to models trained on text data to perform language tasks like classification, translation, or summarization, rather than relying on hand-coded rules.
What is the difference between NLP and an LLM?
NLP is the broader field. An LLM (Large Language Model) is a specific, modern type of NLP model trained on massive datasets. All LLMs use NLP, but NLP encompasses much more than just LLMs.
Is NLP still useful now that LLMs exist?
Yes. NLP techniques are faster, cheaper, and more interpretable than LLMs for many tasks. They're widely used in production systems today. Understanding NLP also makes you a significantly better practitioner when working with LLMs.
Do I need to know math or coding to learn NLP?
Not to understand the concepts. To implement NLP in practice, basic Python knowledge is helpful, but you can build a solid conceptual foundation (like this post does) without writing a single line of code.
What is NLP used for in real life?
Spam filters, search engines, voice assistants, translation tools, sentiment analysis, chatbots, document summarization, and much more. If you've used a smartphone today, you've used NLP.
Javier Aguirre

Javier Aguirre

Teaching AI from production reality, not hype.

Build your AI Foundations

Get the free checklist that production engineers use to avoid these common RAG pitfalls.

Get the Free Guide