Key question
What do people actually mean when they say AI?
A one-page curriculum for understanding AI from the top down: the core hierarchy, modern model capabilities, and the OpenAI platform terms teams now need for GPT-5.5, Responses API, agents, tools, and realtime AI.
The 2026 AI Guide supports corporate AI training in Switzerland — from foundation models and Generative AI to Agentic AI, MCP, and governance patterns used in AI Workshop programs.
Chapter 1
Start with the umbrella idea of AI, then narrow into the child layers that make modern systems possible.
By the end of this chapter
You should understand the nested relationship between AI, Machine Learning, Deep Learning, Neural Networks, and Foundation Models.
From Turing machines to Transformer models.
Where it sits in the stack
AI Overview
Key question
What do people actually mean when they say AI?
What to remember
AI is the broad field. Everything else in this guide is a more specific layer inside it.
Artificial Intelligence is the broad discipline of creating intelligent machines. It is the umbrella term that encompasses everything from simple rule-based systems to the complex Large Language Models of today.
Theory 01
To understand AI, visualize concentric circles. The outermost circle is Artificial Intelligence-the grand vision. Inside that is Machine Learning-the technique of learning from data. Inside that is Deep Learning-using neural networks. And at the cutting edge, we find Generative AI-models that create.
Theory 02
We classify AI capability into three stages:
Theory 03
It is the convergence of three factors:
Theory 04
Estimates vary wildly from a few years to a few decades. The rapid progress of LLMs has accelerated these timelines, but significant hurdles in reasoning and physical world understanding remain.
Computers that learn from data and improve with experience.
Where it sits in the stack
AI Overview -> Machine Learning
Key question
How does a machine get better without being reprogrammed for every case?
What to remember
Machine Learning is the learning engine inside AI, powered by data rather than explicit instructions for every outcome.
Machine Learning is a subset of Artificial Intelligence (AI) where computers learn from data and improve through experience without being explicitly programmed. Algorithms are trained to find patterns and correlations in large datasets to make the best decisions and predictions. With practice and more data, these applications become increasingly accurate.
Theory 01
Think of them as concentric circles. Artificial Intelligence is the broad discipline. Machine Learning is a subset within AI that allows machines to learn from data. Inside Machine Learning is Deep Learning, and within that are Artificial Neural Networks. AI processes data to make decisions; ML algorithms allow AI to learn and get smarter without additional programming.
Theory 02
Artificial Neural Networks mimic the biological brain, with nodes (neurons) grouped in layers working in parallel. They strengthen connections to improve pattern recognition.
Deep Learning involves many layers of these networks and huge volumes of complex data. It extracts features hierarchically: a system might recognize a plant in the first layer, a flower in the next, and a yellow daisy in the last.
Theory 03
Theory 04
Theory 05
Bias and Spurious Correlations: Models can learn incorrect associations (e.g., correlating margarine consumption with divorce rates) if the data is flawed or if they find coincidental patterns.
The Black Box Problem: Complex models like deep neural networks can be difficult to interpret. It is often unclear how or why a specific decision was made, which poses risks in critical fields.
AI is the broader concept of machines acting intelligently. Machine Learning is a specific subset of AI where machines learn from data to improve their performance without being explicitly programmed for every task.
Unlocking the power of high-dimensional data.
Where it sits in the stack
AI Overview -> Machine Learning -> Deep Learning
Key question
Why did modern AI become dramatically more capable?
What to remember
Deep Learning is scaled Machine Learning built on many layers, which makes it strong on high-dimensional and unstructured data.
Deep Learning is a specialized subset of Machine Learning inspired by the structure of the human brain. It uses multi-layered neural networks to learn from vast amounts of unstructured data like images, audio, and text.
Theory 01
The "Deep" in Deep Learning refers to the number of layers in the neural network. Traditional neural networks might have 2-3 layers. Deep learning models can have hundreds or thousands. This depth allows the model to learn a hierarchy of features-from simple edges and textures to complex shapes and objects.
Theory 02
In traditional ML, humans had to manually select features (e.g., "does this image have ears?"). In Deep Learning, the network performs automatic feature extraction. It learns what features are important directly from the raw pixels or text.
Theory 03
Deep Learning is the technology behind self-driving cars, voice assistants, facial recognition, and the recent generative AI boom. It thrives on scale-more data and more compute usually lead to better performance.
Theory 04
Deep Learning is the technology behind self-driving cars, voice assistants, facial recognition, and the recent generative AI boom. It thrives on scale-more data and more compute usually lead to better performance.
Deep Learning is essentially the use of *deep* neural networks. So all Deep Learning involves neural networks, but not all neural networks are 'deep' (though in modern context, the terms are often used interchangeably).
The mathematical architecture of the mind.
Where it sits in the stack
AI Overview -> Machine Learning -> Deep Learning -> Neural Networks
Key question
What is the actual mechanism doing the learning inside Deep Learning?
What to remember
Neural Networks are the mathematical structure that powers Deep Learning and makes modern model scaling possible.
Artificial Neural Networks (ANNs) are computing systems vaguely inspired by the biological neural networks that constitute animal brains. They are the fundamental building blocks of Deep Learning.
Theory 01
A neuron takes multiple inputs, multiplies them by weights (importance), adds a bias (threshold), and passes the result through an activation function (non-linearity). If the signal is strong enough, the neuron "fires" and passes information to the next layer.
Theory 02
Theory 03
They learn through a process called Backpropagation. The network makes a guess, compares it to the actual answer to calculate the loss (error), and then works backward to adjust the weights to minimize that error. This is repeated millions of times.
It's a mathematical function (like ReLU or Sigmoid) attached to each neuron that decides whether it should be activated. It introduces non-linearity, allowing the network to learn complex patterns.
Neural networks can be so complex that even their creators don't fully understand how they arrive at a specific decision. This lack of interpretability is a major challenge in high-stakes fields like medicine.
One model, infinite applications.
Where it sits in the stack
AI Overview -> Machine Learning -> Deep Learning -> Foundation Models
Key question
Why do a few large models now power so many different tasks?
What to remember
Foundation Models are general-purpose engines trained at massive scale and adapted to many downstream uses.
A Foundation Model is a large-scale AI model trained on broad data that can be adapted to many downstream tasks. Modern foundation models are increasingly multimodal, tool-capable, and used as reasoning engines inside assistants, APIs, agents, and enterprise workflows.
Theory 01
It must be broadly capable. Unlike previous models designed for one narrow task, a foundation model can draft, analyze, code, translate, summarize, interpret images, use tools, and adapt to many product experiences through prompting, retrieval, fine-tuning, or orchestration.
Theory 02
Foundation models exhibit emergence-capabilities that were not explicitly trained for. For example, a model trained simply to predict the next word in a sentence might emerge with the ability to translate languages, write code, or solve logic puzzles.
Theory 03
The lifecycle involves two stages:
Theory 04
Prominent model families include GPT-5.5 and other GPT models from OpenAI, Claude from Anthropic, Gemini from Google DeepMind, Llama from Meta, Mistral models, and specialist image, audio, and realtime models. The important point is no longer only the model name, but the surrounding platform: tools, state, retrieval, structured outputs, governance, and agent orchestration.
Prominent model families include GPT-5.5 and other GPT models from OpenAI, Claude from Anthropic, Gemini from Google DeepMind, Llama from Meta, Mistral models, and specialist image, audio, and realtime models. The important point is no longer only the model name, but the surrounding platform: tools, state, retrieval, structured outputs, governance, and agent orchestration.
LLMs (Large Language Models) are a *type* of foundation model focused on text. But foundation models can also be multimodal, handling images, audio, and video.
Chapter 2
Once the stack is clear, you can understand how modern systems generate content and who is shaping that layer.
By the end of this chapter
You should be able to explain what Generative AI is, why it depends on foundation models, and how major model providers differ.
From analysis to creation.
Where it sits in the stack
AI Overview -> Machine Learning -> Deep Learning -> Foundation Models -> Generative AI
Key question
How does modern AI move from analysis into creation?
What to remember
Generative AI is the creation layer. It uses modern models to produce new text, images, code, audio, and more.
Generative AI refers to algorithms that can create new content-including audio, code, images, text, simulations, and videos. Unlike traditional AI which classifies or predicts, Generative AI produces novel outputs.
Theory 01
Traditional AI is Discriminative: It draws a line to separate data (e.g., "Is this a cat or a dog?"). Generative AI is Creative: It learns the distribution of the data to create new examples (e.g., "Draw me a cat that never existed").
Theory 02
Generative models, such as Diffusion Models (for images) or Transformers (for text), learn the underlying structure of the training data. They then use probability to assemble new patterns that follow those structures but are not identical copies.
Theory 03
Generative AI is democratizing creativity. It acts as a co-pilot for writers, artists, coders, and designers, allowing them to iterate faster and explore new ideas. It is shifting the bottleneck from "skill" to "imagination".
Theory 04
This is a complex legal and ethical issue. Models learn from existing data, but they don't 'copy-paste'. They learn styles and concepts. However, the rights of the original creators of the training data are a subject of active debate and litigation.
The model providers, platforms, and product ecosystems shaping applied AI.
Where it sits in the stack
AI Overview -> Machine Learning -> Deep Learning -> Foundation Models -> LLM Players
Key question
Who is shaping the model layer that everyone builds on?
What to remember
Model providers are not interchangeable. Each one makes different tradeoffs in openness, performance, integration, and control.
The Large Language Model landscape has moved beyond simple chatbot comparison. Teams now compare model quality, reasoning capability, tool use, privacy controls, multimodal support, agent orchestration, and ecosystem fit.
Theory 01
Theory 02
It depends on the workflow. GPT-5.5 is a strong OpenAI choice for production work that needs reasoning, tool use, coding, and polished output. Claude, Gemini, Llama, Mistral, and other models may fit better when the priority is context handling, ecosystem integration, local control, price, or procurement requirements.
Chapter 3
After the model layer, the next question is how people actually use AI in real work: tools, assistants, agents, realtime experiences, and hands-on practice.
By the end of this chapter
You should understand how AI becomes tools, how to compare products, and why current OpenAI language now includes hosted tools, tool search, MCP, Agents SDK, and Realtime sessions.
From chat interfaces to agentic workflows.
Where it sits in the stack
AI Overview -> Machine Learning -> Deep Learning -> Foundation Models -> Generative AI -> AI Tools
Key question
How do models become useful products in everyday work?
What to remember
AI tools are the interface layer between advanced models and real human tasks.
AI tools are the practical interface between models and work. In 2026, the important shift is from standalone chat prompts to governed workflows with retrieval, structured outputs, hosted tools, MCP connectors, realtime voice, and human approval points.
Theory 01
Theory 02
AI is more likely to change workflows before it fully replaces roles. The practical skill is knowing how to delegate, verify, and govern AI-assisted work.
A curated map of current AI product categories.
Where it sits in the stack
AI Overview -> Machine Learning -> Deep Learning -> Foundation Models -> Generative AI -> AI Tools -> AI Tools Directory
Key question
How do you evaluate the crowded tool market without getting lost?
What to remember
The right tool choice depends on workflow, privacy, integration, and the model sitting underneath the interface.
Navigating AI tools is easier when you organize them by job to be done: assistant, search, agent, code, creative, voice, automation, and enterprise knowledge.
Theory 01
Theory 02
Theory 03
Theory 04
Learn by doing.
Where it sits in the stack
AI Overview -> Machine Learning -> Interactive Exercises
Key question
How do abstract AI ideas become intuition instead of memorized jargon?
What to remember
Interactive learning is the shortest path from theory to intuition.
Theory is essential, but practice makes perfect. These interactive simulations and games are designed to build your intuition for how AI systems actually work.
Theory 01
AI concepts like "Gradient Descent" or "Backpropagation" can be abstract and mathematical. Interactive visualizations allow you to see the math in action, building a deeper, intuitive understanding.
Theory 02
Chapter 4
Finish by reconnecting AI to people: human intelligence, augmented decision-making, and the vocabulary needed for clear conversations.
By the end of this chapter
You should be able to discuss AI with more precision, better judgment, and a shared vocabulary across teams.
IQ, EQ, and AI shape our decision-making.
Where it sits in the stack
AI Overview -> Intelligence
Key question
What still belongs to humans in an AI-shaped world?
What to remember
The strongest future is not human or machine alone. It is augmented intelligence with clear human oversight.
Understanding how human intelligence, artificial intelligence, and augmented intelligence complement each other is key to navigating the future. Intelligence is not just about processing power; it's about the synergy between biological and synthetic cognition.
Theory 01
We can categorize intelligence into three distinct forms that interact in the modern world:
| Feature | Humans | Machines | Augmented Intelligence |
|---|---|---|---|
| Data Handling | Understand and generalize concepts | Process and analyze large volumes of data | Combine context with data-driven insights |
| Repetition | Prone to fatigue | Perform repetitive tasks with high accuracy | Automate tasks while preserving human oversight |
| Creativity | Flexible problem-solving | Limited creative capacity | Enhance human creativity with smart tools |
| Emotional Insight | Empathy & customer care | No emotional understanding | Human-led empathy, supported by smart assistance |
Theory 02
Definition: The ability of machines to think and reflect like humans, attempting to replicate human intelligence with machines.
Capabilities:
Characteristics:
Theory 03
"Why aren't we more compassionate?" - Daniel Goleman
Emotional Intelligence operates through the basal ganglia (the wisdom center of the brain). It guides decisions based on emotional valence (what felt good/bad). Unlike the neocortex, it does not speak in words but is connected to emotions and the gut.
Key Insight: Combine EQ + IQ for better decisions.
Augmented Intelligence is a design pattern for a human-centered partnership model of people and AI working together to enhance cognitive performance, including learning, decision making, and new experiences.
Augmented Intelligence is a design pattern for a human-centered partnership model of people and AI working together to enhance cognitive performance, including learning, decision making, and new experiences.
Speak the language of the future.
Where it sits in the stack
AI Overview -> AI Concepts & Terminology
Key question
Which terms should everyone use consistently after reading this guide?
What to remember
A strong AI guide should end with alignment: people leaving with the same words for the same concepts.
The field of AI is filled with jargon. This dictionary provides clear, concise definitions for the most important terms you need to know.
Theory 01
Agent: A system that can plan, use tools, manage state, and execute multi-step work toward a goal.
Algorithm: A set of rules, calculations, or procedures used by software or models to process information.
Alignment: The work of making AI behavior match human intent, values, and product expectations.
Bias: Systematic errors in data, design, deployment, or outputs that can unfairly affect results.
Computer use: A tool pattern where a model can interact with a computer interface to complete UI-based tasks.
Theory 02
File search: A retrieval tool that lets a model search uploaded or connected files for relevant context.
Fine-tuning: The process of adapting a pre-trained model on a targeted dataset for a more specific behavior or domain.
Function calling: A way to expose custom software actions to a model with typed inputs and predictable outputs.
Hallucination: When an AI generates incorrect or unsupported information confidently.
Hosted tools: OpenAI-provided tools such as web search, file search, code interpreter, image generation, and computer use.
LLM (Large Language Model): A model specialized in understanding and generating language.
Theory 03
MCP (Model Context Protocol): A protocol for connecting models and agents to external tools, systems, and data sources.
Multimodal: AI that can work across multiple input or output types, such as text, images, audio, video, and structured data.
Realtime API: OpenAI sessions for low-latency voice agents, translation, transcription, and live multimodal experiences.
Reasoning effort: A model setting that controls how much reasoning budget is used for supported models.
Responses API: The OpenAI API surface for model responses, tools, multimodal input, state, and agentic workflows.
Structured Outputs: A way to make model output follow a defined schema for reliable downstream use.
Token: A unit of text or data processed by a model.
Tool search: A pattern for loading only the relevant tool definitions when a tool catalog is large.
File search: A retrieval tool that lets a model search uploaded or connected files for relevant context.
Fine-tuning: The process of adapting a pre-trained model on a targeted dataset for a more specific behavior or domain.
Function calling: A way to expose custom software actions to a model with typed inputs and predictable outputs.
Hallucination: When an AI generates incorrect or unsupported information confidently.
Hosted tools: OpenAI-provided tools such as web search, file search, code interpreter, image generation, and computer use.
LLM (Large Language Model): A model specialized in understanding and generating language.
These are the extra concepts from the workshop guide that help people connect the model stack to what they actually see in products today.
A Large Language Model is a very large model specialized in understanding and generating language. In practice, it acts like a text engine that can summarize, draft, explain, translate, and reason over documents.
Think of someone who has read a vast library and can help you write, explain, or compare ideas on demand.
The Responses API is the OpenAI API surface designed for model responses, multimodal input, tool use, multi-turn state, and agentic workflows. For teams building with current OpenAI models, it is the vocabulary to learn before older chat-only patterns.
A multimodal system can understand more than one type of input, such as text, images, audio, or video, and combine them in one answer.
You upload a photo of a chart and ask a question in text. The model uses both the image and the language prompt to answer.
An AI agent is a system that can plan steps, use tools, execute tasks, and sometimes coordinate with other agents to reach a goal. In OpenAI's current platform language, the Agents SDK adds patterns for orchestration, tracing, handoffs, guardrails, human review, and state management.
Realtime sessions keep a live connection open while an app streams audio, receives events, calls tools, and updates state. Use them when a product needs interactive voice or multimodal behavior, not just a single request and response.
A machine doing tasks that normally require human intelligence.
Use: Used to understand, decide, recommend, and create.
Example: Spam filters, translation, recommendations.
The part of AI that learns from examples instead of following only fixed rules.
Use: Used to predict, classify, and segment.
Example: Spam or not spam, price estimation.
A large model specialized in understanding and generating text.
Use: Used to summarize, draft, explain, and translate.
Example: ChatGPT, Claude, Gemini.
A current OpenAI model family designed for strong reasoning, tool-heavy agents, coding, grounded assistants, and polished customer-facing workflows.
Use: Used when quality, execution, and reliability matter more than the fastest possible turn.
Example: A customer support agent that searches, reasons, drafts, and cites evidence.
The OpenAI API layer for model responses, multimodal input, tools, state, and agentic workflows.
Use: Used to build current OpenAI applications instead of relying only on older chat-only patterns.
Example: A workflow that uses file search, structured output, and previous_response_id.
A setting that controls how much reasoning budget a supported model uses.
Use: Use low or medium for many production workflows, and increase only when evals justify the cost and latency.
Example: Set higher effort for complex legal review, lower effort for simple classification.
A way to make model output follow a JSON schema or exact structure with stronger validation.
Use: Use it when downstream software depends on reliable fields, not prose.
Example: Return a CRM lead object with company, role, budget, urgency, and next action.
A system that plans steps, uses tools, and executes a task with state, review rules, and traceable actions.
Use: Used to automate multi-step workflows where a model needs tools or handoffs.
Example: Research + file search + spreadsheet analysis + summary + approval request.
A standard for connecting models to tools and external data sources.
Use: Lets AI fetch context or act through tools.
Example: A remote MCP server that exposes CRM, calendar, or document tools.
A live connection for low-latency audio, translation, transcription, and voice-agent experiences.
Use: Used when the user experience depends on streaming events rather than a single request.
Example: A browser voice agent using WebRTC and server-side tools.
Keep these five lines in your head
AI Workshop Switzerland helps organisations move from AI experimentation to governed daily use. We combine facilitator-led training, executive coaching, and optional Develop programs for MCP-connected agents.
Our clients include corporate teams in finance, pharma, professional services, and public-sector adjacent organisations. We work with approved tools, realistic data boundaries, and human-in-the-loop review.
Explore training programs by role and maturity, read AI insights on corporate adoption in Switzerland, or contact us to compare formats. We deliver in Zurich, Geneva, Lausanne, Basel, Bern, and across the country.