The definitive AI Glossary of Terms

The AI technology space is expanding rapidly and with that a whole new vocabulary needs to be learned. We have gathered many of those that we come across on a frequent basis.

Glossary

A

Azure Search: A cloud-based search-as-a-service solution provided by Microsoft Azure. It allows developers to integrate sophisticated search capabilities into applications and websites.

Azure Cognitive Search: An evolution of Azure Search, this service combines the core search capabilities with Azure Cognitive Services. It offers AI-powered content understanding, enabling richer search experiences by extracting insights from the data. Azure Cognitive Search - Vector Search it's in Private Preview, allowing you to do a vector search or hybrid search (vector and keyword retrieval).

Azure Cognitive Services: A collection of AI services and APIs offered by Microsoft Azure, covering domains like vision, speech, language, and decision-making.

Agentic AI: Systems with autonomy and goal-directed behavior capable of making independent decisions, planning, and adapting to achieve specific objectives without direct, ongoing human input.

AI Ethics: The study of ethical considerations and the development of guidelines and principles related to the design, implementation, and use of artificial intelligence systems.

AI Fairness: The development and evaluation of AI systems to ensure that they treat all individuals and groups fairly, without discrimination or bias, and promote equal.

AI Orchestration: The process of coordinating and managing various AI models and plugins to deliver enhanced user experiences.

Application programming interface (API): Intermediary software components that allow two applications to talk to each other; a structured way for AI systems to programmatically access (usually external) models, data sets, or other pieces of software.

Artificial General Intelligence (AGI): An AI system that can perform any intellectual task that a human being can do, possessing a broad range of cognitive abilities and understanding.

Autonomous Systems: AI-powered machines or devices that can operate independently, without human intervention, to make decisions and perform tasks based on their programming and learned experience.

Azure OpenAI Service: The cloud service where the LLMs used for Microsoft 365 Copilot are hosted.

B

Bias: The presence of systematic errors in AI models, often due to biases in the training data, which may lead to unfair or discriminatory outcomes.

Big data: Large data sets that can be studied to reveal patterns and trends to support business decisions. It’s called “big” data because organizations can now gather massive amounts of complex data using data collection tools and systems.

Business Chat: A feature in Microsoft Teams that, with Copilot, can bring together information from multiple sources to update users on topics or events they might have missed.

C

Chatbot: A computer program designed to simulate conversation with human users, typically using text or voice-based interfaces.

Composite AI: Helps create more robust, ethical, and efficient AI systems by combining different approaches to achieve better results while mitigating some of the risks and limitations that can come with using a single AI model.

Computer Vision: A field of artificial intelligence that teaches computers to interpret and understand visual information from the world, such as images, videos, and real-time camera feeds.

Connectors: Tools provided by Semantic Kernel that enable developers to integrate AI services into their applications. They help in adding memories and models to apps.

Context Engineering: The practice of structuring inputs and environments so that AI systems can interpret information accurately and perform effectively. It involves curating language, metadata, prompts, environmental signals, workflows, and user interfaces to “speak” to AI in a way that maximizes relevance and minimizes ambiguity.

D

DALL-E: An AI model developed by OpenAI for generating images from textual descriptions.

Deep Learning: A subset of machine learning that involves training artificial neural networks to recognize patterns in data and make predictions or decisions.

E

Embeddings: Embeddings are a type of word representation that captures the semantic meaning of words based on their context in a high-dimensional space. In machine learning and natural language processing, embeddings transform discrete categorical variables (like words) into continuous vectors of fixed dimensions. These vectors capture semantic relationships between words, meaning that words with similar meanings tend to have vectors that are close to each other in this space.

Embeddings (Semantic Kernel): Representations of data in a format suitable for machine learning. In the context of Semantic Kernel, embeddings can be used to give apps "memories".

F

Feature Engineering: The process of selecting, transforming, and creating relevant features or attributes from raw data to improve the performance of machine learning models.

Foundation models: Deep learning models trained on vast quantities of unstructured, unlabeled data that can be used for a wide range of tasks out of the box or adapted to specific tasks through fine-tuning. Examples of these models are DALL-E 2, GPT-4, PaLM, and Stable Diffusion.

G

GPT and GAI (Generative Artificial Intelligence): Advanced AI models and techniques used for generating content.

H

Hallucination or artificial hallucination: is a confident response by an AI that does not seem to be justified by its training data.

J

Jail break: Content restrictions have been placed on AI due to notorious incidents. Ask an AI to describe how to do something illegal or unethical, and they will refuse an answer. However, it's possible to "jailbreak" them – which means to bypass those safeguards using creative language, hypothetical scenarios, and trickery.

K

k Nearest Neighbors (kNN): An algorithm used in vector search to find the most similar items to a given query in the vector space.

Knowledge graph: Also known as Semantic Networks. This helps machines understand how concepts are related.

L

Large Language Models (LLMs): Advanced AI models capable of understanding and generating human-like text based on the input they receive.

M

Machine Learning: A subset of artificial intelligence that involves training computer algorithms to learn patterns and relationships in data, allowing them to make predictions or decisions without being explicitly programmed to do so.

Memory: In the context of Semantic Kernel, memory refers to the capability to retrieve information from past interactions or data. It can be simulated in AI apps using specialized plugins.

Microsoft 365 Copilot: An AI-powered productivity tool that integrates with Microsoft Graph and Microsoft 365 Apps. It offers real-time intelligent assistance to enhance user creativity, productivity, and skills.

Modality: A high-level data category such as numbers, text, images, video, and audio.

Multimodal capabilities: The ability of an AI system to process and generate various types of data (text, images, audio, video) simultaneously, enabling complex tasks and rich outputs.

N

Native Functions: Functions that allow the kernel to call C# or Python code directly. They can be used to manipulate data or perform operations that are not suitable for LLMs, such as calculations.

Natural Language Processing (NLP): A field of artificial intelligence that focuses on the interaction between computers and human language, enabling machines to understand, interpret, and generate human language.

Neural Network: A computing model inspired by the structure and function of the human brain, consisting of interconnected artificial neurons that process information and learn from data.

O

OpenAI Plugin Specification: A standard adopted by Microsoft for plugins to ensure interoperability across major AI apps and services like ChatGPT, Bing, and Microsoft 365.

P

Planner: A tool within Semantic Kernel that can automatically create chains or sequences of actions to meet specific user needs. It does this by combining various plugins loaded into the kernel.

Plugins: Components that allow applications to interact with the real world. They consist of prompts and native functions that can respond to triggers and perform actions.

Prompts: User queries or questions that Microsoft 365 Copilot responds to using the sophisticated map.

Prompt Engineering: The process of designing and optimizing prompts to get desired outputs from AI models.

R

Reasoning AI: AI systems that perform logical thinking, step-by-step planning, problem solving, and decision making using structured or unstructured data, going beyond pattern recognition to draw conclusions and solve
complex problems.

Reinforcement Learning: A type of machine learning where an AI agent learns to make decisions by interacting with an environment and receiving feedback in the form of rewards or penalties.

Retrieval Augmented Generation (RAG): An architecture that combines the capabilities of retrieval-based and generative models for tasks like question answering.

S

Semantic Functions: Represent the "ears" and "mouth" of the AI app, allowing it to listen to user requests and respond with natural language. These functions connect to the "brain" of the AI app through connectors.

Semantic Index for Copilot: A tool designed to help organizations prepare their data and users for Copilot by creating a sophisticated map of user and corporate data. Formed by encoding and indexing keyword searches into a vector that combines phrases, meanings, relationships, and context of the data. This map aids Microsoft 365 Copilot in understanding more about an organization.

Semantic Kernel: An open-source SDK that allows the combination of AI services like OpenAI, Azure OpenAI, and Hugging Face with conventional programming languages such as C# and Python. It facilitates the creation of AI apps that merge the capabilities of AI and traditional programming.

Superagency: A state where individuals, empowered by AI, amplify their creativity, productivity, and positive impact. Even those
not directly engaging with AI can benefit from its broader effects on knowledge, efficiency, and innovation.

Supervised Learning: A machine learning task where an algorithm is trained on a labeled dataset, with input-output pairs, to learn the relationship between inputs and outputs.

Synthetic data is information that's artificially generated rather than produced by real-world events.

T

Transfer Learning: A technique in machine learning where an AI model, pre-trained on one task or dataset, is fine-tuned or adapted to perform a different but related task.

U

Unsupervised Learning: A machine learning task where an algorithm is trained on an unlabeled dataset, without input-output pairs, to discover patterns or structures in the data.

V

Vector: A mathematical representation that combines phrases, meanings, relationships, and context of data.

Vector Database: A database designed to store and manage vector data, which can be used in conjunction with embeddings.

Vectorization: The process of converting data into vector form. In the context of this article, it refers to the transformation of content (like text or images) into vector representations.

Vector Search: A method of information retrieval where documents and queries are represented as vectors instead of plain text. This allows for similarity search, multi-modal search, recommendation engines, and applications implementing the Retrieval Augmented Generation (RAG) architecture.

Glossary of Terms

A

B

C

D

E

F

G

H

J

K

L

M

N

O

P

R

S

T

U

V

You may also be interested in...

The role metadata plays in findability, discoverability and AI

Topics

Glossary of Terms

A

B

C

D

E

F

G

H

J

K

L

M

N

O

P

R

S

T

U

V

You may also be interested in...

Stay in touch!

The role metadata plays in findability, discoverability and AI

Topics

Related posts

Knowledge is the Sweet Spot for AI

The importance of AI governance: compliance and ethical use of AI

Context Engineering in AI and Gen AI: The Key to Better Results